Site Reliability Engineer

2 weeks ago


San Diego, California, United States Insight Global Full time
Job Description

An enterprise eCommerce client is seeking a Site Reliability Engineer to join their team. This SRE role will focus on providing direct, level one and two support to internal engineering teams. It will require collaborating with multiple global teams to ensure each customer request is addressed in a way that is reliable, secure, and supportable.

Responsibilities:
  • Build, deploy and operate a combination of open source, custom written, and vendor provided software to support the Network platform infrastructure
  • Contribute to additional automation and testing for service deployments to improve deployment processes, working towards 100% automation
  • Engage directly with engineering customers on troubleshooting requests and guiding them on solutions
  • Identify opportunities for process improvement to reduce customer queue time
  • Perform monthly service deployments for cloud platform services
  • Perform on-call duties for general troubleshooting of core services
  • Provide Tier 1/2 support for all foundational platform services

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day.

We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances.

If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to.

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy:

Skills and Requirements

  • 7-10 years professional experience operating complex system with at least 3 years at large scale
  • BS in Computer Science, Software Engineering, or equivalent experience
  • Experience building and operating various core infrastructure services (prefer experience with multiple of these or similar technologies): Cloud Networking, Certificate Management, Software Delivery, Configuration Management, DNS, Traffic Management, Identity & Access Management, Network Access Management, Observability, Remote Access Solutions, Secure Images
  • Experience in public cloud services and deployment (AWS experience preferred)
  • Strong software development experience in Python, JavaScript, or Go (Python preferred)
  • Excellent troubleshooting skills that span code, system, and network
  • Hands on experience in working with distributed systems and availability, reliability, scalability
  • Proven experience at building, deploying and operating services at scale in public cloud environments


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerJoin Qualcomm as a Site Reliability Engineer and be part of a highly collaborative team focused on provisioning and maintaining infrastructure and services with stability, sustainability, and security always on your mind.About the RoleWe are seeking a skilled Site Reliability Engineer to join our team. As a Site...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerAt Qualcomm, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and security of our infrastructure and services.Key Responsibilities:Monitor system health and detect anomaliesInvestigate and...


  • San Diego, California, United States ACL Digital Full time

    Job DescriptionDuration: 0-12 monthsJob Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at ACL Digital. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Hands-on application management and support for AWS...


  • San Diego, California, United States Onebrief, Inc Full time

    About Onebrief, Inc.Onebrief, Inc. is a cutting-edge technology company that revolutionizes military planning with its innovative all-in-one tool. Our product, Onebrief, supports both creative and process-oriented aspects of military planning, ensuring seamless and efficient decision-making.Job SummaryWe are seeking a highly skilled Site Reliability Engineer...


  • San Diego, California, United States BAE SYSTEMS Full time

    Job DescriptionAt BAE Systems, we're pushing the boundaries of innovation in the field of Site Reliability Engineering. We're seeking a highly skilled and motivated individual to join our team as a Site Reliability Engineer, where you'll play a critical role in ensuring the seamless delivery of our cloud-based solutions.Key Responsibilities:Deliver...


  • San Diego, California, United States BAE SYSTEMS Full time

    Job DescriptionAt BAE Systems, we're pushing the boundaries of innovation in the field of Site Reliability Engineering. We're seeking a highly skilled and motivated individual to join our team as a Site Reliability Engineer, where you'll play a critical role in ensuring the seamless delivery of our cloud-based solutions.Key Responsibilities:Deliver...


  • San Diego, California, United States Addison Group Full time

    Job Title: Site Reliability Engineer - Cloud ExpertAbout the Role:We are seeking a skilled Site Reliability Engineer to join our team at Addison Group. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionAt BAE Systems USA, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the seamless delivery of our cloud-based services.Key Responsibilities:Work collaboratively with cross-functional teams to design, implement, and maintain scalable and reliable...


  • San Diego, California, United States Onebrief, Inc Full time

    About Onebrief, Inc.Onebrief, Inc. is a cutting-edge technology company that specializes in developing innovative solutions for military planning and operations. Our flagship product, Onebrief, is an all-in-one tool that streamlines the planning process, enabling users to create and manage complex plans with ease.Job SummaryWe are seeking a highly skilled...


  • San Francisco, California, United States Resource Informatics Group Full time

    Job Title:Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Resource Informatics Group. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our large-scale Oracle database systems.Key Responsibilities:Administer and troubleshoot...


  • San Diego, California, United States Talent Software Services Full time

    Job Title: Site Reliability Engineer - Platform SupportJoin Talent Software Services as a Site Reliability Engineer - Platform Support and be part of a tight-knit team that operates and supports the core infrastructure foundation of our platform.About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Platform Experience Group. As...


  • San Diego, California, United States Apple Full time

    About the RoleAs a Site Reliability Engineer for Atlassian Services at Apple, you will play a critical role in ensuring the reliability and performance of our Atlassian services. You will be responsible for designing, implementing, and maintaining scalable and efficient systems that meet the needs of our customers.Key ResponsibilitiesDesign and implement...


  • San Diego, California, United States Platform Science Full time

    About UsAt Platform Science, we're revolutionizing the way businesses connect and interact with the world around them. Our open IoT platform empowers innovative fleets, application developers, and equipment providers to deliver cutting-edge solutions to supply chain professionals globally.The RoleWe're seeking a highly skilled Senior Site Reliability...


  • San Francisco, California, United States Wasmer Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions for our Edge computing platform.Key ResponsibilitiesDesign and implement scalable and reliable infrastructure...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a cutting-edge AI innovation company that empowers organizations to solve complex unstructured data problems. With a global presence and a customer-centric approach, we deliver top-tier solutions that provide unmatched advantages for everyday business operations.Job Title: Site Reliability EngineerWe are seeking a highly skilled...


  • San Leandro, California, United States Omni Inclusive Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our digital platforms.Key Responsibilities:Design, implement, and maintain scalable and reliable...


  • San Francisco, California, United States Instabase Full time

    About InstabaseAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry.With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index...


  • San Diego, California, United States Platform Science Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in San Diego, CA (or remote). As a key member of our SRE team, you will be responsible for ensuring the reliability and performance of our cloud-based platform.Key ResponsibilitiesDevelop and enhance CI/CD pipelines to streamline application deployment and...


  • San Diego, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Data Analytics team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our data analytics applications and infrastructure.Key ResponsibilitiesDesign, develop, and maintain complex data infrastructure at the...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability EngineerApollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...