Current jobs related to Senior Site Reliability Engineer - San Diego, California - Platform Science


  • San Francisco, California, United States Tampa Gardens Senior Living Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Team. As a key member of our team, you will be responsible for deploying, managing, optimizing, and upgrading the systems that run Sight Machine software.You will work closely with our Development Engineering team to ensure the stability,...


  • San Francisco, California, United States Astranis Full time

    Astranis MissionAstranis is revolutionizing global connectivity by developing the next generation of smaller, more cost-effective spacecraft. Our mission is to bridge the digital divide and connect the four billion people worldwide who lack internet access.Job SummaryWe are seeking a highly motivated and experienced Senior Site Reliability Engineer to join...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability EngineerAt Qualcomm, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, sustainability, and security of our infrastructure and services.Key Responsibilities:Monitor system health and detect anomalies to prevent service...


  • San Diego, California, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • San Diego, California, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and highly available cloud...


  • San Diego, California, United States Commserve Technologies Inc Full time

    Job Title: Site Reliability EngineerAt Commserve Technologies Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our enterprise-level applications.Key Responsibilities:Configure, architect, and maintain...


  • San Diego, California, United States Commserve Technologies Inc Full time

    Job Title: Site Reliability EngineerAt Commserve Technologies Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our enterprise-level applications.Key Responsibilities:Configure, architect, and maintain...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionAt BAE Systems USA, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the seamless delivery of our cloud-based services.Key Responsibilities:Work collaboratively with cross-functional teams to design, implement, and maintain scalable and reliable...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionBAE Systems USA is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement robust automation solutions to streamline infrastructure deployment and...


  • San Diego, California, United States Becton, Dickinson & Company Full time

    About the RoleA Site Reliability Engineering Manager at Becton, Dickinson & Company is responsible for ensuring the smooth operation of complex systems and services. They oversee a team of Site Reliability Engineers to maintain infrastructure, handle incident response, and implement continuous improvement initiatives.Key ResponsibilitiesLead a team of Site...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionAt BAE Systems USA, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the seamless delivery of our cloud-based services. Your expertise in cloud technologies, service lifecycle management, and infrastructure automation will be instrumental in driving our...


  • San Francisco, California, United States Outdefine Full time

    About the RoleWe are seeking a skilled Senior Site Reliability Engineer to join our team at Outdefine. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our blockchain-based infrastructure.Key ResponsibilitiesDesign and implement scalable and reliable infrastructure solutions for our...


  • San Diego, California, United States IntelliPro Group Inc. Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at IntelliPro Group Inc. As a Site Reliability Engineer, you will be responsible for maintaining and optimizing our robust database infrastructure, leveraging automation to ensure reliability, performance, and security. You will design scalable solutions that meet our...


  • San Francisco, California, United States Twitter Full time

    Job Summary:Twitter is seeking a Senior Site Reliability Engineer to lead a team of engineers working to keep our services reliable and scalable. The ideal candidate will have experience managing services in a distributed environment and be comfortable working with on-prem and cloud-based infrastructure.Responsibilities:Lead a team of site reliability...


  • San Francisco, California, United States WEX Full time

    Job SummaryThe WEX Site Reliability Engineering team is seeking a highly motivated and quick-learning individual to join our team as a Site Reliability Engineer Level 1. As a key member of our team, you will be responsible for ensuring the reliability, performance, and security of our systems.Key Responsibilities:Actively participate in training and...


  • San Diego, California, United States BAE Systems USA Full time

    Job DescriptionBAE Systems USA is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our Platform Engineering group, you will play a critical role in developing and deploying cutting-edge IaaS, PaaS, and SaaS solutions using the latest technologies.Key Responsibilities:Work in a team of SREs to ensure seamless, continuous...


  • San Diego, California, United States BAE SYSTEMS Full time

    Job Title: Principal Site Reliability EngineerAt BAE Systems, we are seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key member of our Platform Engineering group, you will play a critical role in developing and deploying cutting-edge technologies to support our customers' missions.Key Responsibilities:Design and implement...


  • San Jose, California, United States Hireio, Inc. Full time

    Job OverviewWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Hireio, Inc.The ideal candidate will have a strong background in software development, systems engineering, and cloud infrastructure. They will be responsible for designing, implementing, and maintaining large-scale, distributed systems that are highly available,...


  • San Jose, California, United States Triune Infomatics Inc Full time

    Role:Senior Site Reliability ManagerTriune Infomatics Inc is seeking an experienced Senior Site Reliability Manager to join our team and contribute to the design and upkeep of our cloud-based IoT edge orchestration solution.Job Summary:The Senior Site Reliability Manager will be responsible for ensuring the availability of our SaaS platform and meeting the...


  • San Diego, California, United States BD Full time

    Job Title: Site Reliability Engineering ManagerJob Summary:A Site Reliability Engineering Manager is responsible for ensuring that systems and services run smoothly, reliably, and efficiently at scale. They manage a team of SREs to maintain infrastructure, handle incident response, and improve the system's reliability and performance.Key...

Senior Site Reliability Engineer

2 months ago


San Diego, California, United States Platform Science Full time
About the Role

We are seeking a highly skilled Senior Site Reliability Engineer to join our team in San Diego, CA (or remote). As a key member of our SRE team, you will be responsible for ensuring the reliability and performance of our cloud-based platform.

Key Responsibilities
  • Develop and enhance CI/CD pipelines to streamline application deployment and management
  • Maintain Helm charts to optimize application deployment and management
  • Establish standardized observability solutions to empower development teams
  • Lead the effort in promoting and prioritizing reliability, driving achievement of uptime goals, and mentoring colleagues in SRE best practices
  • Conduct comprehensive Production Readiness Reviews, working with teams to identify and establish Service Level Indicators and Service Level Objectives (SLIs/SLOs)
  • Design and develop software solutions to address operational challenges effectively to improve system stability and reliability
  • Fulfill on-call duties, providing expert support to development teams for mission-critical applications in production environments
Requirements
  • 5+ years of hands-on experience in SRE or Platform Engineering roles
  • Demonstrated expertise with automation technologies like Jenkins, ArgoCD, or similar
  • Experience with Kubernetes (2+ years), Helm, and Docker within production environments
  • Proficiency with current software development lifecycle (SDLC) concepts and best practices, CI/CD pipelines, and test-driven development
  • Experience with AWS, encompassing proficiency in EKS, IAM, autoscaling, networking, and load balancing/request routing in a production environment
  • Proficient in Python, Bash, Nodejs, and/or Go
  • Proficient with distributed tracing methodologies and observability tools such as Prometheus, ELK, or Datadog
  • Strong emphasis on documentation and fostering knowledge-sharing practices within the team and organization
  • Track record of successfully training and mentoring engineers
  • Proven expertise in optimizing performance and managing costs within cloud environments
  • Sound understanding of SLI/SLO concepts and adherence to SRE best practices
What We Offer
  • Competitive salary range: $109,950 - $176,979
  • Bonus, equity, and benefits package
  • Opportunity to work with a cutting-edge cloud-based platform
  • Collaborative and dynamic work environment
  • Professional development and growth opportunities

Please note that the compensation details listed reflect the base salary only, and do not include bonus, equity, or benefits.