Current jobs related to Senior Systems Reliability Specialist - Seattle, Washington - PeopleGene


  • Seattle, Washington, United States Blue Origin Full time

    Job Title: Senior Reliability EngineerAt Blue Origin, we're pushing the boundaries of space exploration and development. As a Senior Reliability Engineer, you'll play a critical role in ensuring the reliability and safety of our engines and propulsion systems.Responsibilities:Develop and implement reliability requirements for engine control systemsSupport...


  • Seattle, Washington, United States Amentum Full time

    Job Title: Reliability SpecialistWe are seeking a highly skilled Reliability Specialist to join our team at Amentum. As a key member of our maintenance team, you will be responsible for ensuring the reliability and maintainability of our equipment and processes.Key Responsibilities:Develop and maintain the reliability program, including predictive...


  • Seattle, Washington, United States Apex Systems Full time

    Job SummaryThe Senior Systems Engineer IV is a technical leader within the Systems Engineering team, responsible for resolving complex issues, mentoring less experienced team members, and providing technical guidance and organizational support for the team. This role requires expertise in evaluating, designing, building, and supporting complex systems, as...

  • Reliability Engineer

    4 weeks ago


    Seattle, Washington, United States SpaceX Full time

    Job Title: Senior Reliability EngineerSpaceX is a pioneering space technology company that is revolutionizing the space industry. We are seeking a highly skilled Senior Reliability Engineer to join our team.Job Summary:The Senior Reliability Engineer will be responsible for developing and implementing reliability prediction models, conducting accelerated...


  • Seattle, Washington, United States Jobot Full time

    About the RoleWe are seeking a highly skilled Senior Reliability Engineer to join our team at Jobot. As a key member of our cross-functional team, you will play a critical role in ensuring the reliability, performance, and longevity of our satellite components.Key ResponsibilitiesDevelop and implement comprehensive reliability test strategies to minimize...


  • Seattle, Washington, United States Saxon Global Full time

    Job SummaryStarbucks is seeking a highly skilled Senior Site Reliability Engineer to join their Data Platform Services team. This team is responsible for maintaining and improving the data platform that many Starbucks services rely on.Key ResponsibilitiesEnsure the health and stability of production systemsDevelop and implement monitoring dashboards and...


  • Seattle, Washington, United States Jobot Full time

    {"Job Title": "Senior Reliability & Test Engineer", "Company": "Jobot", "Job Description": "Job SummaryWe are seeking a highly skilled Senior Reliability & Test Engineer to join our team. As a key member of our cross-functional team, you will play a critical role in ensuring the reliability, performance, and longevity of our satellite components.Key...


  • Seattle, Washington, United States F5 Networks Full time

    About the RoleF5 Networks is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior SRE, you will be responsible for ensuring the reliability and performance of our systems and services.Key ResponsibilitiesDesign and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions.Develop...


  • Seattle, Washington, United States SingleStore Full time

    Senior Site Reliability EngineerAt SingleStore, we're seeking a seasoned Senior Site Reliability Engineer to drive our Kubernetes product strategy and help shape the future of our managed service.Key ResponsibilitiesDesign and build elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.Develop and maintain production container...


  • Seattle, Washington, United States Amazon Full time

    About the RoleAmazon is seeking a highly skilled Senior Hardware Reliability Engineer to join our team. As a key member of our Hardware Engineering team, you will be responsible for designing and developing reliable hardware systems that meet the needs of our customers.Key ResponsibilitiesDesign and develop reliable hardware systems that meet customer...


  • Seattle, Washington, United States Tik Tok Full time

    Job Title: Senior Site Reliability Engineer, InfrastructureAt TikTok, we're looking for a highly skilled Senior Site Reliability Engineer to join our Infrastructure team. As a key member of our team, you will be responsible for designing, building, and operating large-scale, massively distributed infrastructures.Responsibilities:Design and implement...


  • Seattle, Washington, United States Elit IT Inc. Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Elit IT Inc. in Seattle, WA. As a key member of our cloud operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and...


  • Seattle, Washington, United States SingleStore Full time

    Job Title: Senior Site Reliability EngineerAt SingleStore, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and running elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.Key Responsibilities:Help...


  • Seattle, Washington, United States Apple Full time

    Job DescriptionAs a Senior Site Reliability Engineer for Object Storage at Apple, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure. Your expertise will be instrumental in designing, implementing, and maintaining high-performance systems that meet the demands of our global user base.Key...


  • Seattle, Washington, United States F5 Networks Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at F5 Networks. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our systems.Key ResponsibilitiesDesign and implement scalable and efficient system architecturesDevelop and maintain monitoring and...


  • Seattle, Washington, United States Saxon Global Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the health and performance of our cloud-based systems. You will work closely with our development team to design, implement, and maintain scalable and reliable cloud infrastructure.Key...


  • Seattle, Washington, United States University of Washington Full time

    Job Title: Senior Computing SpecialistThe University of Washington is seeking a highly skilled Senior Computing Specialist to join our team. As a key member of our IT department, you will be responsible for providing technical support and maintenance for our computing systems, including servers, workstations, and network infrastructure.Key...


  • Seattle, Washington, United States CloudBC Labs Full time

    Job Title: Senior SRE - Data DevOps SpecialistCloudBC Labs is seeking a highly skilled Senior SRE - Data DevOps Specialist to join our team. As a key member of our Cloud Infrastructure team, you will be responsible for ensuring the health and reliability of our production systems.Key Responsibilities:Develop and maintain monitoring dashboards to ensure...


  • Seattle, Washington, United States eTek IT Services, Inc. Full time

    Job OverviewThe Senior Site Reliability Engineer plays a critical role in ensuring the reliability, scalability, and performance of our systems and services. They are responsible for designing and implementing tools and automated solutions to improve system reliability, monitoring, and incident response.Key ResponsibilitiesDesign and Implement Infrastructure...


  • Seattle, Washington, United States Apple Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Object Storage team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our object storage services.Key ResponsibilitiesDesign and implement monitoring and automation tools to...

Senior Systems Reliability Specialist

2 months ago


Seattle, Washington, United States PeopleGene Full time

Position Title: Senior Systems Reliability Specialist

Work Environment: Hybrid model with onsite presence required three days a week

Technology Stack: Python, AWS, Terraform, Ansible

Compensation Structure: $USD/hr W2

Key Responsibilities:

  • Design, implement, and manage systems while establishing and promoting best practices in cloud infrastructure utilizing self-healing mechanisms, infrastructure-as-code, security protocols, and automation techniques.
  • Create and maintain effective telemetry, alerts, and responses to proactively identify and mitigate reliability challenges.
  • Engage in an on-call support rotation alongside other engineering teams.
  • Explore, test, and advocate for innovative technologies, concepts, and best practices within the wider engineering community.

Preferred Qualifications:

  • At least 5 years of experience in technical operations or systems reliability roles.
  • A minimum of 3 years managing complex, large-scale enterprise applications or websites.
  • Experience with configuration management and orchestration tools (e.g., Chef, Terraform, Cloud Formation).
  • Proficiency in one or more programming languages (e.g., GO, Python, Java, Ruby).
  • Familiarity with containerization technologies (e.g., Docker, Kubernetes, Mesos, Elastic Container Service).
  • Expertise in Cloud/PaaS environments (e.g., AWS, Google Cloud Compute).
  • Comprehensive understanding of continuous integration tools (e.g., Jenkins).
  • Experience with F5 load balancing is advantageous.
  • Solid background in UNIX/LINUX and some Windows server environments, including skills in system installation, configuration, administration, troubleshooting, performance optimization, preventative maintenance, capacity planning, monitoring, and security practices.
  • Expertise in web (IIS, Apache) and Java application (Tomcat, Jboss, etc.) server management, including installation, administration, configuration, troubleshooting, performance optimization, preventative maintenance, capacity planning, monitoring, and security practices.