Senior Site Reliability Engineer

4 days ago


Seattle, Washington, United States SingleStore Full time
Job Title: Senior Site Reliability Engineer

At SingleStore, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and running elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.

Key Responsibilities:

  • Help SingleStore craft its production container orchestration strategy.
  • Design, build, and run elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.
  • Experience designing systems for peak reliability, scalability, and performance.
  • Efficiently operate within a data center environment; monitoring performance and health of hardware and software, installing new servers, and upgrading as needed.
  • Participate in a SLA-driven on-call rotation, which will include after-hours, weekend, and rotating holiday participation.

Requirements:

  • Expert-level knowledge of Kubernetes and the container ecosystem.
  • Strong working knowledge of configuration management tools such as Ansible and Puppet.
  • Experience with Unix/Linux operating systems internals and administration (e.g., filesystems, inodes, system calls) and networking (e.g., TCP/IP, routing, network topologies and hardware, SDN) and a keen interest in relational databases.
  • Familiar with at least one of AWS, Azure, or Google Cloud.
  • Experience debugging, diagnosing and troubleshooting complex, production software.
  • C, Python, POSIX shell programming experience required. Experience with C++ / Go are a strong plus.
  • Familiarity with JunOS, routing protocols (BGP), IPSec and Ceph storage a plus.
  • B.S. Degree in Computer Science or related field.


  • Seattle, Washington, United States F5 Networks Full time

    About the RoleF5 Networks is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior SRE, you will be responsible for ensuring the reliability and performance of our systems and services.Key ResponsibilitiesDesign and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions.Develop...


  • Seattle, Washington, United States Saxon Global Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Data Platform Services team at Starbucks. As a key member of our team, you will be responsible for maintaining and improving the data platform that supports various Starbucks services.Key ResponsibilitiesEnsure the health and stability of our production systemDevelop and...


  • Seattle, Washington, United States F5 Networks Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at F5 Networks. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our systems and services.Key ResponsibilitiesDesign and implement scalable and reliable systems and servicesCollaborate with...


  • Seattle, Washington, United States F5 Full time

    About the RoleF5 is a leading provider of digital transformation solutions, and we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesLead the...


  • Seattle, Washington, United States F5 Networks Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at F5 Networks. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our systems and services.Key ResponsibilitiesDesign and implement scalable and reliable systems and servicesCollaborate with...


  • Seattle, Washington, United States Apple Full time

    Job SummaryApple is seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Services team. As a key member of our team, you will be responsible for designing, implementing, and operating large-scale cloud infrastructure to support Apple's internet services.About the RoleWe are looking for a strong, enthusiastic developer with a passion...


  • Seattle, Washington, United States MokshaaLLC Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Mokshaa LLC. As a key member of our cloud infrastructure team, you will be responsible for ensuring the scalability, reliability, and performance of our Azure cloud environment.Key Responsibilities:Design and implement scalable and highly...


  • Seattle, Washington, United States Saxon Global Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the health and performance of our cloud-based systems. You will work closely with our development team to design, implement, and maintain scalable and reliable cloud infrastructure.Key...


  • Seattle, Washington, United States eTek IT Services, Inc. Full time

    Job OverviewThe Senior Site Reliability Engineer plays a critical role in ensuring the reliability, scalability, and performance of our systems and services. They are responsible for designing and implementing tools and automated solutions to improve system reliability, monitoring, and incident response.Key ResponsibilitiesDesign and Implement Infrastructure...


  • Seattle, Washington, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Apple Services Engineering Cloud Services team. As a key member of this team, you will play a critical role in designing, building, and operating the cloud infrastructure that supports our services.Key ResponsibilitiesDesign and implement scalable and reliable cloud...


  • Seattle, Washington, United States Apple Full time

    Cloud Services SRE RoleAt Apple, we're looking for a skilled Site Reliability Engineer to join our Cloud Services team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud...


  • Seattle, Washington, United States Hulu Full time

    Job SummaryAs a Senior Site Reliability Engineer at Hulu, you will be a key member of our Performance and Reliability embedded teams. We focus on planning, scoping, solution architecting, software design, and implementation based on functional and performance capability requirements. We leverage cloud-native, commercial, and open-source tools and frameworks...


  • Seattle, Washington, United States Tik Tok Full time

    Job Title: Senior Site Reliability Engineer, InfrastructureAt TikTok, we're looking for a highly skilled Senior Site Reliability Engineer to join our Infrastructure team. As a key member of our team, you will be responsible for designing, building, and operating large-scale, massively distributed infrastructures.Responsibilities:Design and implement...


  • Seattle, Washington, United States Moloco Full time

    About MolocoMoloco is a cutting-edge machine learning company that empowers organizations to unlock the full value of their unique first-party data. With a powerful combination of machine learning technologies, we play a unique role in shaping the digital economy, allowing companies to stay independent and scale.Our MissionWe are advancing the advertising...


  • Seattle, Washington, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Object Storage team. As a key member of our team, you will be responsible for designing, implementing, and maintaining the scalability, reliability, and performance of our object storage infrastructure.Key ResponsibilitiesDesign and implement scalable and reliable...


  • Seattle, Washington, United States SingleStore Full time

    Senior Site Reliability EngineerAt SingleStore, we're seeking a seasoned Senior Site Reliability Engineer to drive our Kubernetes product strategy and help shape the future of our managed service.Key ResponsibilitiesCollaborate with our engineering team to design, build, and run elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud...


  • Seattle, Washington, United States SingleStore Full time

    Senior Site Reliability EngineerAt SingleStore, we're seeking a seasoned Senior Site Reliability Engineer to drive our Kubernetes product strategy and help shape the future of our managed service.Key ResponsibilitiesCollaborate with our engineering team to design, build, and run elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States Phaidra Full time

    About PhaidraPhaidra is a cutting-edge technology company that's revolutionizing the industrial automation sector. Our mission is to empower facilities to adapt and improve over time, leveraging AI-powered control systems that learn and evolve continuously.We're a team of innovators, engineers, and problem-solvers who share a passion for creating...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...