Current jobs related to Lead Site Reliability Engineer - Seattle, Washington - SingleStore


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Lead Site Reliability Engineer Job Summary: We are seeking a highly skilled Lead Site Reliability Engineer to join our team at Sogeti. The successful candidate will be responsible for developing and maintaining cloud observability systems, building monitoring and alerting systems, and optimizing system performance. Key Responsibilities: *...


  • Seattle, Washington, United States DAT Freight Solutions Full time

    About DAT Freight SolutionsDAT Freight Solutions is a leading provider of transportation management software and services. We are seeking a highly skilled Site Reliability Engineering Lead to join our team.The successful candidate will be responsible for leading major technical initiatives and mentoring engineers to enhance their skills. They will work...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Lead Site Reliability Engineer Job Summary: We are seeking a highly skilled Lead Site Reliability Engineer to join our team. As a Lead Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining cloud observability systems, as well as building flexible monitoring and alerting to proactively address issues before...


  • Seattle, Washington, United States DAT Solutions Full time

    About DAT SolutionsWe are a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years.We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on us for the most relevant data...


  • Seattle, Washington, United States DAT Solutions Full time

    About DAT SolutionsAs a leading employer of choice, DAT Solutions is a next-generation SaaS technology company that has been at the forefront of innovation in transportation supply chain logistics for decades.We continue to transform the industry by deploying a suite of software solutions to millions of customers every day, providing them with the most...


  • Seattle, Washington, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform.ResponsibilitiesWe are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • Seattle, Washington, United States Apple Full time

    Site Reliability Engineering ManagerAt Apple, we're looking for a skilled Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for leading a team that provides the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using Azure or...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States HireIO Inc Full time

    Job Title: Site Reliability EngineerHireIO Inc is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our distributed systems.Key Responsibilities:Design and implement scalable and reliable systemsCollaborate with cross-functional...


  • Seattle, Washington, United States Oracle Full time

    About the Role:Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, develop, and deploy software to improve the availability, scalability, and efficiency of...


  • Seattle, Washington, United States Phaidra Full time

    About PhaidraPhaidra is a cutting-edge technology company that's revolutionizing the industrial automation sector. Our mission is to empower facilities to adapt and improve over time, leveraging AI-powered control systems that learn and evolve continuously.We're a team of innovators, engineers, and problem-solvers who share a passion for creating...


  • Seattle, Washington, United States Sogeti Full time

    Site Reliability Engineer **Job Summary** We are seeking an experienced Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. **Key Responsibilities** * Design, implement, and maintain scalable and reliable cloud...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.About the RoleWe are seeking a talented and motivated individual to join our dynamic...


  • Seattle, Washington, United States Qualtrics Full time

    We're seeking a skilled Site Reliability Engineer Manager to lead our Gov1 environment support team in the Foundation Product Unit. As a key member of our team, you'll be responsible for managing a team of US-based Support Engineers who will provide support for non-US teams in the Foundation org.This is a unique opportunity to start the first SRE team at...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States Oracle Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Oracle. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with our development teams to design, implement, and operate large-scale distributed...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple.About the RoleAs a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key ResponsibilitiesDesign, implement,...


  • Seattle, Washington, United States Apple Full time

    Job SummaryAs a Site Reliability Engineering Manager at Apple, you will lead a team responsible for providing the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our...


  • Seattle, Washington, United States Phaidra Full time

    About PhaidraPhaidra is a pioneering company in the industrial automation sector, leveraging AI-powered control systems to enable facilities to adapt and improve over time.Our mission is to revolutionize the way industrial facilities operate, making them more efficient, sustainable, and responsive to their environment.Job DescriptionWe are seeking a highly...

Lead Site Reliability Engineer

2 months ago


Seattle, Washington, United States SingleStore Full time
Position Overview

SingleStore is on the lookout for a Lead Site Reliability Engineer to spearhead our Kubernetes product initiatives related to our managed service offerings.

You will play a pivotal role in shaping the architecture, realizing the collective vision, and maintaining your strategic approach to product development.


This position is crucial in enhancing our managed service product line and will significantly influence the organization's future trajectory.

Role and Responsibilities

• Assist SingleStore in developing its production container orchestration strategy.

• Architect, implement, and manage scalable Kubernetes clusters across various environments including on-premises, AWS, Azure, and Google Cloud.

• Design systems for optimal reliability, scalability, and performance.

• Effectively manage operations within a data center environment; oversee the performance and health of both hardware and software, install new servers, and perform upgrades as necessary.

• Engage in a SLA-driven on-call rotation, which will include after-hours, weekend, and rotating holiday duties.

Required Skills and Experience

• Advanced knowledge of Kubernetes and the container ecosystem.

• Proficient in configuration management tools such as Ansible and Puppet.


• Strong understanding of Unix/Linux operating systems, including internals and administration (e.g., filesystems, inodes, system calls) and networking (e.g., TCP/IP, routing, network topologies and hardware, SDN), along with a keen interest in relational databases.

• Familiarity with at least one of the major cloud platforms: AWS, Azure, or Google Cloud.

• Proven experience in debugging, diagnosing, and troubleshooting complex production software.

• Proficiency in C, Python, and POSIX shell programming is essential. Experience with C++ or Go is highly desirable.

• Knowledge of JunOS, routing protocols (BGP), IPSec, and Ceph storage is a plus.

• B.S. Degree in Computer Science or a related discipline.