Senior Site Reliability Engineer
4 weeks ago
We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Team. As a key member of our team, you will be responsible for deploying, managing, optimizing, and upgrading the systems that run Sight Machine software.
You will work closely with our Development Engineering team to ensure the stability, reliability, and availability of all platform components. Your expertise in Kubernetes, Docker, and cloud infrastructure will be instrumental in driving innovation and improving operational efficiency.
Responsibilities
- Employ DevOps principles to provide technical operational support for comprehensive cloud infrastructure operations.
- Troubleshoot and resolve complex systems problems that cross multiple layers of the systems stack.
- Instrument monitoring and alerting infrastructure for critical services, and respond to the alerts it raises.
- Participate in our on-call support schedule.
- Proactively pursue opportunities for operational innovation to improve the stability, reliability, and availability of all platform components.
Requirements
- 5+ years of experience with Kubernetes / Docker in at least one of the top-tier cloud providers.
- 5+ years of experience coding in languages such as Python, Java, Go, and Terraform.
- 5+ years of experience using IaC and CI/CD tools such as FluxCD (or similar), Jenkins, Terraform, and GitHub.
- Strong experience with the Linux OS.
- Strong working knowledge of Networking (TCP/IP and Application).
- A willingness to author technical documentation for design, workflows, processes, best practices, etc.
- Willing to mentor other team members and engineers.
We offer a competitive salary, stock options, and a comprehensive benefits package. You will also have access to a hybrid work environment, flexible vacation policy, and a range of perks including catered lunches, snacks, and beverages.
We are an equal opportunity employer and consider candidates regardless of age, color, ancestry, religion, sex, national origin, sexual orientation, citizenship, marital status, disability, gender identity, or veteran status.
-
Senior Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Astranis Full time
Astranis Mission: Astranis is revolutionizing global connectivity by developing the next generation of smaller, more cost-effective spacecraft. Our mission is to bridge the digital divide and connect the four billion people worldwide who lack internet access. Job Summary: We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join...
-
Senior Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Twitter Full time
Job Summary: Twitter is seeking a Senior Site Reliability Engineer to lead a team of engineers working to keep our services reliable and scalable. The ideal candidate will have experience managing services in a distributed environment and be comfortable working with on-prem and cloud-based infrastructure. Responsibilities: Lead a team of site reliability...
-
Site Reliability Engineer
3 weeks ago
San Francisco, California, United States WEX Full time
Job Summary: The WEX Site Reliability Engineering team is seeking a highly motivated and quick-learning individual to join our team as a Site Reliability Engineer Level 1. As a key member of our team, you will be responsible for ensuring the reliability, performance, and security of our systems. Key Responsibilities: Actively participate in training and...
-
Senior Staff Site Reliability Engineer
3 weeks ago
San Francisco, California, United States WEX Full time
The WEX Site Reliability Engineering team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability, and performance. The team will be part of the Benefits Reliability organization, which supports our internal stakeholders and our Benefits Platform teams. As part of the...
-
Senior Staff Site Reliability Engineer
3 weeks ago
San Francisco, California, United States WEX Full time
About the Role: The WEX Site Reliability Engineering team is seeking a technical leader to drive the design and implementation of complex systems at scale. As a Senior Staff SRE, you will work closely with engineering teams to ensure that our systems are reliable, performant, and secure. Key Responsibilities: Provide technical guidance and mentorship to other...
-
San Francisco, California, United States TBWA\Chiat\Day Full time
Job Title: Senior Site Reliability Engineer with Perplexity AI. Job Summary: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Perplexity AI. As a key member of our infrastructure team, you will be responsible for designing, implementing, and scaling our cloud infrastructure to support our AI-powered search...
-
Senior Site Reliability Engineer
3 weeks ago
San Francisco, California, United States Astranis Full time
Astranis is a pioneering company that aims to bridge the digital divide by connecting people worldwide who lack internet access. We're building the next generation of smaller, more cost-effective spacecraft to bring the world online. As a team, we've made significant progress, launching two satellites into orbit, signing ten commercial deals worth over $1...
-
Senior Site Reliability Engineer
4 weeks ago
San Francisco, California, United States HashiCorp Full time
About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Production Engineering team at HashiCorp. As a key member of our team, you will be responsible for ensuring the reliability, performance, and robustness of our Terraform Platform. Key Responsibilities: Dive into complex problems with a focus on both immediate remediation...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Unreal Gigs Full time
Job Title: Site Reliability Engineer. At Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems. Key Responsibilities: Design and implement monitoring, logging, and alerting...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States DaVita Full time
About the Role: The WEX Site Reliability Engineering team is seeking a skilled Site Reliability Engineer to join our Platform Reliability organization. As a key member of our team, you will be responsible for developing software and solutions focused on observability, incident response, reliability, and performance. You will collaborate with our engineering...
-
Senior Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Crusoe Full time
About Crusoe Energy Systems: Crusoe Energy Systems is a pioneering company that aims to unlock value in stranded energy resources through the power of computation. By co-locating mobile data centers with stranded energy resources, such as flare gas and underloaded renewables, Crusoe delivers low-cost, carbon-negative distributed computing solutions. Our...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Instabase Full time
About Instabase: At Instabase, we're passionate about harnessing the power of AI innovation to democratize access to cutting-edge technology and empower organizations to solve complex unstructured data problems. With a strong presence in the market and a talented team, we're committed to delivering top-tier solutions that drive business success. Job...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Instabase Full time
About Instabase: Instabase is a global company with offices in San Francisco, New York, London, and Bengaluru. We're a people-first organization that values experimentation, curiosity, and customer obsession. Job Summary: We're seeking a Site Reliability Engineer to join our Site Reliability and Platform Engineering team. As a key member of our team, you'll be...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Withorb Full time
About Us: Orb is a cutting-edge technology company on a mission to revolutionize the way businesses approach revenue growth. Our team is passionate about building a robust infrastructure that enables our customers to unlock their full potential. Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our...
-
Senior Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Hinge Health Full time
About the Role: Hinge Health is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our platform, including automation, logging, monitoring, and alerting. You will thrive in a collaborative environment, have excellent communication skills, and be...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Outdefine Full time
About the Job: We are seeking a highly skilled Site Reliability Engineer to join our team at Outdefine. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our ecommerce platform. Key Responsibilities: Design and implement scalable and highly available cloud infrastructure using Kubernetes...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Roman Health Pharmacy LLC Full time
About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Xero. As a key member of our Reliability Enablement team, you will play a critical role in ensuring the reliability and performance of our systems. Key Responsibilities: Investigate operational surprises and support teams in post-incident activities. Conduct in-depth...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States YO HR CONSULTANCY Full time
Job Title: Site Reliability Engineer. Job Description: At YO HR CONSULTANCY, we are seeking a highly skilled Site Reliability Engineer to join our team. Key Responsibilities:
* Extensive experience working with Linux flavors like RHEL/CentOS OS, shells, filesystems, and utilities
* Knowledge of distributed computing and experience working with container...
-
Site Reliability Engineer
4 weeks ago
San Francisco, California, United States Orb Full time
About the Role: Orb is seeking a skilled Site Reliability Engineer to join our team. As a key member of our engineering organization, you will play a critical role in maintaining and scaling our robust infrastructure, ensuring stability, scalability, and performance. You will be responsible for tackling complex engineering challenges, from scaling our data...