Site Reliability Engineer P-051
4 weeks ago
The role
As a Site Reliability Engineer (SRE) at company, your mandate is to ensure the availability and reliability of our most critical services so that they meet our customers’ requirements. Our SRE team is growing, so you’ll be a crucial early member who helps establish the team, its processes, and its best practices. Success in this role looks like collaborating with other teams to build and run sustainable production systems that can evolve and adapt to changes in our fast-paced environment.
This role is responsible for:
Working proactively with engineering teams to help them set SLOs and implement best practices for logging and telemetry collection
Designing, implementing, and maintaining the tools and systems that support service reliability, monitoring, and alerting
Participating in a 24x7 on-call rotation supporting the health of our services
Driving the incident management process and supporting a blameless post-mortem culture
Participating in application design consulting and capacity planning
Defining and formalizing SRE practices and helping guide the overall reliability engineering direction
Providing mentorship both formally and informally to engineers
Continuously optimizing systems and workflows by improving architecture, infrastructure, automation, CI/CD, and observability
Combining software and systems knowledge to engineer high-volume distributed systems in a reliable, scalable, and fault-tolerant manner
You bring
5+ years of relevant industry experience with a focus on distributed cloud native systems design, observability, operation, maintenance, and troubleshooting
5+ years operational experience with an observability platform like Datadog, Splunk, Prometheus/Grafana, or AppDynamics
Fluency in one or more programming languages (e.g. Python, TypeScript, Go)
A strong conviction in software development best practices, including version control, automated testing, and continuous integration and delivery
You're self-motivated, inquisitive, and always looking to learn new technologies
You’re a great teammate who communicates clearly and transparently
The Triple H Factor: Humble, Hungry and Honest
An act-like-an-owner mentality. We have a bias toward taking action.
-
Senior Site Reliability Engineer
2 weeks ago
San Jose, United States Hireio, Inc. Full time Position Description: Location: USA/USA/California/SF Bay Area, Seattle Base Salary: 187K - 280K Sponsor Visa? Yes Language Requirements: English, Mandarin (Preferred) Our Team: Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, and...
-
Site Reliability Engineer
3 days ago
San Diego, United States ObjectWin Technology Full time Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...
-
Site Reliability Engineer
4 days ago
San Francisco, United States Vertisystem Full time Duration: 6 months contract Pay rate: $90/hr on W2 Job Summary: It is an exciting time to be part of the organization’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...
-
Site Reliability Engineer
3 days ago
San Francisco, United States Vertisystem Full time Duration: 6 months contract Pay rate: $90/hr on W2 Job Summary: It is an exciting time to be part of the organization’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...
-
Site Reliability Engineer
3 days ago
San Diego, CA, United States Talent Software Services Full time Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Position Summary: As a member of the CICD and Cloud Reliability team you'll work at the heart of...
-
AI Ops Site Reliability Engineer
4 days ago
San Jose, CA, United States TikTok Full time Description: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us: Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This...
-
Senior Site Reliability Engineer
4 weeks ago
San Diego, United States ACL Digital Full time W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team...
-
Senior Site Reliability Engineer
3 weeks ago
San Diego, United States ACL Digital Full time W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs...
-
Senior Site Reliability Engineer
3 weeks ago
San Diego, United States ACL Digital Full time W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team...
-
Sr. Reliability Engineer
7 days ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering...
-
Sr. Reliability Engineer
6 days ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering...
-
Sr. Reliability Engineer
7 days ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering that stored energy as heat and power at...
-
Sr. Reliability Engineer
4 days ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering that stored energy as heat and power at...
-
Sr. Reliability Engineer
6 days ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering that stored energy as heat and power at...
-
Site Reliability Engineer Manager
1 week ago
San Francisco, United States Illuminate Literacy Full time As the Site Reliability Engineer at Illuminate Literacy, you will serve a critical role in our mission to eradicate illiteracy. You will lead and oversee our production environment's reliability, security, and quality assurance. This role involves managing a multifaceted team responsible for operational health, security...
-
Sr. Reliability Engineer
2 weeks ago
San Jose, United States Antora Energy Full time At Antora, we're on a mission to stop climate change. And we can't do that unless we tackle the 30% of global emissions that come from industry. Antora is unlocking zero-emissions industrial energy, cheaper than fossil fuels. Antora's thermal batteries store energy from renewables as heat for days on end, delivering...
-
AI Ops Site Reliability Engineer
4 days ago
San Jose, United States TikTok Full time Description: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us: Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This...