Staff Site Reliability Engineer, User Protection SRE

3 weeks ago

San Francisco, United States Google Full time

Staff Site Reliability Engineer, User Protection SRE Join to apply for the Staff Site Reliability Engineer, User Protection SRE role at Google Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California Fair Chance Act. Minimum qualifications Bachelor's degree in Computer Science or a related technical field or equivalent practical experience. 8 years of experience with site reliability engineering focused on building and maintaining scalable, reliable systems. Experience developing/launching products/technologies within AI/ML or a related area. Experience in designing, analyzing and troubleshooting distributed systems. Experience architecting for resilient systems. Preferred qualifications 3 years of experience working effectively cross-functionally with proven track record of driving results. Experience with AI or passion for the AI space with proficiency in using one or more leading AI platforms. Ability to debug, optimise code, and to automate routine tasks. Excellent problem-solving approach, coupled with effective communication skills and a sense of ownership and drive. Excellent communication skills with the ability to build technical consensus across teams. Excellent programming skills in one or more languages (e.g., Java, Python, Go, C++). About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault‑tolerant systems. SRE ensures that Google's services—both our internally critical and our externally‑visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE’s will keep an ever‑watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large‑scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame‑free environment and promote self‑direction to work on meaningful projects. Compensation The US base salary range for this full‑time position is $197,000–$291,000 plus bonus, equity and benefits. Compensation details listed in US role postings reflect the base salary only and do not include bonus, equity or benefits. Responsibilities Build best practices and tooling for managing AI/ML deployments including observability, mitigation capabilities, SLOs and capacity management while continuing to improve deployment velocity. Drive the whole life‑cycle of service from inception and design, through to deployment, operation and refinement. Monitor services once they are live by measuring and tracking availability, latency and overall system health. Evolve systems sustainably through mechanisms like automation, and advanced systems by pushing for changes that improve reliability and velocity. Lead sustainable incident response and blameless post‑mortems. Equal Employment Opportunity Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form. #J-18808-Ljbffr

Staff Site Reliability Engineer, User Protection SRE

4 weeks ago

San Francisco, CA, United States Google Full time

Staff Site Reliability Engineer, User Protection SRE Increase your chances of an interview by reading the following overview of this role before making an application. Join to apply for the Staff Site Reliability Engineer, User Protection SRE role at Google Applicants in San Francisco: Qualified applications with arrest or conviction records will be...
Staff Site Reliability Engineer, User Protection SRE

3 weeks ago

San Francisco, United States Google Inc. Full time

Staff Site Reliability Engineer, User Protection SRE Google San Francisco, CA, USA Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain. Apply Information: X Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for...
Staff Site Reliability Engineer, User Protection SRE

3 weeks ago

San Francisco, CA, United States Google Inc. Full time

Staff Site Reliability Engineer, User Protection SRE A variety of soft skills and experience may be required for the following role Please ensure you check the overview below carefully. Google San Francisco, CA, USA Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in...
Staff Site Reliability Engineer, User Protection SRE

4 weeks ago

San Francisco, CA, United States Google Full time

Staff Site Reliability Engineer, User Protection SRE Join to apply for the Staff Site Reliability Engineer, User Protection SRE role at Google Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California Fair...
Staff Site Reliability Engineer

3 weeks ago

San Francisco, United States Heartflow Full time

Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology. The flagship product—an AI-driven, non-invasive cardiac test supported by the ACC/AHA Chest Pain Guidelines called the Heartflow FFRCT Analysis—provides a color-coded, 3D model of a...
Staff Site Reliability Engineer

3 weeks ago

San Francisco, United States Heartflow Full time

Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology. The flagship product—an AI-driven, non-invasive cardiac test supported by the ACC/AHA Chest Pain Guidelines called the Heartflow FFRCT Analysis—provides a color‑coded, 3D model of a...
Staff Site Reliability Engineer

3 weeks ago

San Francisco, CA, United States Heartflow Full time

Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology. The flagship product—an AI-driven, non-invasive cardiac test supported by the ACC/AHA Chest Pain Guidelines called the Heartflow FFRCT Analysis—provides a color‑coded, 3D model of a...
Senior / Staff Site Reliability Engineer (SRE)

2 weeks ago

San Francisco, United States DevOps projects Full time

2025-10-25 Senior / Staff Site Reliability Engineer (SRE) Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivated, and focused on providing a world class supercomputing experience. We put out customers first in...
Software Engineer, Protected Data Site Reliability Engineering

1 week ago

San Francisco, California, United States Google Full time $141,000 - $202,000

Minimum qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.2 years of experience with software development in one or more programming languages.Preferred qualifications:Master's degree in Computer Science or Engineering. 2 years of experience designing, analyzing, and troubleshooting distributed...
Software Engineer, Site Reliability

2 weeks ago

San Francisco, United States Sierra Business Solution Full time

Software Engineer, Site Reliability (SRE) Software Engineer, Site Reliability (SRE) at Sierra Business Solution. About Us We are an in‑person company based in San Francisco with growing offices in Atlanta, New York, and London, building a platform that helps businesses create better, more human customer experiences with AI. Our core values are Trust,...

Americas

Europe

Asia / Oceania

Africa

Staff Site Reliability Engineer, User Protection SRE