Sr Specialist Site Reliability Engineer
2 days ago
About This Position
We are seeking a highly skilled and proactive Senior Specialist, Site Reliability Engineering (SRE) to help drive reliability, scalability, and performance across our critical platforms. This role is ideal for a senior-level engineer who combines deep technical expertise with a passion for automation, observability, and operational excellence.
As a Senior Specialist, you'll work on complex reliability challenges, lead technical initiatives, and collaborate across engineering, product, and infrastructure teams to ensure our systems are resilient and efficient.
What You'll Do
- Reliability Engineering
  - Architect and implement solutions to improve system reliability, scalability, and performance.
  - Define and manage SLIs/SLOs and error budgets across services.
  - Lead efforts to automate operational tasks and improve system observability.
- Incident Management & Root Cause Analysis
  - Serve as a technical lead during major incidents and drive resolution.
  - Conduct deep root cause analyses and implement long-term fixes.
  - Champion blameless postmortems and continuous improvement.
- Technical Leadership
  - Lead cross-functional reliability initiatives and mentor junior engineers.
  - Influence system design and architecture to embed reliability from the ground up.
  - Collaborate with software engineers to optimize deployment pipelines and infrastructure.
- Monitoring & Tooling
  - Enhance observability through metrics, logging, and tracing.
  - Develop and maintain dashboards, alerts, and automated recovery systems.
What You'll Need
- 7+ years of experience in SRE, DevOps, or infrastructure engineering.
- Deep expertise in cloud platforms (AWS, GCP, or Azure), container orchestration (Kubernetes), and infrastructure-as-code (Terraform, CloudFormation).
- Strong proficiency in observability tools (e.g., Prometheus, Grafana, Splunk) and CI/CD pipelines.
- Proven track record of solving complex reliability challenges in distributed systems.
- Excellent communication and collaboration skills.
- Experience with Python, PowerShell, or similar languages.
- Active use of artificial intelligence (AI) tools and platforms to enhance performance, streamline workflows, improve decision-making, and drive innovation across business functions.
- Curiosity and adaptability in exploring emerging AI technologies, with a mindset for continuous learning and experimentation.
Preferred Qualifications
- Experience in regulated or high-availability environments (e.g., financial services, healthcare).
- Familiarity with chaos engineering, performance tuning, and capacity planning.
- Background in software development with strong coding skills (e.g., Python, Go, Bash).
About Waystar
Through a smart platform and better experience, Waystar helps providers simplify healthcare payments and yield powerful results throughout the complete revenue cycle.
Waystar's healthcare payments platform combines innovative, cloud-based technology, robust data, and unparalleled client support to streamline workflows and improve financials so providers can focus on what matters most: their patients and communities. Waystar is trusted by 1M+ providers, 1K+ hospitals and health systems, and is connected to over 5K commercial and Medicaid/Medicare payers. We are deeply committed to living out our organizational values: honesty; kindness; passion; curiosity; fanatical focus; best work, always; making it happen; and joyful, optimistic & fun.
Waystar products have won multiple Best in KLAS or Category Leader awards since 2010 and earned multiple #1 rankings from Black Book surveys since 2012. The Waystar platform supports more than 500,000 providers, 1,000 health systems and hospitals, and 5,000 payers and health plans. For more information, follow @Waystar on Twitter.
WAYSTAR PERKS
- Competitive total rewards (base salary + bonus, if applicable)
- Customizable benefits package (3 medical plans with Health Savings Account company match)
- Generous paid time off for non-exempt team members, starting at 3 weeks, plus 13 paid holidays (including 2 personal floating holidays); flexible time off plus 13 paid holidays for exempt team members
- Paid parental leave (including maternity + paternity leave)
- Education assistance opportunities and free LinkedIn Learning access
- Free mental health and family planning programs, including adoption assistance and fertility support
- 401(k) program with company match
- Pet insurance
- Employee resource groups
Waystar is proud to be an equal opportunity workplace. We celebrate, value, and support diversity and inclusion. Qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, marital status, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.
This applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.
Job Category: Technology/Engineering
Job Type: Full time
Req ID: R2820