Expert Site Reliability Engineer

1 month ago


Washington, United States Allscripts Full time

Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today’s healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics, and artificial intelligence (AI) to develop scalable data-driven solutions that bring significant value to all healthcare stakeholders. Together, we can transform healthcare and enable smarter care for millions of people.

Veradigm Provider

Veradigm offers provider practices a suite of easy-to-use healthcare provider solutions that help streamline clinical and financial workflows. We then deliver actionable insights to drive improved outcomes, reduce patients’ out-of-pocket costs, and enhance patient understanding of their disease state and medication therapy.

Our healthcare provider solutions help practices to:

Reduce the administrative burden associated with ever-changing regulatory and reimbursement requirements Improve practice financial performance and take advantage of the benefits of health information technology innovations Enhance patient satisfaction by reducing high costs and long wait times common to many prescriptions Get patients all their specialty medications faster and more easily

We are seeking an expertly skilled and motivated Expert Site Reliability Engineer (SRE) to enhance our dynamic team. In this senior role, you will be instrumental in safeguarding the reliability, performance, and uninterrupted availability of our systems and services. You'll not only manage and promptly resolve incidents as they arise but also use your advanced knowledge of Azure and AWS cloud services to prevent potential issues through strategic innovation.

As a leader in our SRE department, you will bring at least 8 years of relevant industry experience, including a minimum of 3 years in a senior capacity such as Senior SRE or Senior DevOps Engineer. Your day-to-day responsibilities will extend beyond incident management to embrace the mentorship of other engineers, guiding them through career progression and cultivating a culture of excellence and continuous improvement.

The ideal candidate for this position is someone with a passion for delving into and resolving complex technical challenges. You must be self-driven, possess exceptional problem-solving capabilities, and exhibit outstanding communication skills. Your role will involve frequent collaboration with cross-functional teams, where your ability to articulate technical concepts to a diverse audience will be key.

Expertise in developing, implementing, and tracking Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) are essential, as these will be your tools to quantify and achieve our high standards of service reliability.

What you will contribute:

Serve as an on-call engineer, responsible for managing and resolving incidents that affect the availability and performance of our systems. Collaborate with teams with many years of experience in development, operations, and infrastructure to design, implement, and maintain robust, scalable, and reliable systems. Proactively monitor and analyze system metrics to identify potential issues and take necessary actions to prevent or mitigate them. Conduct thorough root cause analysis of incidents, identifying underlying issues and implementing long-term solutions to prevent recurrence. Automate manual processes and tasks to improve efficiency and reduce human error. Participate in capacity planning and performance optimization efforts to ensure system scalability and reliability. Stay updated with the latest industry trends and emerging technologies related to cloud services and site reliability engineering.

The ideal candidate will have:

Bachelor’s degree in computer science, engineering, a related field or equivalent work experience. 7+ years of experience in development, operations, and infrastructure, with a current or most recent role as a Senior SRE, Senior DevOps Engineer, or an equivalent senior position for at least 2-3 years. Coding proficiency in a high-level programming language (C# preferred) and applied knowledge of Object-Oriented Programming: Java, Objective-C, C#, C/C++, Python.  Proficient in scripting and automation using languages such as Python, Bash, or PowerShell. 3+ years of experience with service-oriented architectures and microservices . Possesses a profound understanding of Site Reliability Engineering principles, with a proven track record in effectively implementing Service Level Agreements (SLAs), Service Level Indicators (SLIs), and Service Level Objectives (SLOs) to drive and measure system reliability and performance. Extensive experience in incident management and on-call support, preferably in a high-availability production environment. Strong knowledge of cloud services, particularly in Azure and AWS, including virtual machines, networking, storage, and load balancing. Excellent troubleshooting and problem-solving skills, with a keen attention to detail. Self-driven and motivated, with the ability to work independently and prioritize tasks effectively. Strong communication and interpersonal skills, with the ability to collaborate and communicate effectively with cross-functional teams. Familiarity with DevOps practices and tools, such as CI/CD pipelines and infrastructure-as-code. Experience with monitoring and logging tools, such as Splunk, Prometheus, Grafana, ELK stack, or similar.

Additional Requirements for Expert SRE

Mentorship: Demonstrated ability to mentor peers and guide them in their career progression, fostering a culture of continuous learning and improvement within the team. Customer-Focused Role: The candidate's current role must involve dealing directly with production environments and external customers, ensuring high standards of reliability and service quality.

Bonus Skills and Certifications

Certifications in Azure, AWS, Terraform, Kubernetes

#LI-CT1
#LI-Remote

Enhancing Lives and Building Careers

Veradigm believes in empowering our associates with the tools and flexibility to bring the best version of themselves to work and to further their professional development. Together, we are In the Network . Interested in learning more?

Take a look at our .

We strongly advocate that our associates receive all CDC recommended vaccinations in prevention of COVID-19.

Visa Sponsorship is not offered for this position.

At Veradigm, our greatest strength comes from bringing together talented people with diverse perspectives to support the needs of healthcare providers, life science companies, health plans, and the patients they serve. The Veradigm Network is a dynamic, open community of solutions, external partners, and cutting-edge artificial intelligence technologies that provide advanced insights, technology, and data-driven solutions. Veradigm offers a comprehensive compensation and benefits package, including holidays, vacation, medical, dental, and vision insurance, company paid life insurance and retirement savings.

Veradigm’s policy is to provide equal employment opportunity and affirmative action in all of its employment practices without regard to race, color, religion, sex, national origin, ancestry, marital status, protected veteran status, age, individuals with disabilities, sexual orientation or gender identity or expression or any other legally protected category. Applicants for North American based positions with Veradigm must be legally authorized to work in the United States or Canada. Verification of employment eligibility will be required as a condition of hire. Veradigm is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse and inclusive workforce.

From a "VEVRAA Federal Contractor" We request Priority Referral of Protected Veterans

This is an official Veradigm Job posting. To avoid identity theft, please only consider applying to jobs posted on our official corporate site.

Thank you for reviewing this Veradigm opportunity. Does this look like a great match for your skill set? If so, scroll on down and tell us more about yourself



  • Washington, United States Cinder LLC Full time

    [Full Time] Site Reliability Engineer at Cinder (United States) | BEAMSTART Jobs Site Reliability Engineer Cinder United StatesDate Posted31 Oct, 2022Work LocationWashington, DC, United StatesSalary Offered$110 — $220 yearlyJob TypeFull TimeExperience Required1+ yearsRemote WorkYesStock OptionsNoVacancies1 availableAbout Cinder Cinder provides a...


  • Washington, United States Vontier Full time

    We are seeking an energetic, self-motivated Site Reliability Engineer to join our team! The ideal candidate will be highly energetic and committed to an excellent product, culture, and will be a strong communicator with solid problem solving skills. The site reliability engineer should be an IT expert who uses automation tools to monitor and observe...


  • Washington, United States Talent Discovery Pros Full time

    Job DescriptionJob Description TITLE : Site Reliability Engineer LOCATION : Washington DC CLEARANCE REQUIRED : TS/SCI WORK AUTHORIZATION : US Citizen As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal...


  • Washington, United States StaffWorthy Inc Full time

    We are a dynamic technology services provider committed to delivering exceptional solutions to government clients. For over two decades, we have been assembling top-tier teams dedicated to innovation and excellence. Our mission revolves around the value we bring to our customers and the unwavering passion we have for our people. Position: Site Reliability...


  • Washington, United States Kansas Action for Children Full time

    at T-Mobile USA, Inc. in Overland Park, Kansas, United States Job Description Be unstoppable with us! T-Mobile is synonymous with innovation-and you could be part of the team that disrupted an entire industry! We reinvented customer service, brought real 5G to the nation, and now we're shaping the future of technology in wireless and beyond. Our work is as...


  • Washington, United States Evolver Federal Full time

    Job DescriptionJob DescriptionEvolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts.ResponsibilitiesWork as a member of the team to...


  • Washington, United States Evolver Federal Full time

    Evolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts. Responsibilities Work as a member of the team to support the GovInfo Program,...


  • Washington, United States Alldus Full time

    Our client is a Series A startup within the Generative AI space and they are hiring an Site Reliability Engineer to join the team. Backed by one of the leading venture capital firms in the industry, this is an exciting opportunity to join a SaaS company that is revolutionizing their industry. Responsibilities: As the Site Reliability Engineer, you will...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States EmergencyMD Full time

    Evolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts. Responsibilities Work as a member of the team to support the GovInfo Program, and...


  • Washington, United States Harbor Compliance Full time

    Job DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...


  • Washington, United States Harbor Compliance Full time

    Job DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...


  • Washington DC, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities As a Site Reliability Engineer...


  • Washington, United States System One Full time

    Site Reliability Engineer Work Location: 3 days onsite DC - JBAB, 2 days remote Clearance: Active TS/SCI with ability to clear PSD As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What You’ll Do Monitor platform and...


  • Washington, United States Mount Indie Full time

    Job Description Job Description As a Site Reliability Engineer (SRE) , youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Mount Indie Full time

    Job Description Job Description As a Site Reliability Engineer (SRE) , youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington DC, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to grow, we seek a Site Reliability Engineer who is...


  • Washington, United States Mount Indie Full time

    Job Description Job Description As a Site Reliability Engineer (SRE) , youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...