Site Reliability Engineer

3 weeks ago


Irving, United States Omega Hires Full time

Role: Site Reliability Engineer
Location: Irving, TX (3 days in 2 weeks Hybrid)

JOB SUMMARY

The Site Reliability Engineering (SRE) team provides leadership, direction, and accountability for building and running large-scale software systems. As a Site Reliability Engineer, you will identify and deliver automation solutions designed to ensure high availability and resiliency using your expertise in software development, complexity analysis, and scalable system design. Strong collaboration skills will be required to work closely with other engineering teams to ensure services/systems are highly stable and performant, meeting the expectations of our business partners and end users.

JOB DUTIES

  • Partner with the architecture and development teams on how to make applications highly available, reliable, and performant at global scale
  • Collaborate with the architecture team to ensure reliability factors are accounted for in business features and enablers
  • Guide development teams in understanding established service level objectives and consequences, and implementing appropriate SLIs to support the objectives
  • Collaborate with development team members to swarm, troubleshoot, and resolve problems
  • Guide ad-hoc teams to brainstorm solutions and build implementation plans based on the Root Cause Analysis of production issues
  • Design and build automated solutions to optimize application/service/platform uptime with minimal human intervention
  • Be available for an on-call rotation to participate in troubleshooting and communication efforts outside of normal business hours
  • Implement and help create standards and best practices, and mentor other team members in order to drive adoption across development teams
  • Perform other duties as assigned
  • Conform with all company policies and procedures

JOB SPECIFICATION

Knowledge

  • Expert in defining, implementing, and evaluating Service Level Objectives (SLO) and Service Level Indicators (SLI), and associated consequences
  • Software development expertise in two or more high-level programming and scripting languages
  • Experience in evolutionary database design, query performance analysis, and indexing as a cornerstone for delivering scalable, performant products and services
  • Experience in designing, building, and optimizing automated pipelines with automated testing and automated security controls
  • Experience in performing Root Cause Analysis and Problem Management
  • Experience working in Agile Scrum teams with demonstrated success leading improvements (getting better/faster/happier)

SKILLS

  • Help establish and maintain a culture of learning through the development and sharing of skills, knowledge, process and tools; combat traditional silos that create “us and them” environments
  • A driving passion for finding solutions to hard problems at scale and operationalizing them
  • Exceptional critical thinking and communication skills, with a passion for leveraging documentation as a tool for constant improvement

Additional Knowledge, Skills, and Abilities

  • Pipeline Automation: Azure DevOps (YAML, ARM), Terraform, Jenkins, Chef, Octopus Deploy
  • Code Scanning: SonarQube, Checkmarx
  • Source Code repos: Git
  • Containerization: Azure Kubernetes Service, Kubernetes (open source), Docker
  • High level programming languages: Java, C# (.NET MVC and .NET Core), Go
  • Scripting: PowerShell, Bash
  • Database: Oracle, Microsoft SQL Server, NoSQL (e.g. CosmosDB)
  • Test Automation: Xamarin.UITest, SpecFlow, DevTest, Selenium, Test Data Manager, Postman, Maven, TestNG, JMeter
  • Operating systems: Windows, Linux
  • Cloud Platforms: Azure
  • Metrics and Monitoring: Splunk



  • Irving, United States Optomi Full time

    Optomi, in partnership with our client, is seeking an experienced SRE II to join their team for a 6‑month contract‑to‑hire opportunity that is 2 days hybrid onsite in Irving, TX. W2 only – no C2C/sponsorship at this time. We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building...


  • Irving, United States Optomi Full time

    This range is provided by Optomi. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $140,000.00/yr - $150,000.00/yr Site Reliability Engineer II | 6-month Contract to Hire | Hybrid (Irving, TX) | 2x onsite per week Optomi, in partnership with a leading financial services company, is seeking...


  • Irving, United States Wellfit Technologies Full time

    Overview Wellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can all access better care. As a healthcare fintech innovator, we’re transforming the patient journey and redefining what’s possible in dental care. This role: Site Reliability Engineer (SRE) with deep expertise in...


  • Irving, United States Rx Savings Solutions Full time

    Overview Site Reliability Engineer role at Rx Savings Solutions. In this role, you will be instrumental in ensuring the reliability, scalability, and performance of our critical healthcare technology systems. You will apply software engineering principles to operations, focusing on automation, monitoring, and proactive problem-solving to maintain high...


  • Irving, United States McKesson Corporation Full time

    McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well‑being of you and those we serve - we care. What you do at McKesson matters. We foster a...


  • Irving, United States Gartner Full time

    Site Reliability Engineer (Irving, TX) Join a world‑class team of skilled engineers who build creative digital solutions to support our colleagues and clients. We make a broad organizational impact by delivering cutting‑edge technology solutions that power Gartner. Gartner IT values its culture of nonstop innovation, an outcome‑driven approach to...


  • Irving, United States Gartner Full time

    A leading technology solutions provider in Irving, TX seeks a Site Reliability Engineer to enhance customer experiences through improved application reliability. This role involves measuring SLOs, incident response, and collaboration with development and operations teams. The ideal candidate has extensive IT experience, especially in DevOps and cloud...


  • Irving, United States OneMain Financial Full time

    We are looking for a highly skilled and experienced Site Reliability Engineering Team Lead to guide our SRE team, foster best practices, and ensure operational excellence across our infrastructure. Position Overview As the SRE Team Lead, you will be responsible for the technical leadership of a talented team of site reliability engineers dedicated to...


  • Irving, United States Wellfit Technologies Full time

    Base pay range $130,000.00/yr - $150,000.00/yr Location: Irving, TX Wellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can all access better care. As a healthcare fintech innovator, we’re transforming the patient journey and redefining what’s possible in dental care. About...


  • Irving, United States Gartner Full time

    Job Posting Title: Site Reliability Engineer (Irving, TX) About Gartner: Join a world-class team of skilled engineers who build creative digital solutions to support our colleagues and clients. We make a broad organizational impact by delivering cutting-edge technology solutions that power Gartner. Gartner IT values its culture of nonstop innovation, an...