Site Reliability Engineer
3 days ago
McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care.
What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patients, our communities, and our people. If you want to be part of tomorrow's health today, we want to hear from you.
Rx Savings Solutions (RxSS), part of McKesson's CoverMyMeds business segment, is seeking a talented Site Reliability Engineer (SRE) to join our team In this role, you will be instrumental in ensuring the reliability, scalability, and performance of our critical healthcare technology systems. You will apply software engineering principles to operations, focusing on automation, monitoring, and proactive problem-solving to maintain high availability and deliver exceptional user experiences.
* Our preferred candidate will reside in Columbus, OH, or one of our other hub locations of Overland Park KS, Irving TX or Atlanta GA. Position allows for primarily working from home, with occasional in-office time. We may consider a well-qualified candidate based not located in one of the above hub areas.
* At this time, we are not able to offer sponsorship for employment visas. We're unable to consider individuals currently on H1B, F-1 OPT, STEM OPT, or any other visa status that would require future sponsorship. Candidates must be authorized to work in the United States on a permanent basis without the need for current or future sponsorship.
Job Responsibilities:
* System Reliability & Performance: Design, implement, and maintain robust and scalable infrastructure and applications to ensure high availability, performance, and disaster recovery capabilities
* Automation & Tooling: Develop and implement automation scripts, tools, and processes to streamline operational tasks, reduce manual effort, and improve efficiency across the software development lifecycle
* Monitoring & Alerting: Establish and maintain comprehensive monitoring, alerting, and logging systems to proactively identify and diagnose issues, understand system behavior, and track key performance indicators
* Incident Response & Post-Mortem: Participate in on-call rotations, respond to and resolve critical incidents, and conduct thorough post-mortems to identify root causes and implement preventative measures
* Capacity Planning & Optimization: Collaborate with development teams to analyze system capacity, forecast future needs, and optimize resource utilization to support business growth
* Collaboration & Mentorship: Work closely with software engineers, product managers, and other SREs to promote a culture of reliability, share best practices, and contribute to continuous improvement
* Documentation: Create and maintain clear and concise documentation for systems, processes, and incident runbooks
* Security: Contribute to the implementation and enforcement of security best practices within our infrastructure and applications
Job Qualifications:
* Education / Experience: Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience, and 2+ years of experience in a Site Reliability Engineering, DevOps, or highly related software engineering role
* Programming Skills: Strong proficiency in at least one scripting language (e.g., Python, Go, Ruby, Bash) for automation and tool development
* Cloud Platforms: Hands-on experience with cloud computing platforms (e.g., AWS, Azure, GCP). AWS experience is highly preferred
* Containerization & Orchestration: Experience with container technologies (e.g., Docker) and container orchestration platforms (e.g., Kubernetes)
* CI/CD: Familiarity with Continuous Integration and Continuous Delivery (CI/CD) pipelines and tools
* Monitoring & Alerting Tools: Experience with monitoring and observability tools (e.g., Datadog, Prometheus, Grafana, Splunk)
* Operating Systems: Strong understanding of Linux/Unix operating systems
* Networking: Fundamental understanding of networking concepts (TCP/IP, DNS, HTTP, Load Balancing)
* Problem-Solving: Excellent analytical and problem-solving skills with a proactive approach to identifying and resolving complex technical issues
* Communication: Strong verbal and written communication skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences
We are proud to offer a competitive compensation package at McKesson as part of our Total Rewards. This is determined by several factors, including performance, experience and skills, equity, regular job market evaluations, and geographical markets. The pay range shown below is aligned with McKesson's pay philosophy, and pay will always be compliant with any applicable regulations. In addition to base pay, other compensation, such as an annual bonus or long-term incentive opportunities may be offered. For more information regarding benefits at McKesson, please click here.
Our Base Pay Range for this position
$84,300 - $140,500
McKesson is an Equal Opportunity Employer
McKesson provides equal employment opportunities to applicants and employees and is committed to a diverse and inclusive environment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, age or genetic information. For additional information on McKesson's full Equal Employment Opportunity policies, visit our Equal Employment Opportunity page.
Join us at McKesson
-
Site Reliability Engineer
2 weeks ago
Columbus, OH, United States Rx Savings Solutions Full timeMcKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a...
-
Columbus, OH, United States General Motors Full timeJob Description The Role The rapid adoption of advanced software in vehicles marks a new era for automakers and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General motors, you'll join a dedicated team focused on enhancing the reliability, efficiency, and scalability of our distributed systems. We...
-
Network Reliability Engineer
2 weeks ago
Columbus, OH, United States GovCIO Full timeOverview GovCIO is currently hiring for Network Reliability Engineer to support our client's contract needs.The Network Reliability Engineer will support, maintain, optimize, monitor, and participate in troubleshooting efforts for a mature network environment for a large Government Agency.This position is located in the within the United States and is fully...
-
Network Reliability Engineer
3 days ago
Columbus, OH, United States GovCIO Full timeOverview GovCIO is currently hiring for Network Reliability Engineer to support our client's contract needs.The Network Reliability Engineer will support, maintain, optimize, monitor, and participate in troubleshooting efforts for a mature network environment for a large Government Agency.This position is located in the within the United States and is fully...
-
Remote Site Civil Engineer
2 weeks ago
Columbus, OH, United States Actalent Full timeDescription This engineering consulting firm was founded in 2024 and has already grown a significant backlog of work. There is currently no brick & mortar location, but there are plans to establish one in the coming years. This role will be 100% remote, with preference on Central Ohio. All candidates MUST be in Ohio. Responsibilities: Develop detailed...
-
Civil/Site Land Development Engineer
2 weeks ago
Columbus, OH, United States Colliers Engineering & Design Full timeOverview Design. Develop. Deliver. Shape the Future of Land Development in Columbus! Colliers Engineering & Design is seeking a passionate and driven Project Engineer to join our Land Development team in Columbus, OH! At Colliers Engineering & Design, we’re more than engineers—we’re strategic partners helping clients bring their visions to life. From...
-
Civil/Site Land Development Engineer
1 week ago
Columbus, OH, United States Colliers Engineering & Design Full timeOverview Design. Develop. Deliver. Shape the Future of Land Development in Columbus! Colliers Engineering & Design is seeking a passionate and driven Project Engineer to join our Land Development team in Columbus, OH! At Colliers Engineering & Design, we’re more than engineers—we’re strategic partners helping clients bring their visions to life. From...
-
Project Engineer
1 week ago
Columbus, OH, United States American Structurepoint Full timeProject Engineer - Civil Site - Columbus OHJob Locations US-OH-ColumbusJob ID 2025-2580Category/Group Civil GroupEmployment Type Regular Full-TimeOverviewJoin American Structurepoint and become part of a team that goes the extra mile for our clients and communities. We live by our values - respect, staff development, results and family. Our team is...
-
Project Engineer
7 days ago
Columbus, OH, United States American Structurepoint Full timeProject Engineer - Civil Site - Columbus OHJob Locations US-OH-ColumbusJob ID 2025-2580Category/Group Civil GroupEmployment Type Regular Full-TimeOverviewJoin American Structurepoint and become part of a team that goes the extra mile for our clients and communities. We live by our values - respect, staff development, results and family. Our team is...
-
Principal Network Reliability Engineer
1 week ago
Columbus, OH, United States Oracle Full timeJob Description The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our customers to drive operational excellence in OCI networks at scale. By focusing on both reactive and proactive functions, we aim to minimize downtime, quickly resolve incidents, and continuously enhance...