Site Reliability Engineer Lead

3 weeks ago


Austin, United States Infosys Full time
Job Description

Infosys is seeking Site Reliability Engineer Lead. This position's primary responsibility will be to manage a team of SREs to proactively ensure the stability, resilience and scale of our services by automation, testing and engineering. To build on expertise from product teams' systems/operations, cloud infrastructure (AWS/GCP), build and release engineering, software development and stress/load testing to make sure our services are available, cost optimized and fit for purpose early in the development lifecycle. The SRE Lead will also work alongside the development, architecture and service management teams, to ensure technical solutions are aligned to architectural principles, that deliver value to our customers as well as ensuring consistent monitoring, logging and alerting. The SRE Lead is responsible for building capability and maturing operational ways of working across multiple cross-function delivery teams, with focus on technical excellence and a high-performance culture.

This position is based in Austin TX. Candidate should be located within commuting distance or be willing to relocate to this area. This position may require relocation and or travel to project locations.

U.S. citizens and those authorized to work in the U.S. are encouraged to apply.

Required Qualifications
  • Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 4 years of IT industry experience.
  • Experience in DevOps, Cloud experience (any of PCF, AWS, GCP, Azure), support experience
  • Experience in automation using Scripting/Programming knowledge (bash, PowerShell, or python).
  • Experience in administration of ServiceNow, Harness, Jira, Bamboo and other Atlassian products.
  • Expert in Logging and Monitoring tools (Splunk, ThousandEyes, Prometheus, Grafana), incorporating frameworks and instrumentations into C# code.
  • Highly proficient with Kubernetes, Terraform and AWS/GCP.
Preferred Skills:
  • Atleast 6 years of experience in DevOps, Cloud experience (any of PCF, AWS, GCP, Azure), support experience
  • Atleast 6 years of experience in automation using Scripting/Programming knowledge (bash, PowerShell, or python)
  • Operational experience in maintaining applications
  • Strong leadership skills to ensure scrum teams and co-workers are motivated and engaged to deliver against a roadmap
  • Has significant experience in evolving practices and ways of working through multi-disciplinary teams, business frameworks and culture
  • Has strong project management background and experience in leading technology change programs
  • An individual who can perform highly in a multi-faceted role - facets that include a very strong technical knowledge, and awareness of emergent trends
  • A very strong communicator, able to lead and facilitate discussions across functions like architecture, technical specialists, business analysis, team leaders, senior management group, and executives
  • Experience working with Windows and Linux Containers (focus currently on Windows)
  • High understanding in NF testing (Performance, Security, Cost Optimization etc)
  • Ability to get up to speed with domain knowledge

The job entails sitting as well as working at a computer for extended periods of time. Should be able to communicate by telephone, email or face to face. Travel may be required as per the job requirements.

EEO/About Us

About Us
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem.

Infosys provides equal employment opportunities to applicants and employees without regard to race; color; sex; gender identity; sexual orientation; religious practices and observances; national origin; pregnancy, childbirth, or related medical conditions; status as a protected veteran or spouse/family member of a protected veteran; or disability.

  • Austin, Texas, United States Infosys Full time

    Position Overview:Infosys is in search of a Lead Engineer for Site Reliability. This role's primary focus will be to oversee a team of Site Reliability Engineers (SREs) to proactively guarantee the stability, resilience, and scalability of our services through automation, testing, and engineering practices.Key Responsibilities:The successful candidate will...


  • Austin, Texas, United States Expedia Group Full time

    Principal Site Reliability EngineerWe are looking for a highly qualified and seasoned Principal Site Reliability Engineer (SRE) to enhance our operations. The successful candidate will play a crucial role in guaranteeing the stability, scalability, and efficiency of our systems and services. You will collaborate closely with both development and operational...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are looking for a highly proficient and seasoned Principal Software Development Engineer (SRE) to enhance our team. The successful candidate will be accountable for maintaining the reliability, scalability, and performance of our systems and services. You will collaborate closely with both...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and services.Key ResponsibilitiesDesign, build, and maintain robust infrastructure and automation solutionsWork closely with...


  • Austin, Texas, United States Expedia Group Full time

    Principal Software Development Engineer - Site ReliabilityWe are in search of a highly qualified and seasoned Principal Software Development Engineer (SRE) to enhance our operations. The ideal candidate will be tasked with ensuring the dependability, scalability, and efficiency of our services and systems. You will collaborate closely with both development...


  • Austin, Texas, United States Apex Systems Full time

    Job DescriptionPosition: Site Reliability EngineerLocation: RemoteDuration: 1 yearRate: $67/hr W-2We are seeking a highly skilled Site Reliability Engineer to join our team at Apex Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...


  • Austin, Texas, United States Cape Henry Associates, Acquired by JANUS Research Group Full time

    Janus is looking for a seasoned Site Reliability Engineer / DevSecOps Developer to help grow our capability with our DoD clients.Develop Infrastructure as Code (IaC) designing, implementing, and maintaining infrastructure using IaC technologies(e.g. terraform or similar) ensuring scalable, reliable, and efficient platformsCollaborate with data and other...


  • Austin, United States JobRialto Full time

    Skills: 6+ years of experience in systems and platform operations and technology Experience with On Prem and Public Cloud - AWS, EKS Scripting languages like Python Linux Administration and Cloud, DevOps experience would be a plus Team As a member of the Site Reliability Engineering & Production Services team, you will work with other technology...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineering Manager to join our team at Apple. As a Site Reliability Engineering Manager, you will be responsible for leading a team that provides the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineering Manager to join our Apple Service Engineering team. As a key member of our team, you will be responsible for establishing and maintaining the reliability and scalability of our cloud services.Key ResponsibilitiesLead a team of engineers in providing a platform for mission-critical...


  • Austin, United States Computer Futures Full time

    Position Summary: We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our client in Austin. The ideal candidate will have a strong background in infrastructure as code (IaC), automation, container orchestration, and monitoring solutions. As an SRE, you will play a critical role in ensuring the reliability, scalability, and...


  • Austin, Texas, United States NinjaOne Full time

    About the RoleAt NinjaOne we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineering Manager to join our Platform Engineering team and help us scale our products to millions of end-users. You will have the opportunity to build the SRE team from the ground up...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, Texas, United States Thales Full time

    About the RoleThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.Key ResponsibilitiesCollaborate with project managers and service delivery managers to analyze traffic trends and capacity...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...

  • Software Engineer

    3 days ago


    Austin, United States Apple Full time

    Carrier Services offer seamless integration of Apple Retail Stores and Apple Online store with major US Carriers for iPhone activations. We are looking for a talented Site Reliability Engineer to join our growing team. As an SRE, you will be responsi Engineer, Software Engineer, Liability, Reliability Engineer, Retail, Reliability, Technology


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...


  • Austin, United States Terminal Industries Full time

    About Us Terminal builds software that digitizes, indexes, and automates the yard, leveraging best-in-class machine learning. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers and personnel. These are the fundamental operating assets of commerce - and represent the last...