Site Reliability Engineer

3 days ago


Austin, United States NeerInfo Solutions Full time

Role : Site Reliability Engineer Lead

Fulltime

Location : Austin TX (Onsite)


We are seeking a Site Reliability Engineer Lead. The position's primary responsibility is to manage a team of SREs who proactively ensure the stability, resilience and scale of our services through automation, testing and engineering. The role builds on expertise from product teams’ systems and operations, cloud infrastructure (AWS/GCP), build and release engineering, software development and stress/load testing to make sure our services are available, cost-optimized and fit for purpose early in the development lifecycle. The SRE Lead will also work alongside the development, architecture and service management teams to ensure that technical solutions align with architectural principles and deliver value to our customers, and that monitoring, logging and alerting are consistent. The SRE Lead is responsible for building capability and maturing operational ways of working across multiple cross-functional delivery teams, with a focus on technical excellence and a high-performance culture.


This position is based in Austin, TX. Candidates should be located within commuting distance or be willing to relocate to the area. This position may require relocation and/or travel to project locations.


U.S. citizens and those authorized to work in the U.S. are encouraged to apply.


Required Qualifications

  • Bachelor’s degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 4 years of IT industry experience.
  • Experience in DevOps, cloud platforms (any of PCF, AWS, GCP, Azure), and support.
  • Experience in automation using scripting or programming (Bash, PowerShell, or Python).
  • Experience in administration of ServiceNow, Harness, Jira, Bamboo and other Atlassian products.
  • Expert in logging and monitoring tools (Splunk, ThousandEyes, Prometheus, Grafana), including incorporating frameworks and instrumentation into C# code.
  • Highly proficient with Kubernetes, Terraform and AWS/GCP.

Preferred Skills:

  • At least 6 years of experience in DevOps, cloud platforms (any of PCF, AWS, GCP, Azure), and support
  • At least 6 years of experience in automation using scripting or programming (Bash, PowerShell, or Python)
  • Operational experience in maintaining applications
  • Strong leadership skills to ensure scrum teams and co-workers are motivated and engaged to deliver against a roadmap
  • Has significant experience in evolving practices and ways of working through multi-disciplinary teams, business frameworks and culture
  • Has strong project management background and experience in leading technology change programs
  • An individual who can perform at a high level in a multi-faceted role that calls for very strong technical knowledge and awareness of emerging trends
  • A very strong communicator, able to lead and facilitate discussions across functions like architecture, technical specialists, business analysis, team leaders, senior management group, and executives
  • Experience working with Windows and Linux Containers (focus currently on Windows)
  • Strong understanding of non-functional (NF) testing (performance, security, cost optimization, etc.)
  • Ability to get up to speed with domain knowledge



  • Austin, Texas, United States Unreal Gigs Full time

    Job Title: Site Reliability Engineer. At Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems. Key Responsibilities: Design and implement monitoring, logging, and alerting...


  • Austin, Texas, United States Oracle Full time

    Job Description: Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services. Key Responsibilities: Design, develop, and deploy automation tools to improve the efficiency and reliability of our cloud...


  • Austin, Texas, United States Cisco Full time

    About the Role: Cisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement automated solutions to improve the reliability and...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability Engineer. Thales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services. Key Responsibilities: Collaborate with project managers and service delivery managers to analyze traffic...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability Engineer. Thales is seeking an experienced Site Reliability Engineer to join our team in Austin, TX. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our CTE product line and solutions for deployment in various environments, including on-premises, multiple clouds, and big data and...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability Engineer. Thales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our cloud-based infrastructure and applications. Key Responsibilities: Collaborate with project managers and service delivery managers to analyze...


  • Austin, United States JobRialto Full time

    Skills: 6+ years of experience in systems and platform operations and technology; experience with On Prem and Public Cloud - AWS, EKS; scripting languages like Python; Linux Administration and Cloud, DevOps experience would be a plus. Team: As a member of the Site Reliability Engineering & Production Services team, you will work with other technology...


  • Austin, Texas, United States Apple Full time

    About the Role: We are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers. Key Responsibilities: Collaborate with...


  • Austin, Texas, United States Cisco Full time

    About the Role: Cisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement automated solutions to improve infrastructure stability and scalability. Collaborate with...


  • Austin, Texas, United States JobRialto Full time

    About the Role: We are seeking a highly motivated and experienced Systems and Platform Operations Expert to join our Site Reliability Engineering & Production Services team. As a member of this team, you will work closely with other technology professionals to support Asset Management Technology - Cloud Platform solutions. Key Responsibilities: Provide level 2...


  • Austin, Texas, United States Apple Full time

    Site Reliability Engineering Manager. At Apple, we're committed to delivering exceptional customer experiences through innovative products and services. As a Site Reliability Engineering Manager, you'll play a critical role in ensuring the reliability and scalability of our cloud services. Key Responsibilities: Lead a team of SRE engineers in establishing and...


  • Austin, Texas, United States Terminal Industries Full time

    About Us: Terminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last...


  • Austin, Texas, United States Info Way Solutions Full time

    Splunk Administration and SRE Expertise. We are seeking a highly skilled Splunk administrator with strong expertise in Site Reliability Engineering (SRE) and DevOps to join our team at Info Way Solutions. Key Responsibilities: Administer and optimize Splunk infrastructure for maximum performance and efficiency. Develop and implement SRE practices to ensure high...


  • Austin, Texas, United States Terminal Industries Full time

    About Us: Terminal Industries is a software company that leverages machine learning to digitize, index, and automate the yard. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. Overview: Our world-class vision engineering team has built an engine that can process...


  • Austin, Texas, United States Publishing Full time

    Job Description: At Publishing, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure to support our growing business. Responsibilities: Design and implement scalable cloud...


  • Austin, Texas, United States Apple Full time

    Role Summary: Apple is seeking a talented Site Reliability Engineer to ensure the reliability, scalability, and performance of our systems and services. As an SRE, you will work closely with our engineering and operations teams to design, build, and maintain robust infrastructure and automation solutions. Key Responsibilities: Design and implement scalable...


  • Austin, United States NeerInfo Solutions Full time

    Role: Site Reliability Engineer Lead. Full time. Location: Austin TX (Onsite). Seeking Site Reliability Engineer Lead. This position's primary responsibility will be to manage a team of SREs to proactively ensure the stability, resilience and scale of our services by automation, testing and engineering. To build on expertise from product teams’...


  • Austin, Texas, United States Weedmaps Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based services. Key Responsibilities: Leverage your engineering expertise to build, monitor, and improve our...