Site Reliability Engineer

6 hours ago


Chicago, Illinois, United States Meta IT North America Full time

What are we looking for?
The Site Reliability Engineer (SRE) position is a software development-oriented role, focusing heavily on coding, automation, and ensuring the stability and reliability of our global platform. The ideal candidate will primarily be a skilled software developer capable of participating in on-call rotations. The SRE team develops sophisticated telemetry and automation tools, proactively monitoring platform health and executing automated corrective actions. As guardians of the production environment, the SRE team leverages advanced telemetry to anticipate and mitigate issues, ensuring continuous platform stability.

What Will You be Doing?

  • Develop and maintain advanced telemetry and automation tools for monitoring and managing global platform health.
  • Actively participate in on-call rotations, swiftly diagnosing and resolving system issues and escalations from the customer support team (this is not a customer-facing role).
  • Implement automated solutions for incident response, system optimization, and reliability improvement.
  • Provide operational support for backend services and Kafka producers/consumers written in Python running on ECS.
  • Full-Stack Troubleshooting: Support, debug, and enhance the entire application stack, from our frontend to our Python backend services (Flask, Litestar, Celery, ESK, MSK)
  • Hands-on experience building and/or supporting applications written with Must have professional experience building and/or supporting applications with Effectively troubleshoot issues between the frontend UI and backend APIs.

What Will You Bring to the Table?

  • Minimum 3 years of experience with Python
  • Experience with Icinga2, Prometheus, or Splunk a plus
  • Experience with AWS a plus
  • Solid understanding of functional programming, object oriented programming and computer science foundations
  • Good understanding of backend and server side components
  • Ability to work on-call rotation for support with global team members on a semi-frequent basis
  • Proven and strong communication skills
  • Must be self-directed, flexible and have the ability to prioritize and handle multiple projects simultaneously
  • Experience working in an Agile environment a plus

Location of this position: Hybrid in
Chicago/US

Why build your carrer at Meta?
We offer autonomy, clear goals and a dynamic and challenging environment, where professionals have the opportunity to interact with different technologies, participate in all types of projects, bring new ideas and work from anywhere in Brazil and (why not?) anywhere in the world. In addition, we are one of the best companies to work for in Brazil according to Great Place to Work and one of the 10 fastest growing technology companies in the country for 3 consecutive years, according to Anuário Informático Hoje.

What are our values?

  • We are people serving people
  • We all think and act like owners
  • We are hungry for performance
  • We grow and learn together
  • We pursue excellence and simplicity
  • We have innovation and creativity in our DNA

All people are welcome regardless of their condition, disability, ethnicity, religious belief, sexual orientation, appearance, age or others. We want you to grow up with us in a welcoming environment full of opportunities.

Did you relate? Then, #ComeBeMeta



  • Chicago, Illinois, United States Qorali Full time

    Site Reliability Engineer – Cloud & AutomationLocation:ChicagoVisa Sponsorship:Not availableA technology-driven organization is seeking an experiencedSite Reliability Engineerto support and enhance the reliability of its next-generation platform. The role focuses on automation, cloud infrastructure, and system performance.Key Responsibilities:Ensure...


  • Chicago, Illinois, United States CADDi Full time $100,000 - $150,000

    For security reasons, the candidate must be a US Citizen, or a Permanent Resident (Green Card)OverviewAs a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform with special attention to safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. You'll have strong ownership of US...


  • Chicago, Illinois, United States CADDi Full time

    For security reasons, the candidate must be a US Citizen, or a Permanent Resident (Green Card)OverviewAs a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform with special attention to safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. You'll have strong ownership of US...


  • Chicago, Illinois, United States Acquire Me Full time

    Site Reliability Engineer / SRE / Production EngineeringMy client is a renowned scientific led quantitative trading firm who are made up of Computer Scientists, Technologists, and Academics who pool their cumulative experience to develop creative solutions that tackle some of the biggest questions in finance. They pair this expertise with machine learning,...


  • Chicago, Illinois, United States Enova International Full time

    We are interested in every qualified candidate who is eligible to work in the United States. However, we are not able to sponsor visas or take over sponsorship at this time.About the Role:Resilience Engineering is a subset of the Site Reliability Engineering team that strives to foster a culture of continuous improvement through incident analysis, process...


  • Chicago, Illinois, United States Storm2 Full time $140,000 - $200,000 per year

    Senior Site Reliability EngineerLocations:Scottsdale, AZ | Chicago, IL | New York, NYType:Full-time | HybridSalary - 140, , % bonusAbout Our ClientOur client is a rapidly growing technology company at the forefront of digital infrastructure innovation. They partner with leading organizations to deliver secure, scalable, and high-performance platforms that...


  • Chicago, Illinois, United States Ahold Delhaize USA Full time

    Category/Area of Expertise:IT & TechnologyJob Requisition:436142Address:USA-IL-Chicago-300 South Riverside PlazaStore Code:Development Ahold Delhaize USA, a division of global food retailer Ahold Delhaize, is part of the U.S. family of brands, which includes five leading omnichannel grocery brands - Food Lion, Giant Food, The GIANT Company, Hannaford and...


  • Chicago, Illinois, United States Information Technology Senior Management Forum Full time $175,800 - $220,700 per year

    Posted Date10/28/2025DescriptionManager, Site Reliability Engineer (Global Payment Network)Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and...


  • Chicago, Illinois, United States WEX Full time

    About The Team & RoleWe are looking for a highly motivated and high-potential Senior Staff Site Reliability Engineer (SRE) to join our team as a senior technical leader, driving transformational change and delivering significant business impact across WEX's platform ecosystem.This is a truly exciting moment to be part of the SRE organization at WEX. Our...


  • Chicago, Illinois, United States Trading Technologies Full time

    Application Deadline:27 November 2026Department:EngineeringLocation:ChicagoCompensation:$120,000 - $165,000 / yearDescriptionThe Site Reliability Engineer (SRE) position is a software development-oriented role, focusing heavily on coding, automation, and ensuring the stability and reliability of our global platform. The ideal candidate will primarily be a...