Site Reliability Engineer

2 weeks ago


Austin, United States Interactive Resources - iR Full time

Get AI-powered advice on this job and more exclusive features.Our client is seeking a highly motivated and skilled Site Reliability Engineer (SRE) to join their Advisor Platform Engineering team. This critical position focuses on maintaining the availability, performance, and scalability of a mission-critical Azure-hosted platform serving thousands of financial professionals nationwide.As an individual contributor, you’ll leverage your growing expertise in cloud infrastructure, automation, and observability to improve and support platform reliability. You’ll work hand-in-hand with Agile development teams, integrating reliability practices throughout the application lifecycle and driving meaningful improvements.This role is ideal for someone who thrives on solving complex infrastructure challenges and enjoys working with cutting-edge cloud technologies.This is an FTE Direct hire opportunity. You will be working onsite with the team in Austin, TexasWhat you get to go do in this exciting role:Azure Infrastructure Management: Oversee the performance, availability, and capacity of key Azure services including VMs, App Services, Function Apps, Container Apps, Azure SQL, Cosmos DB, and more.Enhance Observability: Define and refine SLIs/SLOs, configure monitoring, logging, and alerts using Azure Monitor, Application Insights, and Log Analytics (KQL).Automation & Tooling: Eliminate manual processes by developing automation scripts and tools using PowerShell, Bash, Python, and optionally C#/.NET.Incident Management: Take part in a rotating on-call schedule, leading incident resolution, root cause analysis, and implementing post-incident improvements.Cross-Team Collaboration: Partner with developers, QA, and tech teams throughout the SDLC to ensure performance and reliability goals are met.Capacity & Performance: Contribute to system load testing, performance tuning, and capacity planning, especially within a .NET/React microservices architecture.Documentation & Knowledge Sharing: Maintain system documentation, runbooks, FAQs, and mentor peers in SRE practices.Integration Support: Troubleshoot API integrations, SSO setups, and secure file transfer protocols.Continuous Improvement: Contribute ideas and solutions to improve automation, cost efficiency, security posture, and reliability processes.What you need to be successful in this role:Strong hands-on experience with Azure cloud services, including IaaS and PaaS (networking, compute, storage, databases, messaging, security).Deep knowledge of Azure observability tools and KQL for metrics/log analysis.Proficient in scripting languages such as PowerShell, Bash, or Python.Experience with CI/CD practices and tools (preferably Azure DevOps).Solid understanding of Git workflows and platforms (e.g., Azure Repos, GitHub).Strong foundation in networking concepts (DNS, HTTP/HTTPS, TLS, firewalls, etc.).Skilled in diagnosing complex, distributed system issues.Strong communicator with collaborative mindset; effective independently or in a team.Familiarity with Agile methodologies and Scrum practicesBachelor’s degree in Computer Science, Information Technology, Engineering, or related field—or equivalent hands-on experience.2–5 years of experience in Site Reliability Engineering, DevOps, Systems Administration, or a related field with operational focus.Prior experience supporting production systems in Azure is strongly preferred.Proven ability to implement observability practices and contribute to on-call operations.Track record of successful automation and operational improvement.Background in financial services or other regulated industries is a plus.Certifications (Preferred but not required)ITIL v4 Foundation or equivalent service-management credentialOther relevant cloud or infrastructure certificationsSeniority levelSeniority levelMid-Senior levelEmployment typeEmployment typeFull-timeJob functionIndustriesStaffing and RecruitingReferrals increase your chances of interviewing at Interactive Resources - iR by 2xInferred from the description for this jobMedical insuranceVision insurance401(k)Get notified when a new job is posted.Sign in to set job alerts for “Site Reliability Engineer” roles.Austin, TX $126,400.00-$222,200.00 5 days agoSite Reliability Engineer (SRE, Remote US)Austin, TX $120,000.00-$160,000.00 2 months agoAustin, TX $168,000.00-$322,000.00 2 weeks agoProduct Engineer, Cloud Compute and StorageAustin, Texas Metropolitan Area 1 day agoAustin, TX $198,000.00-$250,000.00 1 week agoSite Reliability Engineer (Remote, US-based)Site Reliability Engineer (Intermediate or Senior)Senior Site Reliability Engineer, HPC and LSFSite Reliability Engineer with KubernetesSenior Site Reliability Engineer - InfrastructureAustin, TX $148,000.00-$287,500.00 2 weeks agoAustin, TX $150,000.00-$170,000.00 4 months agoAustin, TX $85,000.00-$95,000.00 4 days agoSr. Site Reliability Engineer, Energy SoftwareSenior Site Reliability Engineer - FulltimeWe’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr



  • Austin, United States Cafell Technologies Full time

    Senior Manager - Recruitment / Client Relations Role: Site Reliability Engineer (SRE) – Onsite Experience: 7 to 9 years. Job Description As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, you will be responsible for operating infrastructure solutions, following the principles and...


  • Austin, Texas, United States Cafell Technologies Full time $80,000 - $150,000 per year

    Dear Applicant,Role: Site Reliability Engineer (SRE) - OnsiteLocation: Columbus OH / Austin/ Charlotte NC(Full Time) Visa Type: USC/GC preferred. H1b/H4EAD acceptedExperience - 7 to 9 yrsJob Description:Position Summary:As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, you will be...


  • Austin, United States SecurityScorecard Full time

    Join to apply for the Principal Site Reliability Engineer role at SecurityScorecard1 week ago Be among the first 25 applicantsJoin to apply for the Principal Site Reliability Engineer role at SecurityScorecardThis range is provided by SecurityScorecard. Your actual pay will be based on your skills and experience — talk with your recruiter to learn...


  • Austin, United States Paradromics, Inc. Full time

    Site Reliability Engineer About Paradromics Brain-related illness is one of the last great frontiers in medicine, not because the brain is unknowable, but because it has been inaccessible. Paradromics is building a brain-computer interface (BCI) platform that records brain activity at the highest possible resolution: the individual neuron. AI algorithms then...


  • Austin, TX, United States Leo Tech Services Full time

    At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help the fight against continuing criminal enterprises, drug trafficking organizations, identifying financial fraud, disrupting sex and human trafficking rings and focusing on mental health matters to name a few....


  • Austin, TX, United States Apptronik Full time

    Apptronik is building robots for the real world to improve human quality of life and to help solve the ever-increasing labor shortage problem. Our team has been building some of the most advanced robots on the planet for years, dating back to the DARPA Robotics Challenge. We apply our expertise across the full robotics stack to some of the most important and...


  • Austin, United States Doyle Security Services, Inc. (DSS) Full time

    Cloud Engineer – Site Reliability Engineer Join to apply for the Cloud Engineer‑Site Reliability Engineer role at Doyle Security Services, Inc. (DSS) Location: 100% Remote Salary: $130,000 – $150,000 (based on experience) Responsibilities Design and manage complex engineering and integration of the application, security, and infrastructure...


  • Austin, Texas, United States Fathom Management LLC Full time $130,000 - $150,000 per year

    Cloud Engineer-Site Reliability EngineerWe are working aggressively with the customers to assess and migrate IT systems into cloud-based environments (Microsoft Azure, Amazon Web Services and others) as well as procure and implement new technology to replace legacy systems. The Site Reliability Engineer is a member of the group of technologists who are...


  • Austin, United States Visa Full time

    Company DescriptionVisa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...


  • Austin, United States Thales Full time

    Location: Austin, United States of AmericaThales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...