Site Reliability Engineer
2 days ago
Optomi, in partnership with our client, are seeking an experienced SRE II to join their team for a 6 month contract to hire opportunity that is 2 days hybrid onsite in Irving, TX. W2 only - no C2C/sponsorship at this time. We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building resilient, scalable, and automated systems—not traditional production support. The ideal candidate has hands-on engineering experience across cloud infrastructure, observability, automation, and reliability-focused development. You will work closely with development, cloud engineering, and platform teams to ensure high availability, optimal performance, and operational excellence of critical customer-facing applications. Key Responsibilities Contribute directly to the reliability, scalability, performance, and security of critical applications. Build reusable services, automation, and frameworks that improve platform stability and developer velocity. Cloud & Platform Engineering Design and enhance cloud infrastructure using Azure services including: Azure Service Bus Event Hub Azure SQL AKS (Azure Kubernetes Service) Function Apps App Services Implement and manage Infrastructure as Code (IaC) using Terraform. Containerization & Orchestration Build and deploy containerized applications using Docker (2–3+ years). Support Kubernetes workloads via AKS, including scaling, upgrades, and cluster reliability improvements. Development & DevOps Collaborate with development teams using a working knowledge of .NET. Improve CI/CD workflows using Azure DevOps (ADO). Monitoring, Observability & Incident Response Implement and optimize monitoring and alerting strategies. Use Splunk Observability Cloud (preferred) or equivalent observability platforms to enhance visibility and reduce MTTR. Drive proactive incident identification, root-cause analysis, and long-term fixes. Performance, Reliability & Scalability Enhancements Design and implement SLOs, SLIs, and error budgets. Develop auto-scaling policies, failover strategies, and disaster recovery procedures. Optimize application and database performance to ensure reliability across high-traffic, mission-critical systems. Required Qualifications 3–5+ years of hands-on SRE experience Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent experience) Master’s degree preferred Hands-on experience with: Azure Cloud (AKS, Service Bus, Event Hub, SQL, Function Apps, App Services) Terraform Docker Azure DevOps Monitoring tools (Splunk Observability Cloud preferred) .NET ecosystem (understanding of development fundamentals) Preferred Skills Experience designing resilient, distributed systems Strong troubleshooting and analytical skills Performance tuning across applications, databases, and cloud services Experience improving uptime, latency, throughput, or cost efficiency of production applications Familiarity with SRE principles and modern operational practices
-
Site Reliability Engineering
11 hours ago
Irving, Texas, United States OneMain Financial Full timeWe are looking for a highly skilled and experienced Site Reliability Engineering Team Lead to guide our SRE team, foster best practices, and ensure operational excellence across our infrastructure.Position OverviewAs the SRE Team Lead, you will be responsible for the technical leadership of a talented team of site reliability engineers dedicated to...
-
Site Reliability Engineer
4 days ago
Irving, United States Optomi Full timeOptomi, in partnership with our client, are seeking an experienced SRE II to join their team for a 6 month contract to hire opportunity that is 2 days hybrid onsite in Irving, TX. W2 only - no C2C/sponsorship at this time.We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building...
-
Site Reliability Engineer
2 hours ago
Irving, United States Optomi Full timeOptomi, in partnership with our client, are seeking an experienced SRE II to join their team for a 6 month contract to hire opportunity that is 2 days hybrid onsite in Irving, TX. W2 only - no C2C/sponsorship at this time.We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building...
-
Site Reliability Engineer
3 weeks ago
Irving, United States Wellfit Technologies Full timeOverviewWellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can all access better care. As a healthcare fintech innovator, we’re transforming the patient journey and redefining what’s possible in dental care.This role: Site Reliability Engineer (SRE) with deep expertise in...
-
Site Reliability Engineer
3 weeks ago
Irving, United States Tata Consultancy Services Full timeOverviewJoin to apply for the Site Reliability Engineer role at Tata Consultancy Services.Be among the first applicants in Irving, TX.ResponsibilitiesEnsure high availability, scalability, and reliability of OpenShift clusters across production and non-production environments.Monitor system performance, resource utilization, and proactively address...
-
Site Reliability Engineer
1 week ago
Irving, Texas, United States InfoVision Inc. Full time $100,000 - $120,000 per yearSite Reliability Engineer (SRE)We're looking for anSRE with strong DevOps DNA— not just someone to run pipelines, but someone whoownsreliability, automation, and innovation.Key Must-Haves:Proven SRE mindset — find issues, automate, and improve without waiting for instructions.Deep AWS experience: Autoscaling, Security Groups, Route53, S3, IAM.Strong in ...
-
Site Reliability Engineer
2 weeks ago
Irving, Texas, United States CellPoint Digital Full time $120,000 - $180,000 per yearJoin CellPoint Digital: Shape the Future of Payments with UsAt CellPoint Digital, we're revolutionizing the way businesses in the air, travel, and hospitality sectors manage their payments.With our Leading Payment Orchestration Platform, we're turning payments into a strategic advantage, helping clients optimize their payment experience to boost profits,...
-
Site Reliability Engineer
4 weeks ago
Irving, TX, United States The Judge Group Full timeAbout the Role: Our client is seeking a Site Reliability Engineer (SRE) with deep expertise in monitoring, debugging, and optimizing Azure App Services. This role is critical in ensuring our platforms remain reliable, performant, and scalable as we continue to grow. If you thrive at the intersection of infrastructure, development, and performance, this is...
-
Site Reliability Engineer
1 week ago
Irving, United States HER Ontada, LLC Full timeWe are seeking a Site Reliability Engineer to join our team and help ensure the reliability, scalability, and performance of our systems. This role combines software engineering and systems administration to build and maintain resilient infrastructure and automation for our applications.Key ResponsibilitiesDesign, implement, and maintain monitoring solutions...
-
Site Reliability Engineer
2 weeks ago
Irving, TX, United States The Judge Group Full timeThe Judge Group, a Technology, Talent & Learning Solutions company based in Wayne, PA, that helps professionals find top jobs with the nation’s leading brands. We’re looking to hire a Site Reliability Engineer (Azure App Services) for a Full-Time, permanent position based in Irving, TX. We are looking for a Site Reliability Engineer (SRE) with strong...