Senior Staff Site Reliability Engineer

1 week ago


Dallas, TX, United States WEX Full time

About the Team & Role

We are looking for a highly motivated and high-potential Senior Staff Site Reliability Engineer (SRE) to join our team as a senior technical leader, driving transformational change and delivering significant business impact across WEX's platform ecosystem.

This is a truly exciting moment to be part of the SRE organization at WEX. Our sophisticated platforms support a broad spectrum of customer businesses and generate vast, complex telemetry and operational data. Reliability, scalability, and efficiency are critical to unlocking the full potential of our services and enabling business and customer value at scale.

As a Senior Staff SRE, you'll be at the forefront of defining and executing WEX's reliability engineering vision. You'll lead complex, cross-functional initiatives that elevate our observability, incident and problem management, automation, performance optimization, and capacity planning capabilities. You'll architect resilient systems, design proactive reliability strategies, and build the frameworks and tooling that power operational excellence across the company.

Beyond hands-on technical contributions, you'll serve as a strategic thought partner to engineering, product, and platform leadership-setting direction, influencing architecture, and embedding SRE principles across our development lifecycle. You'll mentor engineers across levels, scale best practices, and act as a catalyst for a culture of reliability, continuous improvement, and innovation.

We work with modern technologies, leverage AI and automation to drive smarter operations, and operate within an agile, product-centric engineering model.

If you're a visionary technical leader passionate about building reliable, scalable systems-and you're excited to make a lasting impact while growing your career-this is a rare and powerful opportunity.

How you'll make an impact

  • Set the vision and strategy for SRE across the organization.

  • Lead the development of innovative solutions to complex reliability challenges.

  • Represent the organization in industry forums and conferences.

  • Drive cultural transformation to prioritize reliability and operational excellence.

  • Build and maintain relationships with executive stakeholders.

  • Lead organization-wide reliability engineering initiatives.

  • Ensure business continuity, disaster recovery, and compliance.

Experience you'll bring

  • 12+ years of experience in SRE, DevOps, or software engineering leadership.

  • Deep experience in cloud-native architecture and large-scale distributed systems.

  • Deep knowledge of Kubernetes, service meshes, and distributed tracing.

  • Deep expertise in system design, automation, and performance optimization.

  • Advanced cloud automation experience (AWS, Azure, GCP).

  • Exceptional leadership skills with a proven track record of driving reliability improvements.

  • Experience with monitoring and logging (Grafana, ELK stack, Splunk, etc.).

  • Knowledge of containerization and orchestration (Docker, Kubernetes).

  • Strong understanding of database reliability engineering (MySQL, PostgreSQL, NoSQL).

  • Knowledge of networking, databases, and storage architectures.

  • Excellent incident command and crisis management skills.

  • Strong experience in regulatory and compliance frameworks.

  • Experience with systems with high availability and reliability equivalent to benefit platforms.

Preferred Qualification

  • Ability to influence C-level executives on reliability strategies.

  • Deep expertise in AI/ML for anomaly detection and predictive analytics in observability.

  • Proven ability to align SRE practices with business goals.

  • Strong financial acumen in cost optimization and cloud spending strategies.

  • Experience with identifying and solving significant problems for Site Reliability and Operations using AI.

  • Recognized thought leader with publications or conference talks in SRE.

  • Strong ability to build high-performing SRE teams and mentor engineering leaders.

  • Experience in healthcare, insurance, or benefits technology.

  • Understanding of Benefits domain such as claims processing, eligibility lookup success rate

  • Understanding of incident impact awareness on members and providers.

  • Experience working with compliance frameworks such as HIPAA, SOC 2, or HITRUST.

The base pay range represents the anticipated low and high end of the pay range for this position. Actual pay rates will vary and will be based on various factors, such as your qualifications, skills, competencies, and proficiency for the role. Base pay is one component of WEX's total compensation package. Most sales positions are eligible for commission under the terms of an applicable plan. Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. WEX's comprehensive and market competitive benefits are designed to support your personal and professional well-being. Benefits include health, dental and vision insurances, retirement savings plan, paid time off, health savings account, flexible spending accounts, life insurance, disability insurance, tuition reimbursement, and more. For more information, check out the "About Us" section. Pay Range: $150,000.00 - $199,000.00

  • Dallas, TX, United States Digital Realty Full time

    Job DescriptionPosition Title: Site Reliability Engineer, Interconnection Service and Network DeliveryLocation: Hybrid: Austin, Dallas, Boston, Ashburn, Atlanta, London, or AmsterdamYour role In this role, you will be responsible for deploying and maintaining all Digital Realty interconnection fabric network infrastructure. The ideal candidate can...


  • Dallas, TX, United States Olsson Full time

    Senior Civil Engineer - Site Design Dallas, TX; Fort Worth, TX; Oklahoma City, OK; Texas - Remote; Tulsa, OK Company Description We are Olsson, a team-based, purpose-driven engineering and design firm. Our solutions improve communities, and our people make it possible. Our most meaningful asset is our people, and we are dedicated to providing an environment...


  • Dallas, TX, United States Olsson Full time

    Senior Civil Engineer - Site Design Dallas, TX; Fort Worth, TX; Oklahoma City, OK; Texas - Remote; Tulsa, OK Company Description We are Olsson, a team-based, purpose-driven engineering and design firm. Our solutions improve communities, and our people make it possible. Our most meaningful asset is our people, and we are dedicated to providing an environment...


  • Dallas, TX, United States International Staff Consulting Full time

    Sr Software Engineer, PLC for Automation, Factory Integration, Hybrid -Dallas, TX Our mid-size client firm, a leader in its industry, is hiring a Senior Software Engineer to work on software solutions for large-scale automated aerospace assembly. The primary focus is on control system configuration and development, including microprocessor-based servo motion...


  • Dallas, TX, United States International Staff Consulting Full time

    Sr Software Engineer, PLC for Automation, Factory Integration, Hybrid -Dallas, TX Our mid-size client firm, a leader in its industry, is hiring a Senior Software Engineer to work on software solutions for large-scale automated aerospace assembly. The primary focus is on control system configuration and development, including microprocessor-based servo motion...


  • Dallas, TX, United States International Staff Consulting Full time

    Sr Software Engineer, PLC for Automation, Factory Integration, Hybrid -Dallas, TX Our mid-size client firm, a leader in its industry, is hiring a Senior Software Engineer to work on software solutions for large-scale automated aerospace assembly. The primary focus is on control system configuration and development, including microprocessor-based servo motion...

  • Reliability Engineer

    2 weeks ago


    Dallas, TX, United States TrinityRail Full time

    TrinityRail is searching for a Reliability Engineer to join our Railcar Fleet Engineering team at our corporate headquarters in Dallas, Texas. What you'll do: Analyze data from various quality inputs (including, but not limited to nonconformance reports, customer complaints, and internal quality data) to determine trends and identify areas for systemic...


  • Dallas, TX, United States TrinityRail Full time

    TrinityRail is searching for a Reliability Engineer to join our Railcar Fleet Engineering team at our corporate headquarters in Dallas, Texas. What you'll do: Analyze data from various quality inputs (including, but not limited to nonconformance reports, customer complaints, and internal quality data) to determine trends and identify areas for systemic...


  • Dallas, TX, United States Suncap Technology Full time

    Role: Site Reliability Engineer Location: Dallas, TX (Onsite)***2 Positions available Implement SRE practices Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate Work with Application teams to set up Observability, Telemetry Define what it means for a...


  • Dallas, TX, United States Suncap Technology Full time

    Role: Site Reliability Engineer Location: Dallas, TX (Onsite)***2 Positions available Implement SRE practices Identify, craft, and maintain SLIs and SLOs for teams, as well as metrics such as MTTR, Lead time for change, Deployment Frequency and Change Failure Rate Work with Application teams to set up Observability, Telemetry Define what it means for a...