Senior Reliability Engineering Specialist

2 weeks ago


San Antonio Texas, United States HEB Full time

Key Responsibilities:


At H-E-B, our digital solutions are rapidly gaining traction, and our Digital Technology teams thrive in a dynamic environment, acquiring new skills and applying them to tackle significant business challenges.

As a Senior Reliability Engineering Specialist, you will leverage your engineering expertise to enhance the reliability, availability, and efficiency of our systems, while also improving workflows, automation, and scalability.

Upon meeting eligibility criteria, you will become a stakeholder in the company, so we seek individuals who demonstrate commitment, diligence, and a focus on quality and customer satisfaction. Our 'partner-owned' philosophy emphasizes that our most valuable asset—our people—drive the innovation, growth, and success that establish H-E-B as a leader in omnichannel retail.



Do you possess:
HEART FOR PEOPLE... strong interpersonal skills?

HEAD FOR BUSINESS... a systematic approach to problem-solving?

PASSION FOR RESULTS...

an eagerness for automation and continuous enhancement? We are looking for candidates with:
- A minimum of 5 years of relevant experience.

- A profound understanding of software engineering principles with a focus on reliability, scalability, and performance optimization.

- Proficiency in one or more programming languages suitable for SRE tasks (e.g., Python, Go, Java, Rust).

What does the role entail?

Design & Development:
- Formulate and execute comprehensive monitoring, SLO tracking, and capacity-planning strategies that align with business objectives.

- Perform in-depth analyses of performance bottlenecks to enhance system efficiency.

- Establish architectural guidelines and best practices to improve distributed system design for resilience by applying software engineering principles.

- Contribute to long-term reliability strategies.

- Drive substantial process improvements that influence broader team practices.


Engage in the entire lifecycle of services, from inception and design through deployment, operation, and refinement, utilizing your knowledge of system architecture.

What is your background?
- M.S. or B.S. in Computer Science or a related field (or equivalent experience in large-scale distributed systems).

- A deep understanding of software engineering principles with an emphasis on reliability, scalability, and performance optimization.

- Proficiency in one or more programming languages suitable for SRE tasks (e.g., Python, Go, Java, Rust).


Do you have what it takes to excel as an H-E-B SRE? Extensive experience in designing and implementing resilient, high-performing, and scalable software solutions, grounded in a thorough understanding of systems and networks.


Exceptional communication and collaboration skills:
- Ability to lead cross-functional teams, advocate for reliability best practices, and drive strategic initiatives from inception to completion.


- Proven analytical and problem-solving abilities, coupled with a focus on preventative solutions: Capacity to proactively identify systemic risks and inefficiencies, architecting comprehensive solutions that address underlying issues.


- Proven ability to thrive in a high-growth, fast-paced technical environment: Independently manage complex and ambiguous challenges, prioritize effectively, and make sound judgment calls under pressure.


- Ability to strategically align reliability goals with business objectives: Demonstrate an understanding of how SRE practices impact both technical KPIs and broader company goals.


- Passion for mentorship and knowledge sharing within a robust software engineering culture: Lead by example, foster the growth of SRE team members, and shape the direction of reliability engineering at H-E-B.

Can you...
- Function in a fast-paced, retail office environment?

- Travel by car or plane with overnight stays?

- Work extended hours; sit for prolonged periods; work rotating and on-call schedules?

DEVS3232

ISA3232

  • San Antonio, Texas, United States H-e-b, L.p. Full time

    Join H-E-B as a Senior Reliability EngineerAt H-E-B, our digital solutions are rapidly evolving, and our Digital Technology teams thrive in a dynamic environment, continuously acquiring new skills to tackle significant business challenges.As a Senior Reliability Engineer, you will leverage your engineering expertise to enhance the reliability, availability,...


  • San Antonio, Texas, United States H-E-B Corporate Full time

    Key Responsibilities: H-E-B's digital initiatives are expanding rapidly, and our Digital Technology teams thrive in a dynamic environment, acquiring new skills and applying them to tackle significant business challenges. As a Senior Site Reliability Engineer, you will leverage your engineering expertise to enhance the reliability, availability, and...


  • San Antonio, Texas, United States H-E-B Corporate Full time

    Key Responsibilities: H-E-B's digital solutions are expanding rapidly, and our Digital Technology teams operate in a dynamic environment, acquiring new skills and applying them to tackle significant business challenges. As a Senior Site Reliability Engineer, you will leverage your engineering expertise to enhance the reliability, availability, and efficiency...


  • Texas, United States JAS Recruitment Full time

    The Senior Manager of Engineering is tasked with directing all facets of engineering and reliability initiatives within the organization. This position entails guiding a team of engineers and support personnel to guarantee the effective functioning of machinery, facilities, and infrastructure. Key Responsibilities:Formulate and execute detailed plans and...


  • San Jose, United States Ehub Global Inc Full time

    Senior Reliability Engineer San Jose CA FULLTIME Job Description: Perform reliability evaluation of IC products, packages, and process technology with focus on suitability to end applications and conformance to industry standards. Perform device level failure analysis for an in-depth understanding of IC device failures. Analyze reliability results, device...


  • San Jose, United States Ehub Global Inc Full time

    Senior Reliability EngineerSan Jose CAFULLTIMEJob Description: Perform reliability evaluation of IC products, packages, and process technology with focus on suitability to end applications and conformance to industry standards. Perform device level failure analysis for an in-depth understanding of IC device failures. Analyze reliability results, device...


  • San Antonio, Texas, United States Energy Transfer Full time

    Position OverviewThis role focuses on enhancing the reliability of electrical systems within our operations and project engineering sectors. The primary objective is to ensure optimal performance and maintenance of systems related to pipelines and terminals.Key ResponsibilitiesReliability Engineering: Develop and implement strategies to improve the...


  • San Diego, United States Indotronix Avani Group Full time

    Job Description:A Systems /Reliability Engineer in this role is responsible for using their broad technical expertise, leadership skills, and product development knowledge to provide Systems leadership through various phases of the product life cycle. This Systems /Reliability Engineer will manage the verification and traceability of product requirements and...


  • San Diego, United States Indotronix Avani Group Full time

    Job Description:A Systems /Reliability Engineer in this role is responsible for using their broad technical expertise, leadership skills, and product development knowledge to provide Systems leadership through various phases of the product life cycle. This Systems /Reliability Engineer will manage the verification and traceability of product requirements and...


  • San Diego, California, United States Mentis Systems Full time

    Job OverviewWe are currently seeking a Senior Reliability Systems Engineer at Mentis Systems. This role is pivotal in overseeing the development of new products and ensuring their reliability throughout the lifecycle.Position DetailsRole: Senior Reliability Systems EngineerDuration: 12+ months ContractLocation: Hybrid/San Diego CAKey ResponsibilitiesThe...


  • San Diego, California, United States Mentis Systems Full time

    Job OverviewWe are currently seeking a Senior Reliability Systems Engineer at Mentis Systems. This role is crucial for driving our New Product Development initiatives.Position DetailsThe Senior Reliability Systems Engineer will utilize their extensive technical expertise and leadership capabilities to guide Systems Engineering through various stages of the...


  • Texas, United States Catalyst Recruiting, Inc Full time

    Senior Mechanical Engineer - Reliability FocusLocation: Hybrid RoleCatalyst Recruiting, Inc is seeking a proficient mechanical engineer with a strong background in reliability engineering to oversee mechanical engineering and reliability initiatives across a large-scale operation.Position Overview:This role encompasses the management of mechanical...


  • Texas, United States JAS Recruitment Full time

    The Senior Manager of Engineering is tasked with overseeing the various facets of engineering and reliability functions within the organization. This position entails guiding a team of engineers and support personnel to guarantee the smooth operation of machinery, facilities, and infrastructure. Key Responsibilities:Formulate and execute detailed plans and...


  • San Antonio, United States Sunoco LP Full time

    Sunoco LP is a leading energy infrastructure and fuel distribution master limited partnership operating across 47 U.S. states, Puerto Rico, Europe, and Mexico. The Partnership's midstream operations include an extensive network of approximately 9,500 miles of pipeline and over 100 terminals. This critical infrastructure complements the Partnership's fuel...


  • Texas, United States Austin Fraser Full time

    Infrastructure Reliability Specialist | Remote OpportunityA prominent financial technology firm is seeking to onboard a skilled Infrastructure Reliability Specialist with a solid foundation in software development. This position is crucial for collaborating with the engineering team to maintain platform uptime and ensure efficient software deployment.The...


  • San Antonio, Texas, United States First Resort Global Recruitment Full time

    Become a Senior Reservoir Engineering Specialist with UsFirst Resort Global Recruitment is seeking a highly skilled individual to join a leading organization in the Oil, Gas, and Petroleum sector, known for its esteemed reputation in the industry.We are looking for a talented professional at the Senior Level to play a pivotal role in our client’s...


  • San Antonio, Texas, United States Sunoco LP Full time

    Position OverviewSunoco LP is a prominent player in the energy sector, specializing in fuel distribution and infrastructure across various regions including the United States, Puerto Rico, Europe, and Mexico.Role ResponsibilitiesOversee and enhance the reliability of electrical systems within our extensive operations.Conduct thorough assessments and analyses...


  • San Francisco, California, United States Chelsoft Solutions Co Full time

    Job OverviewWe are seeking a Senior Site Reliability Engineer to join our dynamic team at Chelsoft Solutions Co. This position is designed for a skilled SRE professional who thrives in a hybrid work environment.Key ResponsibilitiesImplement and maintain reliable systems and infrastructure.Collaborate with cross-functional teams to enhance system...


  • San Antonio, Texas, United States CNG Engineering Full time

    Job Summary:CNG Engineering, a professional engineering design firm, is seeking an experienced Senior Mechanical Engineer to lead our projects in San Antonio or Austin, Texas. As a key member of our team, you will be responsible for successfully planning, managing, and executing projects on time and within budget while maintaining current client...


  • San Diego, California, United States Mentis Systems Full time

    Job OverviewWe are currently seeking a Senior Reliability Systems Engineer to contribute to a pivotal project at Mentis Systems. This role is essential for ensuring the robustness and reliability of our innovative products.Key Responsibilities:The selected candidate will be responsible for:Utilizing extensive technical knowledge to guide systems engineering...