Current jobs related to Site Reliability Engineer - Houston, Texas - RPOI


  • Houston, Texas, United States Schlumberger Full time

    Full-time or part-time: Full-timeJob title: Site Reliability EngineerJob Location: 1430 Enclave Parkway, Houston, TX 77077Job Description:Create ultra-scalable and highly reliable software systems through system design consulting, capacity planning, system health monitoring, and sustainable incident response. Engage in and improve the entire lifecycle of...


  • Houston, Texas, United States Veradigm® Full time

    Welcome to Veradigm Our mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision We envision a connected community of health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Houston, Texas, United States Invesco Full time

    About InvescoAs a premier global asset management firm, Invesco is committed to assisting investors across the globe in achieving their financial goals. We harness the strength of our unique investment management capabilities to offer a diverse array of investment strategies and vehicles to our clients worldwide.At Invesco, we value challenging work,...


  • Houston, Texas, United States VMC Soft Technologies, Inc Full time

    W2 CONTRACT ONLYC2C CANDIDATES PLEASE DO NOT APPLYPOSITION: Site Reliability EngineerRemote Opportunity Essential Qualifications:Minimum of 3 years of experience in the following technologies:Proficiency in New Relic Platform including APM, Synthetic, and Browser MonitoringExpertise in New Relic Query Language (NRQL)Strong programming skills in...


  • Houston, Texas, United States TalentMatch LLC Full time

    Job OverviewOur partner is a pioneering pressure pumping organization that is experiencing significant growth.With its corporate headquarters in Houston, Texas, and operational locations throughout various oil and gas regions in the United States, they are recognized as the leading provider of electric hydraulic fracturing solutions in the industry....


  • Houston, Texas, United States MK Search Full time

    **About MK Search**MK Search is a leading recruitment agency specializing in placing top talent in the chemical manufacturing industry.**Job Summary**We are seeking a highly skilled and dedicated **Maintenance Specialist** to join our client's team in Houston, Texas.**Key Responsibilities:**Develop and implement comprehensive maintenance programs, including...


  • Houston, Texas, United States E-Solutions INC Full time

    Job OverviewPosition: Reliability EngineerCompany: E-Solutions INCEmployment Type: FulltimeInterview Format: Virtual MeetingCompensation: Competitive Salary (Dependent on Experience)Key Responsibilities:Act as a Maintenance Engineer, executing both Preventive and Corrective Maintenance tasks.Possess strong expertise in Oil & Gas Equipment.Engage in various...


  • Houston, Texas, United States Ben Aris Full time

    About the role at Ben ArisThe Senior Rotating Equipment Specialist will provide machinery engineering support and reliability improvements to the refinery. This position is responsible for reviewing and implementing site-wide standards and improvements to obtain optimum asset performance from rotating equipment.Key Responsibilities:Collaborate with the...


  • Houston, Texas, United States ResourceTek, LLC Full time

    ResourceTek, LLC is collaborating with a prominent player in the metals manufacturing sector to find a skilled Reliability Maintenance Engineer for their operations. The ideal candidate will be a proactive leader in enhancing maintenance and reliability practices.KEY RESPONSIBILITIESDevelop and execute a comprehensive predictive maintenance strategy,...

  • Reliability Engineer

    3 weeks ago


    Houston, Texas, United States Lyondell Basell North America Full time

    LyondellBasellBasic FunctionSupply Chain is a customer-focused Center of Excellence providing industry-leading service while delivering differential value to the business, today and into the future. We separate our Supply Chain functions into several areas; these include logistics, customer fulfillment, services, trade compliance, and support for business...


  • Houston, Texas, United States Scientific Drilling Inc. Full time

    Job OverviewScientific Drilling International, a leading independent service provider in the drilling sector, is currently seeking a Senior Reliability Engineer. Our company specializes in delivering high-precision wellbore placement and drilling solutions across various industries, including Oil & Gas, Coal Bed Methane, Geothermal, and Mining.Position...


  • Houston, Texas, United States VMC Soft Technologies, Inc Full time

    W2 CONTRACT ONLYC2C CANDIDATES PLEASE DO NOT APPLYPOSITION: Site Reliability EngineerRemote Opportunity Essential Qualifications:Minimum 3 years of experience with the following technologies:• Proficient in New Relic Platform including APM, Synthetic, and Browser Monitoring• Expertise in New Relic Query Language (NRQL)• Strong programming skills in...


  • Houston, Texas, United States Marathon Petroleum Full time

    An exciting career awaits youAt MPC, we're committed to being a great place to work – one that welcomes new ideas, encourages diverse perspectives, develops our people, and fosters a collaborative team environment.Position Summary:Overall, the position entails managing rotating equipment reliability at the Galveston Bay Refinery, which is the largest and...


  • Houston, Texas, United States Ascend Performance Materials Full time

    Job SummaryWe are seeking a highly skilled Reliability Engineering Lead to join our team at Ascend Performance Materials. As a key member of our reliability team, you will be responsible for leading a program aimed at identifying and mitigating reliability risks associated with fixed equipment assets and their related systems.Key ResponsibilitiesDevelop and...


  • Houston, Texas, United States LyondellBasell Industries N.V. Full time

    Overview:At LyondellBasell Industries N.V., we pride ourselves on our commitment to excellence in the field of Supply Chain Management. Our focus is on delivering exceptional service that adds significant value to our operations and stakeholders.Key Responsibilities:As a Reliability Engineer, you will play a crucial role in enhancing our operational...


  • Houston, Texas, United States DBSI Services Full time

    Job Title: Reliability Assurance EngineerCompany: DBSI ServicesLocation: Houston, TX (Onsite)Job Overview:As a Reliability Assurance Engineer, you will be responsible for ensuring the operational integrity of equipment through various maintenance strategies. Your expertise will contribute to the enhancement of reliability across our operations.Key...


  • Houston, Texas, United States Sira Consulting, an Inc 5000 company Full time

    Reliability Maintenance EngineerJoin our team at Sira Consulting, an Inc 5000 company, as we seek a dedicated professional in the field of maintenance engineering.This full-time position involves:Executing preventive and corrective maintenance tasks.Demonstrating a solid understanding of oil and gas equipment.Participating in reliability enhancement...


  • Houston, Texas, United States Vallourec Star LP Full time

    Job OverviewPOSITION SUMMARY:Utilize engineering principles to enhance the efficiency of machinery, processes, and departmental expenditures, aiming for improved maintainability, reliability, and equipment availability, assessed through Overall Equipment Effectiveness (OEE).KEY RESPONSIBILITIES:PlanningOversee the planning, assessment, organization, and...


  • Houston, Texas, United States LyondellBasell Industries Full time

    Position Overview:The Senior Mechanical Reliability Engineer plays a pivotal role within the Reliability department, focusing on enhancing the operational integrity of our facilities. This position is responsible for leading a team of engineers and analysts dedicated to the effective implementation of reliability objectives.Key Responsibilities:Leadership...


  • Houston, Texas, United States Channel Personnel Services Full time

    Job OverviewThe Machinery Reliability Engineer plays a crucial role within the Reliability Group, focusing on enhancing plant operations and driving reliability improvements. This position requires collaboration within a team setting and involves the implementation of best practices in reliability, as well as the development and optimization of preventive...

Site Reliability Engineer

2 months ago


Houston, Texas, United States RPOI Full time

Site Reliability Engineers (SREs) are responsible for keeping all user-facing services and other DSX production systems running smoothly.

SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our environments.

As an SRE you will:

Be on a PagerDuty rotation to respond to availability incidents and provide support for service engineers with customer incidents.

Use your on-call shift to prevent incidents from happening

Run our infrastructure with Terraform and Kubernetes.

Use monitoring and alerting to alert on symptoms not outages.

Document every action so that your findings turn into repeatable actions (playbooks) and then into automation.

Improve the deployment process

Design, build and maintain core infrastructure pieces that allow DSX to scale to support hundreds and then thousands of concurrent users

Debug production issues across services and levels of the stack.

Plan the growth of the DSX infrastructure.

Think about systems, and particularly edge cases and failure modes.

Know your way around Linux and the Unix Shell.

Have strong programming skills--preferably Nodejs, but it could be Python, Go, .NET or even Ruby.

Have an urge to collaborate and communicate asynchronously.

Have an urge to document all the things so you don't need to learn the same thing twice

Have an enthusiastic, go-for-it attitude.

When you see something broken, you can't help but fix it.

Have an urge for delivering quickly and iterating fast.

Have experience with Nginx, Docker, Kubernetes, Terraform, or similar technologies.

Have good experience with GitHub. Projects you could work on

Coding infrastructure automation with GitHub Actions and Terraform.

Improving our Prometheus Monitoring or building new Metrics.

Helping to deploy new versions of DSX.

Helping to plan, prepare for, and execute the migration of DSX from virtual machines running on Azure to cloud-native container-based deployments with Kubernetes using Azure Kubernetes Service.

Details Description Technical General knowledge of 4 of the following areas of technical expertise with deep knowledge in 1 area:

Implement "Infrastructure as Code" using Terraform and GitHub CI/CD for automation. - Load balancing of the application including Proxies and CDN.

Kubernetes and container rising our system.

Administering a high-availability MSSQL cluster.

Monitoring and Metrics in Prometheus and Grafana, and their integrations with Slack/PagerDuty.

Logging infrastructure.

Backend storage management and scaling.

Disaster Recovery and High Availability strategy.

Contributing to code for services and automation.

  1. Provide emergency response either by being on-call or by reacting to symptoms according to monitoring and escalation when needed.
  2. Propose ideas and solutions within the infrastructure team to reduce the workload by automation
  3. Plan, design and execute solutions within the team to reach specific, agreed-upon, goals.
  4. Plan and execute configuration change operations both at the application and the infrastructure level.
  5. Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation.