Site Reliability Engineer
2 weeks ago
Job Description:
Our client is one of the world’s leading manufacturers of semiconductor chip-making equipment. A majority of the world’s microchips receive their critical lithographic patterning in machines made by this organisation. In addition, they produce metrology tools and advanced applications to analyze and optimize the performance of the customer production process.
Job Mission:
Troubleshoot short-term problems and translate them into structural improvements to our distributed data and compute platform infrastructure.
Be accurate and precise, and help drive up the aggregate availability of the installations of these distributed computing systems in Korea, Taiwan, Israel, China, the US and elsewhere.
Be part of the computing platform that is one of the main pillars under the production of the next-generation microchips of Apple, Samsung and many others.
Responsibilities:
Create awareness in other teams of the methods and procedures we use, to help them avoid repetitive help requests.
Help application developers to understand the infrastructure / cluster / system.
Share knowledge / mindset with other teams (dev/infra engineers).
Contribute towards building VCP as a product that meets our standards of quality.
Increase the stability and reliability of VCP through automated testing and automation.
Customer satisfaction and product reliability.
Improve the functionality and reliability of VCP.
Translate customer ecosystem needs to engineering deliverables.
Find the broken pieces of the puzzle at system/cluster level.
Make the VCP reliable by improving system resilience (bug-fixing and beyond).
Resolve bugs in a sustainable way (implement regression tests, design structural fixes).
Act as an ambassador for predictable component lifecycle management.
Maintain the technical roadmap (application lifecycle management).
Support feature and service requests from the field.
Suggest improvements to our technical solutions and way of working, and implement them in alignment with your team and their stakeholders.
Highly valued qualifications & experiences:
Experience with DC/OS.
Experience introducing new technology with zero downtime, including data migration.
Fan of automated testing and qualification, ideally as part of a CI/CD pipeline.
Affinity for digging deep into the details of networking issues.
Available to work (remotely) outside regular office hours when our attempt to build a fail-safe system proves not yet successful. We really want this to be an exception, not the rule.
Required qualifications & experiences:
Knowledge of distributed computing systems, with practical experience (a must).
Experience with build and release infrastructure: Maven, Nexus, Bamboo, GitHub.
Familiar with at least one scripting language (Python).
Experience with Ansible.
Linux expert.