Lead Software Reliability Engineer

4 weeks ago


Houston, United States Grab Limited Full time

Life at GrabAt Grab, every Grabber is guided by The Grab Way, which spells out our mission, how we believe we can achieve it, and our operating principles - the 4Hs: Heart, Hunger, Honour and Humility. These principles guide and help us make decisions as we work to create economic empowerment for the people of Southeast Asia.Get to know the TeamThe Business & Transaction Platform, SNP and DNA SRE SRE team is a longstanding team responsible for the stable operation of the core Grab systems. We make an impact by contributing to Business & Transaction Platform, Search & Personalization, Demand and Ads systems and the company's stability and operational excellence. Our team is made up of a group of passionate Site Reliability Engineers. If you are looking for an opportunity to work in a large scale cloud environment and utilize your sharp ideas to make engineers’ life better, then you should join our teamGet to know the RoleWe are looking for a Lead Software Reliability Engineer to provide better stability and operational excellence for Business & Transaction Platform, SNP and DNA tech families in Grab. We believe a successful candidate has professional sysops/infrastructure knowledge and the ability to build comprehensive systems, but if you believe you have what it takes then we’d love to hear from you either way. This role is required because stability and operational excellence is critical to our services. In return, you will get an opportunity to generate impacts to Grab’s core systems.The Day-to-Day ActivitiesEngage in and improve the whole lifecycle of services - from design, through deployment, operation and refinement.Work with engineering teams to design and write code to create systems which are highly available and able to scale seamlessly.Help improve reliability, stability and scalability challenges with engineering teamsGet involved in deep diagnosis of incidents, and engage with multiple highly skilled engineering teams on resolutions.Maintain services once they are live by measuring and monitoring availability, latency and overall system health.Contribute to a culture of learning and responsibility by guiding teams to write detailed postmortem reports.Identify and resolve problems relating to critical service operations and to prevent their recurrence using automation.Be part of a cool team, responsible for one of the largest cloud based services in South East Asia.Mentor other engineers, define our technical culture, set high engineering bars and help build a fast-growing teamLead other engineers to conquer challenging projects with great qualitiesContribute initiatives to improve tech family’s stability and operational excellenceThe Must-HavesBachelor's or Master's degree in Computer Science, Software Engineering, Information Technology or related technical field involving coding.Preferably with at least 5 years of relevant experience of this role.Strong experience with algorithms, data structures, complexity analysis and software design.Strong experience in one or more of the following: Go, Python, C, C++, Java, Perl or Ruby.Strong experience in using service monitoring, log, and alarm-related environments and tools.Strong experience in system troubleshooting in Linux environment.Solid experience in using Linux commands and shell script, and has the ability to automate routine tasks.Solid experience with automation & provisioning tools (e.g Jenkins, Ansible/Chef/SaltStack/Puppet).Possess analytical skills, mental resilience and the ability to think systematically under stressful conditions.Highly accountable and takes ownership. Outstanding work ethic, high-integrity, team player, and a lifelong learner.Proficiency in verbal and written English.The Nice-to-HavesExperience in Go.Experience with cloud based large-scale infrastructure from vendors such as Amazon Web Services, Azure or Google Cloud PlatformExperience with containerization technologies (e.g Docker) and container orchestration platforms (e.g Kubernetes)Experience on building high throughput streaming services, and knowledge on the streaming processing framework such as FlinkContributes to open source project experience with performance analysis and debugging tools.Our CommitmentWe recognize that with these individual attributes come different workplace challenges, and we will work with Grabbers to address them in our journey towards creating inclusion at Grab for all Grabbers.



  • Houston, United States The Chemical Engineer Full time

    Basic Function Supply Chain is a customer-focused Center of Excellence providing industry-leading service while delivering differential value to the business, today and into the future. We separate our Supply Chain functions into several areas; these include logistics, customer fulfillment, services, trade compliance, and support for business processes and...


  • Houston, United States Grab Limited Full time

    Life at Grab At Grab, every Grabber is guided by The Grab Way, which spells out our mission, how we believe we can achieve it, and our operating principles - the 4Hs: Heart, Hunger, Honour and Humility. These principles guide and help us make decisions as we work to create economic empowerment for the people of Southeast Asia. Get to know the Team The...


  • Houston, United States Acceler8 Talent Full time

    Introduction: We are looking for a Lead Software Engineer - Qt/QML (User-Facing) to join our team in the exciting field of AI and deep learning. Our company specializes in advanced image and video enhancement software, catering to a diverse and growing user base. We have set the gold standard in the industry, and our technology is empowering millions of...


  • Houston, United States Curate Partners Full time

    The Lead Software Engineer leads software engineers in the development and support/maintenance of software solutions, including but not limited to integrations, web applications and services, API, ETL processes, batch, and/or job orchestration spanning all systems and functional areas (such as clinical, claims, enrollment, reporting, finance, and various...


  • Houston, Texas, United States Certarus Ltd. Full time

    Certarus is the North American leader in providing low carbon energy solutions through a fully integrated compressed natural gas (CNG), renewable natural gas (RNG), and hydrogen platform. The company safely delivers clean-burning fuels to remote communities and industrial customers not connected to a pipeline.By displacing more carbon-intensive fuels,...


  • Houston, United States Targa Full time

     This job posting isn't available in all website languages Responsible for identifying and managing asset reliability risk to enable Enterprise Asset Management best practices. Assist in developing, optimizing, and implementing equipment reliability strategies that will maximize asset value creation, safe and compliant equipment operation, and controlled...


  • Houston, United States Lowe's Full time

    Expand your career possibilities. Thank you for dedicating your time and talent to Lowe’s. We want to give you more opportunities to learn and grow, so if you find a position you’re interested in below, we encourage you to apply! Find Your Home to More Possibilities. Job Summary The primary purpose of this role is to translate business requirements and...


  • Houston, Texas, United States Octopus Energy Full time

    The energy industry is undergoing the largest transformation since industrialisation at an unprecedented rate of change and we are positioning ourselves to be at the heart of that change. Our aim is to be the leading global provider of solutions that enable customers to release £30bn of value per annum from distributed energy resources (DERs). We are...


  • Houston, United States Urbangridco Full time

    Urban Grid, a leading independent power producer, facilitates a rapid and sustainable energy transition by developing high-quality renewable energy projects, fostering community partnerships, and serving as a good land steward. Our company is positioned to own and operate its facilities while cultivating a land management system that benefits farmers,...


  • Houston, United States Aerodyne Industries, LLC Full time

    **Job Description**: - Aerodyne Industries is a dynamic, rapidly growing engineering and information technology services firm headquartered on Florida’s exciting Space Coast. With locations throughout the US, we take pride in delivering small business agility with large corporation capabilities. Our list of clients count on us to prepare NASA’s Missions...

  • Software Engineer

    2 weeks ago


    Houston, United States Seven Seven Software Full time

    Work in collaboration with teams within digital platforms and other application teams to: * Ideate, strategize and develop foundational services and frameworks * Be hands-on with certifying integrity and quality of code and design * Build reusable code and libraries with excellent documentation * Develop cloud native interoperable solutions for foundational...


  • Houston, United States Aerodyne Industries, LLC Full time

    **Job Description**: - Aerodyne Industries is a dynamic, rapidly growing engineering and information technology services firm headquartered on Florida’s exciting Space Coast. With locations throughout the US, we take pride in delivering small business agility with large corporation capabilities. Our list of clients count on us to prepare NASA’s Missions...


  • Houston, United States Octopus Energy Group Full time

    As we expand our international footprint to take KrakenFlex's offerings to customers around the globe, we're looking for an exceptional technical team lead who will cultivate a fantastic developer team experience and continuously deliver features that provide value to our customers. Our ideal technical team lead would be an individual who loves to engage...

  • Reliability Engineer

    3 weeks ago


    Houston, United States Urban Grid Solar Projects, LLC Full time

    Urban Grid, a leading independent power producer, facilitates a rapid and sustainable energy transition by developing high-quality renewable energy projects, fostering community partnerships, and serving as a good land steward. Our company is positioned to own and operate its facilities while cultivating a land management system that benefits farmers,...

  • Reliability Engineer

    3 weeks ago


    Houston, United States Urban Grid Solar Projects, LLC Full time

    Job DescriptionJob DescriptionUrban Grid, a leading independent power producer, facilitates a rapid and sustainable energy transition by developing high-quality renewable energy projects, fostering community partnerships, and serving as a good land steward. Our company is positioned to own and operate its facilities while cultivating a land management system...


  • Houston, United States Urban Grid Solar Projects, LLC Full time

    Urban Grid, a leading independent power producer, facilitates a rapid and sustainable energy transition by developing high-quality renewable energy projects, fostering community partnerships, and serving as a good land steward. Our company is positioned to own and operate its facilities while cultivating a land management system that benefits farmers,...


  • Houston, United States Channel Personnel Services Full time

    Job DescriptionJob DescriptionThe role is part of the Reliability Group supporting plant operation and reliability improvement efforts. Working in a team environment, it carries responsibility for implementing reliability best practices, developing and optimizing preventive maintenance tasks, and supporting maintenance and turnaround activities. The position...


  • Houston, United States Channel Personnel Services Full time

    Job DescriptionJob DescriptionThe role is part of the Reliability Group supporting plant operation and reliability improvement efforts. Working in a team environment, it carries responsibility for implementing reliability best practices, developing and optimizing preventive maintenance tasks, and supporting maintenance and turnaround activities. The position...


  • Houston, United States Mclaurin Aerospace Full time

    Are you passionate about human space exploration, understanding the origins of the universe, and working with a passionate and diverse team to make a difference? If you are, we need you! We need your talent, teamwork, and energy to help us achieve great things that inspire people all over the globe. We need you to bring creative ideas and diverse...


  • Houston, Texas, United States Lyondell Basell North America Full time

    LyondellBasellBasic FunctionIn this position, you will be recognized as the reliability subject matter expert and be responsible for leading the improvements in maintenance and reliability across a number of operational facilities. You will provide leadership and direction to the site to drive long term reliability. You will utilize reliability and lean...