Reliability Engineer

2 months ago


St Louis, Missouri, United States Federal Reserve System Full time

About the Opportunity

We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at the Federal Reserve System. As a key member of our Cloud Solutions & Services department, you will be responsible for implementing reliability practices using software as a means for the cloud foundational product line.

Key Responsibilities

  • Work with cloud foundational platform squads to demonstrate and champion site reliability culture and practices, exerting technical influence throughout the team.
  • Solve reliability issues of cloud platforms using software engineering principles.
  • Develop and maintain automations, scripts, and code associated with automating manual work, improving reliability, and stability of the cloud platform.
  • Develop, integrate, and maintain synthetics (canaries) code to establish the health of the platform.
  • Lead SLIs, SLOs, and Error budgets efforts in collaboration with the product team to instrument, visualize, and proactively manage the stability of cloud platforms.
  • Implement observability (logs, metrics, traces) and monitoring for cloud foundational platforms.
  • Define chaos experiments in collaboration with product owners and conduct experiments.
  • Develop and mentor junior engineers in the team.
  • Other duties assigned as necessary.

Requirements

  • 5-7 years of experience in end-to-end enterprise software development life cycle, including maintenance and support.
  • 3+ years of experience in Observability and SRE practices.
  • Bachelor's degree in computer science, Information Systems, or equivalent background or equivalent experience.
  • The ideal candidate is someone who loves building and maintaining reliable and scalable systems, is passionate about continuous improvement.
  • Self-motivated individual with the ability to prioritize and manage changing priorities.
  • Strong analytic and problem-solving skills.
  • Strong customer focus and communication skills.
  • Independent critical thinking and decision-making abilities.
  • Excellent written and oral communication abilities.

Expertise

  • Extensive knowledge and experience of working in AWS environments.
  • Software development experience with one of the languages: Python, GoLang.
  • Experience with observability and tools like Dynatrace, Prometheus, Grafana, AWS CloudWatch, AWS Canary, AWS event bridge.
  • Expertise in automation and tooling.
  • Working experience in Agile and Scaled Agile environments.
  • Experience supporting infrastructure for large multi-services applications.
  • Knowledge of secure coding standards and banking environment is a plus.

Benefits

  • Great medical benefits.
  • Pension and 401(k) with employer match.
  • Paid time off.
  • Tuition reimbursement.
  • Employee resource networks.
  • Paid volunteer leave.
  • Flexible work options.
  • Onsite amenities that make working here fun.

  • Reliability Engineer

    1 month ago


    St Louis, Missouri, United States Mallinckrodt Full time

    Job TitlePrincipal Mechanical Reliability/Maintenance EngineerJob SummaryThe Mechanical Reliability/Maintenance Engineer plays a critical role in maximizing overall equipment effectiveness by establishing reliability procedures, controls, and reporting systems. This position ensures continuous improvement of all maintenance reliability processes.Key...

  • Reliability Engineer

    2 weeks ago


    St Louis, Missouri, United States Mallinckrodt Full time

    Job TitlePrincipal Mechanical Reliability/Maintenance EngineerJob SummaryThe Mechanical Reliability/Maintenance Engineer plays a crucial role in maximizing overall equipment effectiveness by establishing reliability procedures, controls, and reporting systems. This position ensures continuous improvement of all maintenance reliability processes.Key...


  • St Louis, Missouri, United States Mallinckrodt Pharmaceuticals Full time

    Job Title: Mechanical Reliability/Maintenance EngineerMallinckrodt Pharmaceuticals is seeking a skilled Mechanical Reliability/Maintenance Engineer to join our team. As a key member of our maintenance department, you will play a critical role in ensuring the reliability and efficiency of our equipment and processes.Key Responsibilities:Develop and implement...


  • St Louis, Missouri, United States Mallinckrodt Pharmaceuticals Full time

    Job Title: Mechanical Reliability/Maintenance EngineerMallinckrodt Pharmaceuticals is seeking a skilled Mechanical Reliability/Maintenance Engineer to join our team. As a key member of our maintenance department, you will play a critical role in ensuring the reliability and efficiency of our equipment and processes.Key Responsibilities:Develop and implement...


  • St Louis, Missouri, United States Mallinckrodt Pharmaceuticals Full time

    Job Title: Mechanical Reliability/Maintenance EngineerJob Summary:The Mechanical Reliability/Maintenance Engineer plays a crucial role in maximizing overall equipment effectiveness by establishing reliability procedures, controls, and reporting systems. This position ensures continuous improvement of all maintenance reliability processes.Key...


  • St Louis, Missouri, United States Insight Global Full time

    Job Title: Site Reliability EngineerInsight Global is seeking a highly skilled Site Reliability Engineer to join our team in St. Louis, MO. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable cloud infrastructure...


  • St Louis, Missouri, United States JobRialto Full time

    Job SummaryAs a Site Reliability Engineer at JobRialto, you will play a critical role in ensuring the reliability, quality, and efficiency of our distributed software applications. Your primary responsibility will be to monitor the production environment, identify potential issues, and implement solutions to prevent downtime and improve overall system...

  • Reliability Engineer

    2 weeks ago


    St Louis, Missouri, United States Fulcrum Digital Full time

    Job Title: Sr System Reliability EngineerFulcrum Digital is a cutting-edge digital transformation company that accelerates innovation and technology services. We are seeking a highly skilled Sr System Reliability Engineer to join our team.Key Responsibilities:Design and implement strategies for Application Performance Monitoring and Optimization in...


  • St Louis, Missouri, United States United Software Group Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at United Software Group. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement highly available cloud architectures...


  • St Louis, Missouri, United States Equifax Full time

    {"title": "Site Reliability Engineer", "description": "At Equifax, we're on a mission to power your possible. If you're looking to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. Our Site Reliability Engineering (SRE) team is a discipline that combines...


  • St Louis, Missouri, United States Insight Global Full time

    Insight Global is seeking a Cloud Reliability Engineer to join a Federal client's team in St. Louis, MO. This team supports the U.S. Department of the Treasury in key business lines related to financial management for the federal government.As a Cloud Reliability Engineer, you will report to a Manager and be part of a team that provides software engineering...


  • St Louis, Missouri, United States Insight Global Full time

    Job Title: Site Reliability EngineerInsight Global is seeking a skilled Site Reliability Engineer to join our team in St. Louis, MO. As a Site Reliability Engineer, you will be part of a team that provides software engineering services to a forecasting system owned by the Department of Treasury's Bureau of the Fiscal Service.This role will involve designing...


  • St Louis, Missouri, United States Futran Tech Solutions Pvt. Ltd. Full time

    System Reliability EngineerFutran Tech Solutions Pvt. Ltd. is seeking a skilled System Reliability Engineer to join our team. As a key member of our IT organization, you will be responsible for ensuring the reliability and performance of our production environment.Key Responsibilities:Plan, manage, and oversee all aspects of a Production EnvironmentDefine...


  • St Louis, Missouri, United States Mallinckrodt Pharmaceuticals Full time

    Job TitleMechanical Reliability/Maintenance EngineerJob SummaryThe Mechanical Reliability/Maintenance Engineer plays a critical role in maximizing overall equipment effectiveness by establishing reliability procedures, controls, and reporting systems. This position ensures continuous improvement of all maintenance reliability processes.Key...


  • St Louis, Missouri, United States Futran Tech Solutions Pvt. Ltd. Full time

    Job Title: System Reliability EngineerFutran Tech Solutions Pvt. Ltd. is seeking a highly skilled System Reliability Engineer to join our team. As a System Reliability Engineer, you will be responsible for planning, managing, and overseeing all aspects of a Production Environment.Key Responsibilities:Plan, manage, and oversee all aspects of a Production...


  • St Louis, Missouri, United States Futran Tech Solutions Pvt. Ltd. Full time

    Job Title: System Reliability EngineerFutran Tech Solutions Pvt. Ltd. is seeking a highly skilled System Reliability Engineer to join our team. As a key member of our IT department, you will be responsible for ensuring the smooth operation of our production environment.Key Responsibilities:Plan, manage, and oversee all aspects of the production...


  • St Louis, Missouri, United States Mallinckrodt Full time

    Job TitleMechanical Reliability/Maintenance EngineerJob SummaryThe Mechanical Reliability/Maintenance Engineer plays a crucial role in maximizing overall equipment effectiveness by establishing reliability procedures, controls, and reporting systems. This position ensures continuous improvement of all maintenance reliability processes.Key...


  • St Louis, Missouri, United States Vantage Point Consulting Inc. Full time

    Job Summary:Vantage Point Consulting Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our digital platforms and infrastructure.Main Responsibilities:Troubleshoot and resolve infrastructure issues and incidents...


  • St Louis, Missouri, United States Diverse Lynx Full time

    At Diverse Lynx LLC, we are seeking a skilled Site Reliability Engineer to contribute to the reliability and uptime of our digital platforms, which are critical for our global operations and customer success.The ideal candidate will work on projects that have a direct impact on our revenue and profitability, driving continuous improvement initiatives that...


  • St Louis, Missouri, United States Vantage Point Consulting Inc. Full time

    Job Summary:The Site Reliability Engineer plays a critical role in ensuring the reliability, scalability, and performance of our digital platforms and infrastructure. As part of a global team of highly skilled engineers, the SRE will work on challenging and impactful projects that directly contribute to our core business activities.Main Responsibilities:...