Reliability Engineer

4 weeks ago


New York, New York, United States | Alchemy | Full time
About the Role

We're seeking a highly skilled Reliability Engineer to join our Infrastructure team at Alchemy. As a key member of our team, you will play a critical role in designing, deploying, and continuously improving the infrastructure that supports our globally used developer platform.

Key Responsibilities
  • Develop and own company-wide reliability best practices, including SLO definition, incident management, postmortem reviews, launch readiness reviews, and change management.
  • Architect production infrastructure and tools that encourage and enforce high reliability.
  • Inspire the broader engineering organization to treat reliability as a first-class concern in the products we build.
  • Collaborate with engineering teams on reliability topics, such as high-reliability architecture, observability, and safe change management.
  • Improve critical infrastructure and systems used to operate infrastructure at scale.
  • Develop and own best practices for managing production infrastructure, including provisioning, application scaling, configuration management, capacity planning, monitoring, and more.
  • Develop and own best practices for developer processes, including CI/CD, dev and staging environments, and more.
  • Provide input into long-term platform requirements and operational guidelines with a focus on reliability.
  • Continuously raise our standard of engineering excellence by implementing best practices for coding, testing, and deployment.
Requirements
  • 6+ years of experience as an Infrastructure Engineer focused on Reliability.
  • Experience leading and driving company-wide reliability efforts and engineering initiatives.
  • Experience with observability best practices and tooling, such as Prometheus, Grafana, and Datadog.
  • Experience designing and operating large-scale, multi-region production systems.
  • Experience working with AWS or other cloud infrastructures.
  • Experience with container schedulers and runtimes, such as Kubernetes and Docker.
  • Experience building deployment pipelines with common CI/CD and GitOps tools, such as Argo and Flux.
  • Experience with Infrastructure-as-Code and configuration management tools, such as Terraform, Pulumi, Chef, and Puppet.
  • Strong communication and collaboration skills.
What We Offer

Alchemy is committed to offering competitive compensation, including base salary and equity. We also offer comprehensive medical, dental, and vision coverage, as well as other benefits such as a 401(k) and unlimited flexible time off.

The estimated base salary range for this position is $135,000 to $350,000 annually. This range reflects base salary only and does not include bonus, equity, or benefits. Your salary will be determined by various factors, including relevant experience, skill set, qualifications, and other business needs.

