Senior Cloud Infrastructure Engineer

2 days ago


San Francisco, California, United States | MongoDB | Full time
About MongoDB

MongoDB is a leading developer data platform that empowers innovators to create, transform, and disrupt industries by unleashing the power of software and data.

Job Description

We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Team. As a key member of our team, you will be responsible for designing and building the global infrastructure on which we deploy our services.

Our customers are growing and globalizing, and our services must satisfy demands for low-latency requests around the globe, while complying with various data sovereignty requirements.

You will work closely with our Cloud teams to design, implement, and troubleshoot the automation and monitoring of services that seamlessly span the globe and several cloud providers.

As a Senior Site Reliability Engineer, you will become an expert in infrastructure performance, helping us optimize from the application level all the way through the firmware.

You will build for resilience, ensuring that our infrastructure is designed to minimize downtime and maximize availability.

Key Responsibilities:

  • Design and build the infrastructure for a global cloud service that comprises hundreds of thousands of MongoDB clusters.
  • Design, implement, and troubleshoot the automation and monitoring of services that seamlessly span the globe.
  • Become an expert in infrastructure performance, helping us optimize from the application level all the way through the firmware.
  • Build for resilience, ensuring that our infrastructure is designed to minimize downtime and maximize availability.
  • Improve our infrastructure capabilities, optimizing for cost, simplicity, and maintainability.

Requirements:

  • Experience running a mission-critical service at scale.
  • Understanding of information security issues.
  • Prior experience running critical production systems in a Linux environment.
  • Firm grasp of at least one modern programming language, beyond basic scripting.
  • Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc.).
  • Bachelor's degree in Computer Science or equivalent experience.
  • Experience writing automation tools and an eagerness to automate all the things.

Nice to Have:

  • Experience building large applications from scratch, complete with CI/CD infrastructure.
  • Experience in networking, security, hardware or OS performance tuning.
  • Experience with at least one of the major cloud providers (Amazon Web Services, Google Cloud Platform, Microsoft Azure).
  • Experience managing Kubernetes clusters or some other container orchestration infrastructure.
  • Experience with observability of large-scale distributed systems.

What's in it for you:

  • Generous compensation package (top-range salary: we pay at the 95th percentile, and our package includes equity and generous benefits).
  • Opportunities to learn on the job (time to upskill in new technologies).
  • High level of independence in your day-to-day work.

About MongoDB's Culture:

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment, prohibits discrimination and harassment of any type, and makes all hiring decisions without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.


