Site Reliability Engineer 3

4 days ago


New York, New York, United States MongoDB Full time $111,000 - $218,000 per year

The Site Reliability Engineering team designs and builds the global infrastructure on which we deploy our services, focusing on our flagship MongoDB Atlas platform. As our customers grow and globalize, our services must satisfy demands for low-latency requests around the globe, and comply with various data sovereignty requirements. The SRE Team's mission is to build this increasingly complex infrastructure, while continually lowering the operational burden associated with it, and increasing our internal visibility into the health of the system. We are strong believers in infrastructure-as-code and self-healing systems. The SRE Team is fully integrated with all the other engineering teams, and the teams work closely together with a soft and traversable boundary between their areas of responsibility.

We are looking to speak to candidates who are based in New York City for our hybrid working model.

Responsibilities

  • Design and build the infrastructure for a global cloud service that comprises hundreds of thousands of MongoDB clusters, processes a billion metrics per day, and replicates tens of billions of database writes to our backup service
  • Design, implement, and troubleshoot the automation and monitoring of services that seamlessly span the globe, including several cloud providers
  • Become an expert in infrastructure performance, helping us optimize from the application level all the way through the firmware
  • Build for resilience. Our goal is that nobody's pager goes off, ever. Are we there yet? No. Are we really close? Very. While we work on that, participate in a weekly on-call rotation
  • Improve our infrastructure capabilities, optimizing for cost, simplicity, and maintainability

Requirements

  • 3+ years of experience running a mission-critical service at scale in a Linux environment
  • Firm grasp of at least one modern programming language, beyond basic scripting
  • Familiarity with web and network protocols and standards (HTTP, TLS, DNS, etc.)
  • Bachelor's degree in Computer Science or equivalent experience
  • Experience writing automation tools & eagerness to "automate all the things"

Nice to have

  • Experience building large applications from scratch, complete with CI/CD infrastructure
  • Experience in networking, security, hardware or OS performance tuning
  • Experience with at least one of the major cloud providers (Amazon Web Services, Google Compute, Microsoft Azure)
  • Experience managing Kubernetes clusters or some other container orchestration infrastructure
  • Experience with observability of large-scale distributed systems

About MongoDB
MongoDB is built for change, empowering our customers and our people to innovate at the speed of the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt industries with software. MongoDB's unified database platform—the most widely available, globally distributed database on the market—helps organizations modernize legacy workloads, embrace innovation, and unleash AI. Our cloud-native platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available across AWS, Google Cloud, and Microsoft Azure.

With offices worldwide and nearly 60,000 customers—including 75% of the Fortune 100 and AI-native startups—relying on MongoDB for their most important applications, we're powering the next era of software.

Our compass at MongoDB is our Leadership Commitment, guiding how and why we make decisions, show up for each other, and win. It's what makes us MongoDB.

To drive the personal growth and business impact of our employees, we're committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees' wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it's like to work at MongoDB, and help us make an impact on the world.

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type and makes all hiring decisions without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

MongoDB's base salary range for this role is posted below. Compensation at the time of offer is unique to each candidate and based on a variety of factors such as skill set, experience, qualifications, and work location. Salary is one part of MongoDB's total compensation and benefits package. Other benefits for eligible employees may include: equity, participation in the employee stock purchase program, flexible paid time off, 20 weeks fully-paid gender-neutral parental leave, fertility and adoption assistance, 401(k) plan, mental health counseling, access to transgender-inclusive health insurance coverage, and health benefits offerings. Please note, the base salary range listed below and the benefits in this paragraph are only applicable to U.S.-based candidates.

MongoDB's base salary range for this role in the U.S. is:

$111,000—$218,000 USD


