Current jobs related to Site Reliability Engineer - San Francisco - BaseTen Labs, Inc.


  • San Francisco, United States ViralMoment Full time

    Job DescriptionJob DescriptionJob Title: DevOps/Site Reliability EngineerLocation: RemoteAbout ViralMoment:ViralMoment is an AI social listening platform that analyzes social videos to identify trending topics and provide insights to brands and agencies. Our mission is to help our clients stay ahead of the curve by leveraging cutting-edge AI technology.About...


  • San Francisco, United States PicnicHealth Full time

    [Full Time] Site Reliability Engineer at PicnicHealth (United States) Site Reliability Engineer PicnicHealth United States Date Posted: 10 Aug, 2023 Work Location: San Francisco, United States Salary Offered: $160 — $190 yearly Job Type: Full Time Experience Required: 6+ years Remote Work: Yes Stock Options: No Vacancies: 1 available Healthcare needs good...


  • San Francisco, United States Patreon Full time

    Patreon is the best place for creators to build exclusive content and community for their fans. We enable creators (podcasters, writers, musicians, illustrators, etc) to connect with their fans directly and make money from their creative work. Creators can sell one-off items from their own shops or offer recurring monthly memberships with exclusive access to...


  • San Francisco, United States PicnicHealth Full time

    [Full Time] Site Reliability Engineer at PicnicHealth (United States) Site Reliability Engineer PicnicHealth United States Date Posted: 10 Aug, 2023 Work Location: San Francisco, United States Salary Offered: $160 $190 yearly Job Type: Full Time Experience Required: 6+ years Remote Work: Yes Stock Options: No Vacancies: 1 available Healthcare needs good...


  • San Francisco, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • San Francisco, United States PostHog Full time

    [Full Time] Site Reliability Engineer at PostHog (United States) Site Reliability Engineer PostHog United States Date Posted: 31 Oct, 2022 Work Location: San Francisco, United States Salary Offered: Not Specified Job Type: Full Time Experience Required: 3+ years Remote Work: Yes Stock Options: No Vacancies: 1 available About PostHog PostHog is an open-source...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our platform empowers developers to streamline their workflows, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, United States Fieldguide.ai Full time

    [Full Time] Senior Site Reliability Engineer at Fieldguide (United States) | BEAMSTART Jobs Senior Site Reliability Engineer Fieldguide United States Date Posted: 31 Oct, 2022 Work Location: San Francisco, United States Salary Offered: Not Specified Job Type: Full Time Experience Required: 3+ years Remote Work: Yes Stock Options: No Vacancies: 1...


  • San Francisco, California, United States Academia Full time

    SRE / Site Reliability EngineerSAN FRANCISCO, CA or REMOTE from anywhere in the USAWho we are: has built and is expanding the premier distribution and peer review platform for academic research. Guided by our mission to democratize and accelerate the world's research, Academia aims to make every academic paper ever published available for free online and...


  • San Francisco, United States Swish Analytics Full time

    Swish Analytics is a sports analytics, betting and fantasy startup building the next generation of predictive sports analytics data products. We believe that oddsmaking is a challenge rooted in engineering, mathematics, and sports betting expertise; not intuition. We're looking for team-oriented individuals with an authentic passion for accurate and...


  • San Francisco, United States AutoRABIT Holding, Inc. Full time

    About AutoRABIT: AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity for their development team...


  • San Jose, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe’s Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS provider and a prominent leader in the Salesforce DevSecOps platform tailored for regulated sectors such as finance, insurance, and healthcare. Our solutions empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering to...


  • San Jose, California, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe's Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our offerings empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering...


  • San Francisco, United States Withorb Full time

    Mission Orb is on an ambitious mission to provide every business with the infrastructure to unlock their revenue. Best-in class businesses find ways to effectively align their monetization to product usage—whether that's through seats, consumption, feature limits, or usage-based tiers. Orb brings that opportunity to every software company. We are...


  • San Francisco, United States Gusto Full time

    About GustoGusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • San Francisco, United States Gusto Full time

    About Gusto Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • San Francisco, United States Outdefine Full time

    Site Reliability Engineer - Outdefine Partner Location: Hybrid – Sunnyvale, CA Visa Status: USC, GC, H4 EAD Contract: Long term contract: 1099 – US Based candidates / No C2C or C2H About the job Overview: Job Overview: Experience: 5+ years as SRE or DevOps Mandatory Skills: Ecommerce industry experience SRE or DevOps experience Java Kubernetes Azure...


  • San Francisco, United States AutoRABIT Holding Inc. Full time

    Job DescriptionJob DescriptionAbout AutoRABIT:AutoRABIT is a hyper-growth SaaS software company and the leading provider of Salesforce DevSecOps platform for regulated industries such financial institutions, insurance, and healthcare. AutoRABIT solutions enable developers to automate their daily tasks to be more productive and increase the release velocity...

Site Reliability Engineer

2 months ago


San Francisco, United States BaseTen Labs, Inc. Full time

ABOUT BASETEN We're a growing team of builders backed by top-tier investors including IVP, Spark Capital, and Sarah Guo at Conviction. ML teams at enterprises and category-defining AI-native companies like Descript, Bland, and Patreon use Baseten to power their core production workloads with best in class performance, security, and reliability. While we've unlocked PMF and secured Series B funding, the ML infrastructure market is massive and we're just getting started. If you're excited to work on engaging and relevant problems while building something new from the ground up, come join us As a Site Reliability Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and monitoring systems to optimizing performance and managing incidents. We all work closely with our users, learning from their past struggles in operationalizing ML, onboarding them onto our platform, and turning our learnings into ideas for improving Baseten. AS A SITE RELIABILITY ENGINEER, YOU: Have experience building and maintaining scalable infrastructure.

You have extensive experience with Kubernetes.

Know when automation is your friend, and apply it when relevant, e.g. for managing CI/CD pipelines

Have experience establishing standards and best practices for reliability and performance.

Don't need to have prior ML experience, but are open to learning about it.

Bonus Points:

Relevant OSS observability experience (Prometheus, ELK stack, Grafana stack, Opentelemetry)

YOU ALSO: Can own products and projects end-to-end; engineers and designers at Baseten also function as PMs, so we'd like everyone on the team to be able to empathize with our users, understand/write project specs, and manage the end-to-end execution of projects.

Are comfortable with navigating ambiguity and enjoy the journey as much as the destination.

Are motivated by customer problems and find joy in creating simple, elegant solutions that avoid unnecessary complexity.

Exercise good judgment on tradeoffs and tools needed to solve the problem and don't over index on trendy/fashionable tech unless it's the right tool for the job.

Demonstrate pride, ownership, and accountability for your work and expect the same from your teammates.

OUR CURRENT TECH STACK: Backend — Go, Python, Postgres Platform — Kubernetes, Go, Postgres, Redis, Kafka Infrastructure — Gitops, Flux, Terraform, AWS/GCP WHAT WE OFFER Competitive compensation package(Unlimited PTO, 401k, covered healthcare premiums)

A unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era

An inclusive and supportive work culture that fosters learning and growth

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply Now

to embark on a rewarding journey in shaping the future of AI

If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you. At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

#J-18808-Ljbffr