ML Platform Engineer

4 weeks ago

San Francisco, United States Abridge AI, Inc. Full time

Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most: their patients.

Our enterprise-grade technology transforms patient-clinician conversations into structured clinical notes in real-time, with deep EMR integrations. Powered by Linked Evidence and our purpose-built, auditable AI, we are the only company that maps AI-generated summaries to ground truth, helping providers quickly trust and verify the output. As pioneers in generative AI for healthcare, we are setting the industry standards for the responsible deployment of AI across health systems.

We are a growing team of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make more sense.
The Role

As an ML Platform Engineer at Abridge, you will be responsible for scaling and deploying machine learning models to handle increasing traffic demands and integrating them with various platforms. You'll play a pivotal role in building a scalable infrastructure that not only supports current deployments but also lays the foundation for long-term growth. Your role will be critical in ensuring our AI-driven healthcare platform is powered by robust, scalable, and efficiently deployed models.
What You'll Do

Architect, design, and implement ML software systems for deploying and managing models at scale.
Stand up ML models for inference, starting with critical models like the 'linkages' model, and ensure they are capable of handling traffic increases.
Develop and maintain infrastructure that supports efficient ML operations, including model evaluations, deployments, and training at scale.
Collaborate closely with ML researchers, engineers, and cross-functional teams to ensure seamless integration of models with services like Zoom and Athena.
Work with stakeholders across machine learning and operations teams to iterate on systems design and implementation.
Optimize and maintain the performance of ML systems to ensure high availability, fault tolerance, and smooth scalability.
Troubleshoot production issues and continuously improve systems to enhance performance and efficiency.

What You'll Bring

5+ years of experience in ML model deployment and scaling, with a focus on production-quality software
Strong proficiency in Python and Kubernetes, with experience building scalable ML infrastructure
Expertise in designing fault-tolerant, highly available systems.
Experience working with cloud environments, Infrastructure as Code (IaC), and managing deployments using Kubernetes.
Proficiency in optimizing system performance, debugging production issues, and designing systems for scalability and security.
Experience in software design and architecture for highly available machine learning systems for use cases like inference, evaluation, and experimentation
Excellent understanding of low-level operating systems concepts, including multi-threading, memory management, networking and storage, performance, and scale
Bachelor's/Master's Degree or greater in Computer Science/Engineering, Statistics, Mathematics, or equivalent
Excellent interpersonal and written communication skills

Ideally, You Have

Experience with large-scale ML platforms like Ray, Databricks, or AnyScale
Expertise with ML toolchains such as PyTorch or TensorFlow
Proven experience working with distributed systems and handling inference at scale
Background in working with teams and leaders to deliver impactful ML-powered solutions in fast-paced environments
Demonstrated experience incubating and productionizing new technology, working closely with research scientists and technical teams from idea generation through implementation

We value people who want to learn new things, and we know that great team members might not perfectly match a job description. If you're interested in the role but aren't sure whether or not you're a good fit, we'd still like to hear from you.

Base Salary: $200,000 USD - $265,000 USD per year + Equity

The salary range provided is based on transparent pay guidelines and is an estimate for candidates residing in the San Francisco and New York City metro areas. The actual base salary will vary depending on the candidate's location, relevant experience, skills, qualifications, and other job-related factors. Additionally, this role may include the opportunity to participate in a company stock option plan as part of the total compensation package.

Must be willing to work from our SF office at least 3x per week

This position requires a commitment to a hybrid work model, with the expectation of coming into the office a minimum of three (3) times per week. Relocation assistance is available for candidates willing to move to San Francisco within 6 months of accepting an offer.

Must be willing to travel up to 10%

Abridge typically hosts a three-day builder team retreat every 3-6 months. These retreats often feature internal hackathons, collaborative project sessions, and social events that allow the team to connect in person.

Why Work at Abridge?

Be a part of a trailblazing, mission-driven organization that is powering deeper understanding in healthcare through AI
Opportunity to work and grow with talented individuals and have ownership and impact at a high-growth startup.
Flexible/Unlimited PTO - Salaried team members can take off as much approved time off as they need, plus 13 paid holidays
Equity - For all salaried team members
Medical insurance - We pay 100% of the premium for you + 75% for dependents. 3 Aetna plans to choose from.
Dental & Vision insurance - We pay 100% of the premium for you + 75% for dependents. 2 Aetna plans to choose from.
Flexible Spending (FSA) & Health Savings (HSA) Accounts
Learning and Development budget - $3,000 per year for coaching, courses, workshops, conferences, etc.
401k Plan - Contribute pre-tax dollars toward retirement savings.
Paid Parental Leave - 16 weeks paid parental leave, for all full-time employees
Flexible working hours - We care more about what you accomplish than what specific hours you're working.
Home Office Budget - We provide up to $1,600 in a one-time reimbursement to set up your home office.
Sabbatical Leave - 30 days of paid Sabbatical Leave after 5 years of employment.
...Plus much more

Life at Abridge

At Abridge, we're driven by our mission to bring understanding and follow-through to every medical conversation. Our culture is founded on doing things the "inverse" way in a legacy system: focusing on patients, instead of the system; focusing on outcomes, instead of billing; and focusing on the end-user experience, instead of a hospital administrator's mandate.

Abridgers are engineers, scientists, designers, and health policy experts from a diverse set of backgrounds, an experiment in alchemy that helps us transform an industry dominated by EHRs and enterprise into a consumer-driven experience, one recording at a time. We believe in strong ideas, loosely held, and place a high premium on a growth mindset. We push each other to grow and expose each other to the latest in our respective fields. Whether it's holding a PhD-level deep dive into understanding fairness and underlying bias in machine learning models, debating the merits of a Scandinavian design philosophy in our UI/UX, or writing responses for Medicare rules to influence U.S. health policy, we prioritize sharing our findings across the team and helping each other be successful.
Diversity & Inclusion

Abridge is an equal opportunity employer. Diversity and inclusion are at the core of what we do. We actively welcome applicants from all backgrounds (including but not limited to race, gender, educational background, and sexual orientation).
Staying Safe - Protect Yourself From Recruitment Fraud

We are aware of individuals and entities fraudulently representing themselves as Abridge recruiters and/or hiring managers. Abridge will never ask for financial information or payment, or for personal information such as bank account number or social security number during the job application or interview process. Any emails from the Abridge recruiting team will come from an @abridge.com email address. You can learn more about how to protect yourself from these types of fraud by referring to this article. Please exercise caution and cease communications if something feels suspicious about your interactions.
