Engineering Manager, Model Serving

1 week ago


San Francisco, California, United States Anthropic Full time
About Anthropic

Anthropic is a leading technology company dedicated to developing reliable, interpretable, and steerable AI systems. Our mission is to create AI that is safe and beneficial for our users and society as a whole. Our team is a rapidly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

We are seeking an experienced and highly technical Engineering Manager to lead our Model Serving engineering team, focused on external partnerships. The team's charter is to build scalable infrastructure to support the serving of Anthropic's world-class large language models on leading cloud service providers. The team collaborates closely with experts in Anthropic's research organization, product leaders, and external partners to ensure we deliver industry-leading functionality. The team also works closely with Anthropic's systems and core infrastructure teams to define best practices and supporting systems for delivering this unique service.

Key Responsibilities
  1. Team Leadership: Manage and grow a team of talented backend and infrastructure engineers to deliver on the External Technical Partnerships team charter and goals.
  2. Technical Expertise: Maintain deep technical involvement in the project, to help drive the technical roadmap and execution to ship and expand capabilities, scale, and launch new LLMs. You should be comfortable operating in the codebase alongside the engineers on your team.
  3. Collaboration: Collaborate with product and research teams to define the feature set, API interfaces, and technical requirements for launching new models and features at ever-faster latencies.
  4. Customer Focus: Work closely with product management and external partners to design solutions that meet ever-evolving customer needs.
  5. Communication: Practice excellent communication and upward/outward management to establish high-functioning relationships with internal and external partners and keep your executive stakeholders informed.
  6. Engineering Excellence: Establish engineering practices and operational excellence to power high-quality, scalable fine-tuning services.
  7. Capacity Management: Develop capacity management solutions and business metric tracking to ensure the service scales efficiently.
  8. Team Development: Hire, mentor, and grow a diverse team of top engineering talent.
  9. Culture Building: Foster a culture of innovation, accountability, and customer focus.
Requirements
  1. 5+ years of engineering management experience leading high-performing teams to deliver business-critical products.
  2. Strong backend development background and deep experience operating customer-facing services at scale, with stringent uptime requirements.
  3. Proven track record partnering with customers and navigating enterprise/B2B environments.
  4. Excellent cross-functional leadership and communication skills to align engineering, product, research, and business teams.
  5. Passion for building innovative AI products in a fast-paced, customer-driven environment.
  6. Commitment to developing AI responsibly and safely.
Preferred Qualifications
  1. Deep ML/AI engineering expertise, ideally with experience in large language models and fine-tuning techniques.
  2. Building and operating SaaS or PaaS offerings on public cloud infrastructure.
  3. Developing pricing models, SLAs, and operating agreements for AI/ML products.
  4. Managing relationships with strategic partners and enterprise customers.
  5. Expertise in large-scale capacity management and resource orchestration.
  6. Track record hiring and developing diverse engineering teams.
Compensation and Benefits

Anthropic's compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.

Equity: For eligible roles, equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.

US Benefits: The following benefits are for our US-based employees:

  1. Optional equity donation matching.
  2. Comprehensive health, dental, and vision insurance for you and all your dependents.
  3. 401(k) plan with 4% matching.
  4. 22 weeks of paid parental leave.
  5. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more.
  6. Stipends for education, home office improvements, commuting, and wellness.
  7. Fertility benefits via Carrot.
  8. Daily lunches and snacks in our office.
  9. Relocation support for those moving to the Bay Area.

UK Benefits: The following benefits are for our UK-based employees:

  1. Optional equity donation matching.
  2. Private health, dental, and vision insurance for you and your dependents.
  3. Pension contribution (matching 4% of your salary).
  4. 21 weeks of paid parental leave.
  5. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more.
  6. Health cash plan.
  7. Life insurance and income protection.
  8. Daily lunches and snacks in our office.

This compensation and benefits information is based on Anthropic's good faith estimate for this position as of the date of publication and may be modified in the future. Employees based outside of the UK or US will receive a different benefits package. The level of pay within the range will depend on a variety of job-related factors, including where you place on our internal performance ladders, which is based on factors including past work experience, relevant education, and performance on our interviews or in a work trial.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic, we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us



  • San Francisco, California, United States OpenAI Full time

    We have ambitious goals to make the most capable models broadly available. Being able to distribute these models to billions of users in a reliable fashion requires a world-class compute fleet.As we scale the number of GPUs, number of users, and size of OpenAI, having a team dedicated to the infrastructure to support is crucial.This role will be responsible...

  • AI Engineer

    2 days ago


    San Francisco, California, United States Dataphoenix Full time

    About the RoleWe are seeking a highly skilled AI Engineer to join our team at Dataphoenix as a Model Customization Expert. In this role, you will be responsible for building and training bespoke models for our most innovative and novel solutions, maximizing business impact and innovation for our strategic customers.Key ResponsibilitiesProactively identify...


  • San Francisco, California, United States DoorDash Full time

    About the RoleAs a Machine Learning Engineer at DoorDash, you will have the opportunity to leverage our robust data and machine learning infrastructure to develop ML models that impact millions of users across our three audiences and tackle our most challenging business problems. You will work with other engineers, analysts, and product managers to develop...


  • San Francisco, California, United States Software Aspekte Full time

    About Software AspekteAt Software Aspekte, we are dedicated to creating top-notch tools for AI developers. Our journey began with the realization that while there were exceptional resources for developers to enhance their code, there was a lack of equally effective tools for machine learning practitioners to refine their models. From our initial experiment...


  • San Francisco, California, United States Wispr AI, Inc. Full time

    About Wispr AI, Inc.Wispr AI is pioneering a more intuitive method for technology interaction through advanced neural interfaces. Our team comprises top-tier engineers, product designers, and research scientists dedicated to creating innovative solutions.We have successfully secured $25M in funding from prestigious venture capital firms such as NEA and 8VC....


  • San Francisco, California, United States Software Aspekte Full time

    About Software AspekteAt Software Aspekte, we are dedicated to creating top-notch tools for AI developers. Our journey began with the realization that while developers had access to excellent coding tools, there was a significant gap in resources available for machine learning practitioners to enhance their model-building capabilities. Our initial product...


  • San Francisco, California, United States Software Aspekte Full time

    About Software AspekteAt Software Aspekte, we are dedicated to creating exceptional tools for AI developers. Our company was established with the understanding that while there are outstanding resources for developers to enhance their code, there were insufficient tools available to assist machine learning practitioners in refining their models. Beginning...


  • San Francisco, California, United States Software Aspekte Full time

    About Software AspekteAt Software Aspekte, we are dedicated to creating exceptional tools for AI developers. Our company was established with the understanding that while there are outstanding resources for software developers, there was a lack of equally effective tools for machine learning practitioners to enhance their models.Initially launching our...


  • San Francisco, California, United States Induced Full time

    About the RoleWe are seeking a highly skilled Senior Machine Learning Engineer to join our team at Induced. As a key member of our engineering team, you will be responsible for developing and improving our proprietary Large Language Model (LLM) models.Key ResponsibilitiesModel Development: Develop and improve our proprietary LLM models to achieve...


  • San Francisco, California, United States Jobot Full time

    Hybrid Remote - Reputable Structural and Geotechnical Engineering Firm - Comprehensive Benefits + Career Advancement OpportunitiesAbout Us:With over five decades of experience, we are a leading firm recognized for our structural and geotechnical engineering services across diverse sectors including healthcare, education, residential, and aviation. Our...


  • San Francisco, California, United States Wispr AI, Inc. Full time

    About Wispr AI, Inc.Wispr AI is pioneering a more intuitive method for engaging with technology through advanced neural interfaces. Our exceptional team comprises engineers, product designers, and research scientists dedicated to creating innovative solutions.We have successfully secured $25M in funding from prestigious venture capital firms such as NEA and...


  • San Francisco, California, United States Wispr AI, Inc. Full time

    About Wispr AI, Inc.Wispr AI is pioneering a more intuitive approach to technology interaction through advanced neural interfaces. Our distinguished team comprises top-tier engineers, product designers, and research scientists dedicated to creating transformative solutions.We have successfully secured $25M in funding from leading venture capital firms,...


  • San Francisco, California, United States Wispr AI, Inc. Full time

    About Wispr AI, Inc.Wispr AI is pioneering a revolutionary approach to technology interaction through advanced neural interfaces. Our distinguished team comprises engineers, product designers, and research scientists dedicated to creating transformative solutions.We have successfully secured $25M in funding from leading venture capital firms, including NEA...


  • San Francisco, California, United States DALLAS VA RESEARCH CORPORATION Full time

    About the RoleWe are seeking a highly skilled Research Engineer to join our team at Dallas VA Research Corporation. As a Research Engineer, you will play a key role in advancing the state-of-the-art in large language models.Key ResponsibilitiesDesign and Develop Advanced Language Models: Design methods, tools, and infrastructure to push forward the state of...


  • San Francisco, California, United States Usespeak Full time

    About UsWe are a pioneering company in the field of language learning, dedicated to making it accessible to everyone. Our mission is to revolutionize the way people learn foreign languages, with a focus on creating a seamless and effective experience.Our goal is to empower individuals to communicate confidently in a foreign language, bridging the gap between...


  • San Francisco, California, United States ADVANCED ENGINEERING GROUP PC Full time

    About ADVANCED ENGINEERING GROUP PCADVANCED ENGINEERING GROUP PC is dedicated to the development of reliable, interpretable, and controllable AI systems. Our mission is to ensure that AI technologies are beneficial and safe for society. Our diverse team brings expertise from various fields including machine learning, engineering, policy, and business.Role...


  • San Francisco, California, United States MV Engineering Full time

    About the TeamThe MV Engineering team is scaling OpenAI with cutting-edge technologies. We apply our latest models to real-world problems in order to assist with or automate work across the company—then share what we learn back to the broader product and research teams. We've built an ecosystem of automation products that's applied everywhere from customer...


  • San Francisco, California, United States Jobot Full time

    Hybrid Remote - Established Structural and Geotechnical Engineering Firm - Excellent Benefits + Career Advancement OpportunitiesAbout Us:With over 50 years of experience, we are a leading firm in structural and geotechnical engineering, known for our innovative approach to projects in various sectors including healthcare, education, residential, and...


  • San Francisco, California, United States Figure Full time

    About Figure Figure is at the forefront of transforming financial services through its innovative, scalable, and rapidly evolving technology platform. By leveraging its loan origination capabilities and extensive partner network, Figure is set to introduce and expand new offerings that improve efficiency and transparency within the sector. The integration of...


  • San Francisco, California, United States Induced Full time

    About the RoleWe are seeking a highly skilled Senior Machine Learning Engineer to join our team at Induced. As a key member of our engineering team, you will be responsible for developing and improving our proprietary Large Language Model (LLM) models.Key ResponsibilitiesModel Development: Develop and improve our proprietary LLM models to achieve...