Principal Machine Learning Engineer

5 days ago


San Francisco, California, United States 1Five Full time $200,000 - $250,000 per year

1Five and our clients (seed-stage through publicly traded tech companies) are seeking Principal Machine Learning Engineers with deep expertise in generative AI (diffusion, VLMs, etc.) and/or ML infrastructure, particularly training and inference, and the ability to serve as tech lead for teams. Our clients are working on some of the most compelling problems in machine learning today, including:

  • ML to detect and identify rare earth mineral and precious metal deposits using proprietary data sets;
  • Generative AI to automate computer-aided design (CAD);
  • LLMs and GenAI to deliver personalized healthcare to millions of patients;
  • AI & computational biophysics for drug discovery;
  • and more

You Will

  • Lead engineering efforts to continuously improve the AI platform, with an emphasis on rapid build-out and iteration of scalable, robust distributed infrastructure for ML training, inference, and evaluation.
  • Support model training and deployment across multiple clusters and multiple clouds, optimizing for throughput and cost.
  • Optimize the efficiency of ML models and other workloads in terms of latency, throughput, memory consumption, etc. (e.g., via GPU performance engineering), pushing the limits of what's possible with current hardware.
  • Define the long-term vision for the ML platform.
  • Have the opportunity to mentor and guide more junior members of a technical team as well as research interns, fostering an environment of growth and innovation.

You Are

  • Strong engineer who constantly strives for technical excellence. You can write clean code and have a deep understanding of the codebases you work in.
  • Deeply experienced with distributed training and inference of large models on GPU clusters and with some of the core libraries and frameworks we use: PyTorch, PyTorch Lightning, PyTorch Geometric, and Ray.
  • Independent thinker with a strong sense of ownership and the ability to engineer robust systems from first-principles conceptualization to state-of-the-art realization.
  • Curious, problem-oriented thinker who is excited to dive deep into the emerging fields of AI + geometry, AI + physics, AI + geology, AI + healthcare... and more

Nice to haves

  • Experienced with building, maintaining and debugging low-level cluster infrastructure running on multiple clouds using Kubernetes and Terraform.
  • Experienced GPU engineer who can quickly figure out performance bottlenecks and architect highly performant code for large scale ML workloads.
  • Experience with XLA, Triton, CUDA, or similar accelerator programming languages and/or deep learning compiler stacks.

Please apply directly if you're interested. Thank you.



  • San Francisco, California, United States Atlassian Full time $150,000 - $250,000 per year

    Overview: Working at Atlassian. Atlassians can choose where they work – whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Responsibilities: What you'll do. As a Principal Machine...


  • South San Francisco, California, United States Roche Full time $231,280 - $429,520

    Why Genentech: We're passionate about delivering on Our Promise to improve the lives of patients and create healthier communities for all. We foster a culture of inclusivity, integrity and creativity while boldly pursuing answers to the world's most complex health challenges and transforming society. Who We Are: Our Data, Analytics, and AI...


  • San Francisco, California, United States EvenUp Full time $260,000 - $390,000 per year

    EvenUp is on a mission to close the justice gap using technology and AI. We empower personal injury lawyers and victims to get the justice they deserve. Our products enable law firms to secure faster settlements, higher payouts, and better outcomes for victims injured through no fault of their own in vehicle collisions, accidents, natural disasters, and...


  • San Mateo, California, United States Roblox Full time $289,460 - $338,270 per year

    Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to...


  • San Mateo, California, United States Roblox Full time $250,000 - $400,000 per year

    With the Roblox Ads business growing at a rapid rate, we are building large-scale ads machine learning infrastructure to deliver effective performance ads to our users and more business value to our advertisers. As a Principal TLM, you'll lead the Ads ML Infra team, build scalable, reliable, and high-performance infrastructure that powers ML systems across our...


  • San Francisco, California, United States Attis Full time $260,000 per year

    Head of Machine Learning – Generative AI for the Physical World. Overview: A rare opportunity has emerged for a visionary Head of Machine Learning to build the core intelligence for a stealth-mode, well-funded AI company. This foundational leadership role is for someone passionate about teaching machines to understand and engineer the physical world, moving...


  • San Francisco, California, United States Acceler8 Talent Full time $120,000 - $200,000 per year

    Machine Learning Engineer (Inference). We are seeking an inference-focused Machine Learning Engineer to join a Stanford spin-out scale-up building a foundational infrastructure layer for AI inference. The team was founded on the back of a successful exit, with the core of the previous founding team creating their new venture. Their aim is to dramatically...


  • San Francisco, California, United States Facebook Full time $125,000 - $175,000 per year

    Company Description: Meta, formerly known as Facebook, builds technologies that help connect people, find communities, and grow businesses. Launched in 2004, Facebook revolutionized the way people connect, and subsequent apps like Messenger, Instagram, and WhatsApp further empowered billions globally. Meta is progressing beyond 2D screens toward augmented and...


  • San Francisco, California, United States TechLink Resources, Inc Full time $120,000 - $200,000 per year

    Computer Vision/Machine Learning Engineer. The Ad Platforms organization within Company Technology is fully responsible for building, enhancing and maintaining the high-performance, distributed, microservice-based Advertising Platform across all of Company online properties. We build and maintain proprietary technology, ranging from ad serving and ad delivery,...


  • San Francisco, California, United States Gameer Full time $200,000 - $250,000 per year

    Company Description: Gameer is the first AI game generator that turns text prompts into fully playable worlds in less than one minute. We see instant world generation as a gateway to a new era of gaming, where anyone can become a game creator without code. Our vision is to democratize game creation and open up new possibilities for creativity in the gaming...