AI Model Efficiency Expert

20 hours ago


San Francisco, California, United States Genmo Full time
About the Role:

We are seeking an AI Model Efficiency Expert to join our team at Genmo. In this role, you will analyze and optimize the performance of our massive parallel and distributed systems. You will also implement and fine-tune distributed training strategies for multi-GPU and multi-node environments and develop and maintain benchmarking suites for continuous performance monitoring.

Responsibilities:
  • Analyze and optimize the performance of massively parallel and distributed systems
  • Implement and fine-tune distributed training strategies for multi-GPU and multi-node environments
  • Implement high-performance CUDA, Triton, C++ and PyTorch code.
  • Profile model performance and identify bottlenecks using tools like NVIDIA NSight Systems, PyTorch Profiler, and TensorFlow Profiler
  • Develop and maintain benchmarking suites for continuous performance monitoring

Requirements:
  • Master's or PhD in Computer Science, Electrical Engineering, or a related field
  • 5+ years of experience in optimizing deep learning models, preferably in a production environment
  • Strong programming skills in Python and C++. Experience in training large models using Python & PyTorch and/or TensorFlow including their distributed training frameworks.
  • Proven track record of optimizing large-scale models (10B+ parameters)
  • Deep understanding of GPU architecture and CUDA programming
  • Experience in entire development pipeline from data processing, preparation & data loading to training and inference.
  • Demonstrated expertise in high-performance computing using NVIDIA Triton and CUDA
  • Demonstrated ability to significantly improve model inference and training speeds through low-level optimizations


  • San Francisco, California, United States Scale AI Full time

    Job OverviewWe are seeking a highly skilled and experienced AI Model Development Manager to lead our Generative AI team at Scale AI. As the primary point of contact for this role, you will be responsible for managing a team of research engineers and ML engineers focused on delivering scalable, production-ready solutions to support our GenAI Data...


  • San Francisco, California, United States Scale AI Full time

    About Scale AIAt Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...


  • San Francisco, California, United States Scale AI Full time

    Research Role OverviewScale AI's Generative AI team is pushing the boundaries of artificial intelligence by developing innovative models, algorithms, and supervision techniques. As a Senior AI Research Scientist for Generative Models, you will play a critical role in advancing our research agenda and driving product development. Your expertise in Generative...


  • San Diego, California, United States Kneron Full time

    We are looking for a talented AI Model Compression Expert to join our team at Kneron. As a key member of our team, you will be responsible for developing and implementing model compression techniques, including QAT, model distillation, pruning, quantization, and others for deep learning models.Key Responsibilities:Develop and implement novel deep neural...


  • San Jose, California, United States Tik Tok Full time

    About the RoleThe AI Model Training and Deployment Expert will design, architect, and implement backend systems to deploy generative AI models for image and video generation use cases.Responsibilities:Design and implement highly efficient engineering systems for generative AI tasks.Optimize the performance of generative AI model training and serving.Build...

  • AI Model Developer

    1 day ago


    San Francisco, California, United States Databricks Full time

    About DatabricksDatabricks is a cloud-based platform that enables companies to solve complex problems using machine learning and deep learning models. Our mission is to democratize access to modern AI technology and empower our customers to achieve their goals.Job Title: Deep Learning ExpertLocation: Remote (USA)DescriptionWe are seeking a highly skilled...


  • San Francisco, California, United States Perplexity AI Full time

    About the RoleWe are seeking an experienced Full Stack AI Software Engineer to help revolutionize the way people interact online.ResponsibilitiesPropose novel product features that can be built with LLMs and integrate them into our product.Stay up-to-date on new features released from external LLM providers and in-house researchers.Ensure high-quality and...


  • San Francisco, California, United States Scale AI Full time

    OverviewSkyrocket the advancement of AI across industries at Scale, a pioneering company in AI research and development. Our mission is to accelerate the transition from traditional software to AI, empowering organizations to build and deploy cutting-edge models.


  • San Francisco, California, United States Perplexity AI Full time

    Leveraging Expertise in Large Language ModelsAre you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...


  • San Francisco, California, United States Perplexity AI Full time

    At Perplexity AI, we're pushing the boundaries of conversational AI. As a Conversational AI Expert, you'll be instrumental in shaping the future of our answer machine.About the RoleWe're seeking an experienced Machine Learning Engineer to join our team and help us improve query understanding for every answer.This is a full-time position that...


  • San Francisco, California, United States Scale AI, Inc. Full time

    About Scale AI, Inc.We are accelerating the development of AI applications at Scale AI, Inc. Our mission is to make the transition from traditional software to AI faster across every industry.Our products power the world's most advanced LLMs, generative models, and computer vision models.Generative AI Data EngineThe data we produce is some of the most...

  • Fullstack Developer

    2 days ago


    San Francisco, California, United States Scale AI Full time

    About Scale AI's Mission">We're making the transition from traditional software to AI happen faster across every industry.Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement Learning with Human Feedback), human data generation, model evaluation, safety, and alignment.As a Senior...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.We are a pioneering conversational AI company that empowers enterprises to deliver exceptional customer experiences. With our cutting-edge technology, we've established ourselves as a leader in the industry, working with prominent clients like Duolingo, Notion, and Eventbrite.Our journey has been marked by significant milestones,...


  • San Francisco, California, United States Scale AI, Inc. Full time

    About Us: We believe that everyone should be able to bring their whole selves to work. At Scale AI, we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are expanding our team to accelerate the development of AI applications and power the world's most advanced LLMs, generative models, and computer vision models.


  • San Francisco, California, United States Scale AI Full time

    Unlocking Human Potential through AI TrainingWe are seeking a talented AI Instructional Specialist to join our team and develop innovative training solutions that unlock human potential. As an integral part of our organization, you will work closely with subject matter experts, product developers, and training stakeholders to create engaging and effective...


  • San Francisco, California, United States Abridge AI Inc. Full time

    Abridge AI Inc. is a trailblazing organization that empowers deeper understanding in healthcare through artificial intelligence.Estimated Salary: $185,000 USD - $265,000+ USD per year + EquityWe are seeking experienced Full Stack Engineers to join our growing team and help us build innovative ML-powered solutions for healthcare AI technology.About the...


  • San Francisco, California, United States Scale AI Full time

    About the RoleWe're looking for an entrepreneurial Software Engineer who can take an ambiguous scope and lead the execution of outcomes. You'll be given the opportunity to build products and drive millions of dollars in revenue.You'll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and...

  • AI Visionary

    3 weeks ago


    San Francisco, California, United States Asari AI Full time

    Discover a rewarding opportunity at Asari AI, where innovation and passion converge. Our team of technologists is dedicated to building cutting-edge AI agents that empower people to create new products, services, and discoveries.Salary: $150K - $250KAs a key member of our team, you will play a pivotal role in shaping the future of AI. Your expertise in...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform.Company OverviewTogether AI is committed to creating open and transparent AI systems that drive...


  • San Francisco, California, United States Tatari Full time

    Tatari, a pioneer in TV advertising revolution, seeks an experienced AI expert to spearhead the development of cutting-edge generative AI models and systems.We combine a sophisticated media buying platform with proprietary analytics to transform TV advertising into an automated, digital-like experience. As a Senior AI Engineer, you will play a pivotal role...