AI Model Efficiency Expert
20 hours ago
We are seeking an AI Model Efficiency Expert to join our team at Genmo. In this role, you will analyze and optimize the performance of our massive parallel and distributed systems. You will also implement and fine-tune distributed training strategies for multi-GPU and multi-node environments and develop and maintain benchmarking suites for continuous performance monitoring.
Responsibilities:
- Analyze and optimize the performance of massively parallel and distributed systems
- Implement and fine-tune distributed training strategies for multi-GPU and multi-node environments
- Implement high-performance CUDA, Triton, C++ and PyTorch code.
- Profile model performance and identify bottlenecks using tools like NVIDIA NSight Systems, PyTorch Profiler, and TensorFlow Profiler
- Develop and maintain benchmarking suites for continuous performance monitoring
Requirements:
- Master's or PhD in Computer Science, Electrical Engineering, or a related field
- 5+ years of experience in optimizing deep learning models, preferably in a production environment
- Strong programming skills in Python and C++. Experience in training large models using Python & PyTorch and/or TensorFlow including their distributed training frameworks.
- Proven track record of optimizing large-scale models (10B+ parameters)
- Deep understanding of GPU architecture and CUDA programming
- Experience in entire development pipeline from data processing, preparation & data loading to training and inference.
- Demonstrated expertise in high-performance computing using NVIDIA Triton and CUDA
- Demonstrated ability to significantly improve model inference and training speeds through low-level optimizations
-
AI Model Development Manager
1 week ago
San Francisco, California, United States Scale AI Full timeJob OverviewWe are seeking a highly skilled and experienced AI Model Development Manager to lead our Generative AI team at Scale AI. As the primary point of contact for this role, you will be responsible for managing a team of research engineers and ML engineers focused on delivering scalable, production-ready solutions to support our GenAI Data...
-
Data Engineering Expert for AI Model Development
3 weeks ago
San Francisco, California, United States Scale AI Full timeAbout Scale AIAt Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...
-
San Francisco, California, United States Scale AI Full timeResearch Role OverviewScale AI's Generative AI team is pushing the boundaries of artificial intelligence by developing innovative models, algorithms, and supervision techniques. As a Senior AI Research Scientist for Generative Models, you will play a critical role in advancing our research agenda and driving product development. Your expertise in Generative...
-
AI Model Compression Expert
7 days ago
San Diego, California, United States Kneron Full timeWe are looking for a talented AI Model Compression Expert to join our team at Kneron. As a key member of our team, you will be responsible for developing and implementing model compression techniques, including QAT, model distillation, pruning, quantization, and others for deep learning models.Key Responsibilities:Develop and implement novel deep neural...
-
AI Model Training and Deployment Expert
18 hours ago
San Jose, California, United States Tik Tok Full timeAbout the RoleThe AI Model Training and Deployment Expert will design, architect, and implement backend systems to deploy generative AI models for image and video generation use cases.Responsibilities:Design and implement highly efficient engineering systems for generative AI tasks.Optimize the performance of generative AI model training and serving.Build...
-
AI Model Developer
1 day ago
San Francisco, California, United States Databricks Full timeAbout DatabricksDatabricks is a cloud-based platform that enables companies to solve complex problems using machine learning and deep learning models. Our mission is to democratize access to modern AI technology and empower our customers to achieve their goals.Job Title: Deep Learning ExpertLocation: Remote (USA)DescriptionWe are seeking a highly skilled...
-
AI Software Engineering Expert
3 weeks ago
San Francisco, California, United States Perplexity AI Full timeAbout the RoleWe are seeking an experienced Full Stack AI Software Engineer to help revolutionize the way people interact online.ResponsibilitiesPropose novel product features that can be built with LLMs and integrate them into our product.Stay up-to-date on new features released from external LLM providers and in-house researchers.Ensure high-quality and...
-
San Francisco, California, United States Scale AI Full timeOverviewSkyrocket the advancement of AI across industries at Scale, a pioneering company in AI research and development. Our mission is to accelerate the transition from traditional software to AI, empowering organizations to build and deploy cutting-edge models.
-
Large Language Model Engineer
2 days ago
San Francisco, California, United States Perplexity AI Full timeLeveraging Expertise in Large Language ModelsAre you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...
-
Conversational AI Expert
21 hours ago
San Francisco, California, United States Perplexity AI Full timeAt Perplexity AI, we're pushing the boundaries of conversational AI. As a Conversational AI Expert, you'll be instrumental in shaping the future of our answer machine.About the RoleWe're seeking an experienced Machine Learning Engineer to join our team and help us improve query understanding for every answer.This is a full-time position that...
-
AI Data Engineer for Generative Models
2 days ago
San Francisco, California, United States Scale AI, Inc. Full timeAbout Scale AI, Inc.We are accelerating the development of AI applications at Scale AI, Inc. Our mission is to make the transition from traditional software to AI faster across every industry.Our products power the world's most advanced LLMs, generative models, and computer vision models.Generative AI Data EngineThe data we produce is some of the most...
-
Fullstack Developer
2 days ago
San Francisco, California, United States Scale AI Full timeAbout Scale AI's Mission">We're making the transition from traditional software to AI happen faster across every industry.Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement Learning with Human Feedback), human data generation, model evaluation, safety, and alignment.As a Senior...
-
AI Innovation Engineer
2 weeks ago
San Francisco, California, United States Decagon AI, Inc. Full timeAbout Decagon AI, Inc.We are a pioneering conversational AI company that empowers enterprises to deliver exceptional customer experiences. With our cutting-edge technology, we've established ourselves as a leader in the industry, working with prominent clients like Duolingo, Notion, and Eventbrite.Our journey has been marked by significant milestones,...
-
Federal AI Development Expert
1 week ago
San Francisco, California, United States Scale AI, Inc. Full timeAbout Us: We believe that everyone should be able to bring their whole selves to work. At Scale AI, we are proud to be an affirmative action employer and inclusive and equal opportunity workplace. We are expanding our team to accelerate the development of AI applications and power the world's most advanced LLMs, generative models, and computer vision models.
-
AI Instructional Specialist
19 hours ago
San Francisco, California, United States Scale AI Full timeUnlocking Human Potential through AI TrainingWe are seeking a talented AI Instructional Specialist to join our team and develop innovative training solutions that unlock human potential. As an integral part of our organization, you will work closely with subject matter experts, product developers, and training stakeholders to create engaging and effective...
-
AI Engineering Specialist
3 weeks ago
San Francisco, California, United States Abridge AI Inc. Full timeAbridge AI Inc. is a trailblazing organization that empowers deeper understanding in healthcare through artificial intelligence.Estimated Salary: $185,000 USD - $265,000+ USD per year + EquityWe are seeking experienced Full Stack Engineers to join our growing team and help us build innovative ML-powered solutions for healthcare AI technology.About the...
-
Full Stack AI Developer
1 week ago
San Francisco, California, United States Scale AI Full timeAbout the RoleWe're looking for an entrepreneurial Software Engineer who can take an ambiguous scope and lead the execution of outcomes. You'll be given the opportunity to build products and drive millions of dollars in revenue.You'll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and...
-
AI Visionary
3 weeks ago
San Francisco, California, United States Asari AI Full timeDiscover a rewarding opportunity at Asari AI, where innovation and passion converge. Our team of technologists is dedicated to building cutting-edge AI agents that empower people to create new products, services, and discoveries.Salary: $150K - $250KAs a key member of our team, you will play a pivotal role in shaping the future of AI. Your expertise in...
-
Advanced AI Infrastructure Engineer
3 weeks ago
San Francisco, California, United States Together AI Full timeAbout the RoleWe are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform.Company OverviewTogether AI is committed to creating open and transparent AI systems that drive...
-
AI Innovation Leader for Generative Models
3 weeks ago
San Francisco, California, United States Tatari Full timeTatari, a pioneer in TV advertising revolution, seeks an experienced AI expert to spearhead the development of cutting-edge generative AI models and systems.We combine a sophisticated media buying platform with proprietary analytics to transform TV advertising into an automated, digital-like experience. As a Senior AI Engineer, you will play a pivotal role...