Senior AI Performance Optimization Specialist
5 days ago
We are searching for a seasoned Deep Learning Performance Engineer to optimize the performance of our large generative AI models.
The ideal candidate will have a deep understanding of deep learning performance bottlenecks, kernel optimization, and distributed training strategies.
This role is perfect for someone with a strong programming background in Python and C++, experience in training large models using Python & PyTorch and/or TensorFlow, and a proven track record of optimizing large-scale models.
As a member of our team, you will play a critical role in ensuring our models run efficiently on clusters, leveraging advanced techniques and tools to enhance their performance.
Key responsibilities include analyzing and optimizing the performance of massively parallel and distributed systems, implementing and fine-tuning distributed training strategies for multi-GPU and multi-node environments, and profiling model performance and identifying bottlenecks using tools like NVIDIA NSight Systems, PyTorch Profiler, and TensorFlow Profiler.
Additionally, you will develop and maintain benchmarking suites for continuous performance monitoring and contribute to the development of high-performance CUDA, Triton, C++ and PyTorch code.
Requirements- Master's or PhD in Computer Science, Electrical Engineering, or a related field
- 5+ years of experience in optimizing deep learning models, preferably in a production environment
- Strong programming skills in Python and C++
- Proven track record of optimizing large-scale models (10B+ parameters)
- Deep understanding of GPU architecture and CUDA programming
- Experience in entire development pipeline from data processing, preparation & data loading to training and inference
- Experience optimizing and deploying inference workloads for throughput and latency across the stack
- Demonstrated expertise in high-performance computing using NVIDIA Triton and CUDA
- Demonstrated ability to significantly improve model inference and training speeds through low-level optimizations
Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law.
-
San Francisco, California, United States Perplexity AI Full timePerplexity AI is a leading innovator in conversational AI technology.SalaryThe estimated annual salary for this role is $240,000, reflecting the company's commitment to attracting top talent and rewarding expertise.Job DescriptionWe're seeking an experienced Data Science Specialist to join our team and play a key role in optimizing our conversational AI...
-
Senior AI Model Optimization Specialist
5 days ago
San Francisco, California, United States Perplexity AI Full timeOverviewPerplexity AI is at the forefront of conversational search technology, having achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with our mobile apps installed over 1 million times across iOS and Android...
-
High Performance Optimization Engineer
1 day ago
San Francisco, California, United States Liquid AI Full timeWe are seeking a highly skilled engineer at Liquid AI to optimize inference stacks tailored to diverse hardware platforms. This role is ideal for an expert with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures.Key ResponsibilitiesDesign and optimize inference stacks for GPUs, CPUs, and...
-
AI Inference Software Architect
7 days ago
San Francisco, California, United States Untether AI Full timeSoftware Architect for AI InferenceWe are seeking an exceptional Software Architect to join our team at Untether AI, where you will play a key role in designing and developing software that interacts with our innovative chip. As part of our top-notch team, you will collaborate closely with hardware engineers and fellow software engineers to create software...
-
AI Innovations Specialist
5 days ago
San Francisco, California, United States Untether AI Full timeAt Untether AI, we're pushing the boundaries of artificial intelligence with our revolutionary new architecture that achieves unparalleled performance and efficiency in neural net inference. Our groundbreaking technology has already garnered significant interest from smart clients looking to be at the forefront of innovation.We're seeking a seasoned...
-
Senior Conversational AI Engineer
20 hours ago
San Francisco, California, United States Perplexity AI Full timeAbout Perplexity AIWe're a cutting-edge tech company revolutionizing the way people interact with information. Our mission is to empower users with intuitive and personalized experiences.As we continue to grow, we're seeking talented engineers to join our team and shape the future of conversational AI.Compensation PackageWe offer a competitive salary range...
-
Senior AI Engineering Specialist
5 days ago
San Francisco, California, United States Resolve Full timeWe are seeking a skilled Senior AI Engineering Specialist to join our team at Resolve, a cutting-edge technology company revolutionizing the way we approach artificial intelligence. As an integral part of our engineering team, you will contribute to the development and deployment of AI-powered workflows, pushing the boundaries of what is possible with...
-
San Francisco, California, United States Unreal Gigs Full timeAbout the RoleWe are seeking a seasoned High-Performance AI Infrastructure Specialist to join our team at Unreal Gigs. In this role, you will be responsible for designing, developing, and optimizing scalable infrastructure solutions to support machine learning workflows.
-
Senior AI Ecosystem Developer
11 hours ago
San Francisco, California, United States Naptha AI Full timeUnlock the future of AI agent development as our Senior AI Ecosystem Developer. We're seeking a visionary to build and nurture relationships with pioneering AI developers, shaping the next wave of AI companies.Naptha AI is at the forefront of creating the foundational infrastructure for the next generation of AI systems, enabling frontier developers to build...
-
AI Systems Optimization Expert
5 days ago
San Francisco, California, United States Naptha AI Full timeJob OverviewWe're seeking an exceptional Agent Behavior Scientist to study and shape how AI agents interact, collaborate, and evolve within large-scale networks. This is a rare opportunity to define the patterns that will govern the next generation of agent systems.
-
Advanced AI Infrastructure Engineer
5 days ago
San Francisco, California, United States Together AI Full timeAbout the RoleWe are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform.Company OverviewTogether AI is committed to creating open and transparent AI systems that drive...
-
San Jose, California, United States Cypress HCM Full timeJob Title: AI Optimization Strategist for Performance MarketingAbout Us:Cypress HCM is a leading multimedia and creative software company, pioneering the use of generative AI tools to enhance performance marketing. We are seeking an exceptional Ai Optimization Strategist to join our team and help our customers unlock the full potential of our cutting-edge...
-
San Francisco, California, United States Scale AI Full timeResearch Role OverviewScale AI's Generative AI team is pushing the boundaries of artificial intelligence by developing innovative models, algorithms, and supervision techniques. As a Senior AI Research Scientist for Generative Models, you will play a critical role in advancing our research agenda and driving product development. Your expertise in Generative...
-
Senior Backend Software Engineer
4 days ago
San Francisco, California, United States Stack AI Full timeAbout Stack AIWe are a fast-growing startup revolutionizing access to Large Language Models, enabling anyone to build AI-powered applications with positive impact.Our No-Code platform seamlessly integrates top AI models, common data sources, and SaaS tools, making it easy for developers to focus on product and business growth.We value innovation, agility,...
-
Senior Legal Operations Specialist
11 hours ago
San Francisco, California, United States Cambio AI Inc. Full timeUnleash Your Potential as a Senior Legal Operations Specialist at Cambio AI Inc.We are seeking an exceptional individual to join our team as a Senior Legal Operations Specialist. This is a unique opportunity to leverage your expertise in contract lifecycle management and collaboration to drive operational excellence within our organization. As a seasoned...
-
AI Visionary
7 days ago
San Francisco, California, United States Asari AI Full timeDiscover a rewarding opportunity at Asari AI, where innovation and passion converge. Our team of technologists is dedicated to building cutting-edge AI agents that empower people to create new products, services, and discoveries.Salary: $150K - $250KAs a key member of our team, you will play a pivotal role in shaping the future of AI. Your expertise in...
-
AI Innovations Specialist
5 days ago
San Francisco, California, United States AI Tech Suite Full timeAbout AI Tech SuiteAt AI Tech Suite, we're pioneering the future of artificial intelligence and machine learning. Our cutting-edge solutions empower businesses to harness the full potential of data-driven insights.We're a dynamic team of innovators, united by our passion for revolutionizing industries through AI-powered technologies.Job SummaryWe're seeking...
-
AI Sales Strategist
5 days ago
San Francisco, California, United States Obviously AI Full timeWe are seeking an experienced AI Sales Strategist to join our team at Obviously AI. As a leader in the AI industry, we are committed to building and maintaining strong relationships with our customers and partners.Our ideal candidate will have a proven track record of success in sales and sales management, with a deep understanding of AI/ML technologies and...
-
San Francisco, California, United States Spice AI Full timeAbout UsSpice AI is a technology company creating innovative solutions to help developers build intelligent applications and agents that learn and adapt. Founded in 2021 by Microsoft and GitHub alumni Luke Kim and Phillip LeBlanc, we're backed by top industry leaders and venture capital firms.We're passionate about empowering developers with the tools and...
-
AI Infrastructure Engineer
2 days ago
San Francisco, California, United States Naptha AI Full timeAbout Naptha AIWe are seeking exceptional Software Engineering interns to join Naptha AI and contribute to building the future of AI agent infrastructure.This internship offers hands-on experience working with frontier AI technology, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.As...