Senior AI Model Optimization Engineer

3 days ago


San Francisco, California, United States Lumicity Full time

About Lumicity

We are a pioneering company in generative video models, pushing the boundaries of AI innovation. With a strong presence in San Francisco and over $10M in funding, we're expanding our team to tackle cutting-edge challenges.

Salary: $180,000 - $220,000 per annum

The Role

We're seeking a highly skilled Senior AI Model Optimization Engineer to join our team. The ideal candidate will have expertise in model quantization, parallel inference, and deploying ML models in production. If you're passionate about optimizing large-scale data collection and curation systems for efficient GPU-based training pipelines, this is the perfect opportunity.

  • Design and implement distributed data collection and curation systems for large-scale model training and inference.
  • Optimize GPU-based training pipelines for efficiency and speed, focusing on large-scale model deployment.
  • Accelerate inference for diffusion models and transformers using techniques like model quantization and parallel inference.

What You Bring

  • 5+ years of experience in Python or Golang with a strong emphasis on performance optimization.
  • Expertise in model quantization, parallel inference, and deploying ML models in production.
  • Hands-on experience with PyTorch, TensorRT, Triton, and CUDA kernels for accelerating model inference.

Lumicity offers a dynamic work environment, competitive salary, and opportunities for growth and innovation.



  • San Francisco, California, United States Perplexity AI Full time

    Overview: Perplexity AI is at the forefront of conversational search technology, having achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with our mobile apps installed over 1 million times across iOS and Android...


  • San Francisco, California, United States Perplexity AI Full time

    About Perplexity AI: We're a cutting-edge tech company revolutionizing the way people interact with information. Our mission is to empower users with intuitive and personalized experiences. As we continue to grow, we're seeking talented engineers to join our team and shape the future of conversational AI. Compensation Package: We offer a competitive salary range...


  • San Francisco, California, United States Relyance AI Full time

    Job Summary: We're seeking an exceptional Senior Software Engineer - ML to lead the development of our AI solutions. As a key member of the team, you'll collaborate with cross-functional stakeholders to design and build scalable, high-performance systems that meet our customers' needs. Your expertise in machine learning and natural language processing will be...


  • San Francisco, California, United States Liquid AI Full time

    Company Overview: Liquid AI is a cutting-edge technology company that specializes in developing innovative artificial intelligence solutions. We are seeking a highly skilled Inference Performance Specialist to join our team and contribute to the development of our next-generation AI products. The ideal candidate will have extensive experience in optimizing ML...


  • San Francisco, California, United States Scale AI Full time

    Research Role Overview: Scale AI's Generative AI team is pushing the boundaries of artificial intelligence by developing innovative models, algorithms, and supervision techniques. As a Senior AI Research Scientist for Generative Models, you will play a critical role in advancing our research agenda and driving product development. Your expertise in Generative...


  • San Francisco, California, United States Scale AI Full time

    About Scale AI: At Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...


  • San Francisco, California, United States Scale AI Full time

    Job Overview: We are seeking a highly skilled and experienced AI Model Development Manager to lead our Generative AI team at Scale AI. As the primary point of contact for this role, you will be responsible for managing a team of research engineers and ML engineers focused on delivering scalable, production-ready solutions to support our GenAI Data...


  • San Francisco, California, United States Genmo Full time

    Job Description: We are seeking a skilled Senior AI Optimization Specialist to join our team at Genmo. As a key member of our research lab, you will play a crucial role in optimizing the performance of our large generative AI models. With your expertise in deep learning performance bottlenecks, kernel optimization, and distributed training strategies, you...


  • San Francisco, California, United States Perplexity AI Full time

    Leveraging Expertise in Large Language Models: Are you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...


  • San Francisco, California, United States Together AI Full time

    About the Role: We are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform. Company Overview: Together AI is committed to creating open and transparent AI systems that drive...


  • San Francisco, California, United States Scale AI, Inc. Full time

    About Scale AI, Inc.: We are accelerating the development of AI applications at Scale AI, Inc. Our mission is to make the transition from traditional software to AI faster across every industry. Our products power the world's most advanced LLMs, generative models, and computer vision models. Generative AI Data Engine: The data we produce is some of the most...

  • Fullstack Developer

    4 days ago


    San Francisco, California, United States Scale AI Full time

    About Scale AI's Mission: We're making the transition from traditional software to AI happen faster across every industry. Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement Learning from Human Feedback), human data generation, model evaluation, safety, and alignment. As a Senior...


  • San Francisco, California, United States Distyl AI Full time

    Transformative AI Solutions: Distyl AI is at the forefront of developing AI systems that drive real-world impact. As a Senior AI Software Engineer, you will play a crucial role in designing and delivering production-grade AI solutions that meet the evolving needs of our clients. With your expertise, we can transform industries and revolutionize the way...


  • San Francisco, California, United States Liquid AI Full time

    Company Overview: Liquid AI is a cutting-edge technology company at the forefront of artificial intelligence innovation. We're dedicated to harnessing the power of machine learning to drive exceptional outcomes in various industries. Salary: $140,000 - $160,000 per annum, depending on experience and qualifications. Job Description: As we prepare to deploy our...


  • San Francisco, California, United States Jupiter Power Full time

    About the Role: We are seeking an exceptional Senior AI Model Developer to join our team at Jupiter Power. As a key member of our organization, you will be responsible for designing and developing cutting-edge large-scale models that drive innovation and growth. Key Responsibilities: Develop and optimize large-scale language models to meet business needs; Design...

  • Senior AI Engineer

    14 hours ago


    San Jose, California, United States K&K Global Talent Solutions Inc. Full time

    Job Overview: K&K Global Talent Solutions Inc. is seeking a highly skilled Senior AI Engineer to join our team in the deployment and optimization of AI models on Ryzen AI and other AI-enabled AMD CPUs. Key Responsibilities: Collaborate with R&D teams to improve the quality and usability of AMD's development tools. Develop and validate debug, optimization, and...


  • San Francisco, California, United States Scale AI Full time

    Company Overview: Skyrocketing the development of AI applications is our mission at Scale AI. For 8 years, we've been the leading AI data foundry, fueling groundbreaking advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round accelerates the abundance of frontier data to pave the road to Artificial...


  • San Francisco, California, United States Scale AI Full time

    About Us: We believe that the transition from traditional software to AI is one of the most important shifts of our time. At Scale AI, our mission is to make that happen faster across every industry. Our team is transforming how organizations build and deploy AI. Job Overview: We are expanding our team to accelerate the development of AI applications. Our...

  • Software Engineer

    4 weeks ago


    San Francisco, California, United States Stack AI Full time

    About Stack AI: We're a fast-growing startup on a mission to democratize access to Large Language Models. Our user-friendly and intuitive No-Code platform integrates the best AI models, common data sources, and SaaS tools. Our traction is impressive: launched 8 months ago with over 65,000 users and 300+ paying customers, including public companies and...


  • San Francisco, California, United States Stack AI Full time

    About Stack AI: We are a fast-growing startup revolutionizing access to Large Language Models, enabling anyone to build AI-powered applications with positive impact. Our No-Code platform seamlessly integrates top AI models, common data sources, and SaaS tools, making it easy for developers to focus on product and business growth. We value innovation, agility,...