AI Model Optimization Specialist

2 months ago


California, United States | Acceler8 Talent | Full time

What We Are Building

As we grow, our focus is on working with commercial partners to customize and enhance our models to fit their specific business objectives. Our success in developing, aligning, and deploying cutting-edge models in our highly responsive consumer chatbot gives us a strong foundation to build on. With substantial funding and ample H100 capacity, we have built robust infrastructure and streamlined workflows for high-quality finetuning. By joining us, you will apply your expertise in an environment that values creativity and collaboration.

About Our Organization

We are a compact, multidisciplinary AI studio. We have developed several state-of-the-art language models, including various iterations, and created a personal assistant. Presently, our studio is focused on finetuning and deploying models tailored for specific applications for our commercial clientele.

We are convinced that artificial intelligence heralds a period of rapid transformation. Our name symbolizes this pivotal moment, and our designation as a public benefit corporation empowers us to prioritize the welfare and satisfaction of our partners, users, and wider stakeholders above all else.

Position Overview

Research Engineer, Technical Staff Member (Inference)

In line with our commitment to deploying high-efficiency models for enterprise applications, our inference team ensures that these models run effectively and efficiently in real-world conditions. Research engineers in this role focus on optimizing model inference, reducing response latency, and improving throughput without compromising model quality, so that deployments in enterprise environments remain dependable.

This position is suitable for you if you:

  • Possess experience in deploying and optimizing large language models for inference in both cloud and on-premises settings.
  • Are adept at utilizing tools and frameworks for model optimization and acceleration, such as ONNX, TensorRT, or TVM.
  • Enjoy troubleshooting and resolving intricate issues related to model performance and scalability.
  • Have a solid grasp of the trade-offs involved in model inference, including hardware constraints and real-time processing demands.
  • Are proficient in PyTorch and familiar with infrastructure management tools like Docker and Kubernetes for deploying inference pipelines.

We do not mandate a specific educational background or a predetermined number of years of experience. We are eager to learn about your projects. Please share examples of your best work, including but not limited to links to open-source contributions, personal projects, or a cover letter detailing past projects that you take pride in.
