Senior GenAI Model Research Architect/Engineer

4 weeks ago


Palo Alto, United States Tykhe Inc Full time

Would you be interested in exploring a perm full-time role for our start-up in Palo Alto, CA who are specialized in building Frontier and Foundational LLM.


As an Architect, you should be expertise in architecting the scalable training methodologies, implement the state-of-art neural architecture (Noval Neural network).


As an Engineer: efficiently train frontier and foundation multimodal large language models.

In this hands-on role, you will optimize and implement state of art neural architecture, robust training and inference infrastructure to efficiently take complex models with hundreds of billions and trillions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.


Key Responsibilities:

  • Architect Distributed Training Systems: Design and implement highly scalable distributed training pipelines for LLMs and frontier models, leveraging model parallelism (tensor, pipeline, expert) and data parallelism techniques.
  • Optimize Performance: Utilize deep knowledge of CUDA, C++, and low-level optimizations to enhance model training speed and efficiency across diverse hardware configurations.
  • Implement Novel Techniques: Research and apply cutting-edge parallelism techniques like Flash
  • Attention to accelerate model training and reduce computational costs.
  • Framework Expertise: Demonstrate proficiency in deep learning frameworks such as PyTorch, TensorFlow, and JAX, and tailor them for distributed training scenarios.
  • Scale to Hundreds of Billions of Parameters: Work with massive models, ensuring stable and efficient training across distributed resources.
  • Evaluate Scaling Laws: Design and conduct experiments to analyze the impact of model size, data, and computational resources on model performance.
  • Collaborate: Partner closely with research scientists and engineers to integrate research findings into production-ready training systems.


Please reach out to Jia for more information about the role and clients.



  • Palo Alto, United States Tykhe Inc Full time

    Would you be interested in exploring a perm full-time role for our start-up in Palo Alto, CA who are specialized in building Frontier and Foundational LLM.As an Architect, you should be expertise in architecting the scalable training methodologies, implement the state-of-art neural architecture (Noval Neural network).As an Engineer: efficiently train...


  • Palo Alto, California, United States Tykhe Inc Full time

    Exciting Opportunity for a Lead Research Scientist/EngineerWe are a dynamic start-up focused on advancing GenAI infrastructure, particularly in the realms of Voice, Audio, Speech, Vision, and Multi-modal platforms.Your Role: As a key player in our organization, you will engage in the design, development, training, fine-tuning, and deployment of...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Research Engineer, Foundation Models, Self-DrivingTesla is on the lookout for outstanding software engineers to contribute to the development of Tesla AI's foundational models. You will collaborate with a select group of elite deep learning professionals to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is on the lookout for outstanding software engineers to contribute to the development of AI's foundation models. You will collaborate with a select group of elite deep learning specialists to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will facilitate...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is on the lookout for outstanding software engineers to contribute to the development of AI's foundation models. You will collaborate with a select group of elite deep learning professionals to create cutting-edge neural networks and expand the horizons of AI research and innovation. Your contributions will facilitate...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is in search of outstanding software engineers to advance the development of AI's foundation models. You will collaborate with a select group of elite deep learning professionals to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will facilitate the...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Research Engineer, Foundation Models, Self-DrivingTesla is on the lookout for outstanding software engineers to develop the foundation models for Tesla AI. You will collaborate with a select group of elite deep learning specialists to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will...

  • Research Engineer

    3 months ago


    Palo Alto, United States Acceler8 Talent Full time

    Join US as a Founding Machine Learning Research EngineerAre you passionate about advancing AI systems and tackling complex challenges in machine learning? We are seeking enthusiastic individuals to join our pioneering team as Founding ML Research Engineers. This role offers both junior and senior opportunities, allowing individuals at different stages of...

  • Research Engineer

    3 months ago


    Palo Alto, United States Acceler8 Talent Full time

    Join US as a Founding Machine Learning Research EngineerAre you passionate about advancing AI systems and tackling complex challenges in machine learning? We are seeking enthusiastic individuals to join our pioneering team as Founding ML Research Engineers. This role offers both junior and senior opportunities, allowing individuals at different stages of...


  • Palo Alto, United States Tykhe Inc Full time

    Would you be interested in exploring a perm full-time role for our start-up in Palo Alto, CA who are specialized in building GenAI infrastructure concentrating on the Voice/Audio/Speech, Vision, Multi-modal platforms.If you are an expertise in any of this space: design, develop, train, fine-tune, implement state-of-art optimizing techniques and deploy these...


  • Palo Alto, United States Tykhe Inc Full time

    Would you be interested in exploring a perm full-time role for our start-up in Palo Alto, CA who are specialized in building GenAI infrastructure concentrating on the Voice/Audio/Speech, Vision, Multi-modal platforms.If you are an expertise in any of this space: design, develop, train, fine-tune, implement state-of-art optimizing techniques and deploy these...


  • Palo Alto, United States Tesla Full time

    What to ExpectAs a member of the Dojo Machine Learning team, you will be responsible for developing and optimizing simulations of the architecture of a massively parallel machine for AI training. The ideal candidate will have a strong background in computer architecture, analytical and cycle-based simulation, and AI workloads, with a passion for delivering...


  • Palo Alto, California, United States Tykhe Inc Full time

    Opportunity Overview: We are seeking a highly skilled individual to join Tykhe Inc. as a Lead Research Scientist/Engineer. Our organization is at the forefront of developing advanced GenAI infrastructure, focusing on Voice, Audio, Speech, Vision, and Multi-modal platforms.Role Responsibilities:Design, develop, and optimize state-of-the-art models in the...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Research Engineer, Foundation Models, Self-DrivingTesla is in search of outstanding software engineers to develop the foundation models for Tesla AI. You will collaborate with a select group of elite deep learning specialists to create cutting-edge neural networks and advance the frontiers of AI research and innovation. Your contributions will facilitate the...


  • Palo Alto, California, United States RI Research Instruments GmbH Full time

    RI Research Instruments GmbH is dedicated to pioneering multimodal AI technologies that enhance human creativity and capabilities. We recognize that true intelligence requires a multimodal approach. Our focus is on advancing beyond traditional language models to develop systems that can perceive, comprehend, and interact with the world around us. We are in...


  • Palo Alto, California, United States OPPO US Research Center Full time

    Job OverviewWe are seeking a highly skilled Senior Machine Learning Engineer to join our Research and Development team at OPPO US Research Center. As a key member of our technology innovation leadership team, you will play a crucial role in refining our recommender system, search algorithms, and ad targeting algorithms.Key ResponsibilitiesLead Algorithmic...


  • Palo Alto, United States Tesla Full time

    What to ExpectAt Tesla, you will have access to unparalleled resources that set us apart from other companies in the AI industry. You will have access to the largest self-driving dataset in the world, providing a unique, and perhaps the only, environment to investigate scaling laws for sequential decision-making problems. Tesla also offers one of the highest...


  • Palo Alto, California, United States RI Research Instruments GmbH Full time

    RI Research Instruments GmbH is dedicated to advancing multimodal artificial intelligence to enhance human creativity and capabilities. We recognize that multimodality is essential for true intelligence. Our goal is to transcend traditional language models by integrating vision into our systems. We are focused on developing and scaling multimodal foundation...


  • Palo Alto, California, United States Ford Motor Company Full time

    Job SummaryWe are seeking a highly skilled Senior Powertrain Thermal System Modeling Engineer to join our team at Ford Motor Company. As a key member of our Electric Vehicle Digital Design (EVDD) team, you will play a critical role in developing the next generation of electric powertrain architectures.Key ResponsibilitiesDevelop an integrated powertrain...


  • Palo Alto, California, United States Ford Motor Company Full time

    Job SummaryWe are seeking a highly skilled Senior Powertrain Thermal System Modeling Engineer to join our team at Ford Motor Company. As a key member of our Electric Vehicle Digital Design (EVDD) team, you will play a critical role in developing the next generation of electric powertrain architectures.Key ResponsibilitiesDevelop an integrated powertrain...