Lead Generative AI Engineer/Scientist

3 months ago


Palo Alto, United States Tykhe Inc Full time

About the Company: One of our well-established start-up clients in Palo Alto, CA is looking for an experienced Lead Generative AI Engineer/Scientist to train, optimize, scale, and deploy a variety of generative AI models, such as large language models, voice/speech foundation models, and vision and multi-modal foundation models, using cutting-edge techniques and frameworks. In this hands-on role, you will architect and implement state-of-the-art neural architectures and robust training and inference infrastructure to efficiently take complex models with billions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.


Responsibilities:

  • Architect and refine foundation model infrastructure to support the deployment of optimized AI models with a focus on C/C++, CUDA, and kernel-level programming enhancements.
  • Implement state-of-the-art optimization techniques, including quantization, distillation, sparsity, streaming, and caching, for model performance enhancements.
  • Spearhead the development of vision pipelines, ensuring scalable training and inference workflows for foundation models with tens to hundreds of billions of parameters.
  • Innovate on state-of-the-art architectures for panoptic segmentation, image classification, and image generation. The candidate is expected to experiment with the internals of Vision Transformers and convolutional models such as ConvNeXt, as well as CLIP, Visual Question Answering (VQA), and diffusion models. Experience with AI art, image prompts, and conditional image generation is an additional advantage.
  • Design, develop, and advance the state of the art in large multimodal models such as GPT-4o, Gemini, and Chameleon. Make architectural choices across dense vs. mixture-of-experts designs, early vs. deep fusion, modality encoders (VQ-GAN, ViT, CLIP/SigLIP), and decoders (Stable Diffusion, Stable Cascade, AudioLDM).
  • Execute training and inference processes with a key emphasis on minimizing latency and maximizing throughput, utilizing GPU clusters and custom hardware.
  • Innovate on current model deployment platforms, employing AWS, GCP, and GPU clusters, to enable high scalability and responsiveness.
  • Integrate and tailor frameworks and platforms such as PyTorch, TensorFlow, DeepSpeed, Lightning, FSDP, and Habana to accelerate model training and inference.
  • Advance the deployment infrastructure with MLOps frameworks such as Kubeflow, MosaicML, Anyscale, and Terraform, ensuring robust development and deployment cycles.
  • Enhance post-deployment mechanisms with exhaustive testing, real-time monitoring, and sophisticated explainability and robustness checks.
  • Drive continuous improvement initiatives for deployed models with automated pipelines that detect drift and performance degradation.
  • Lead the charge in model management, encompassing version control, reproducibility, and lineage tracking.
  • Cultivate a culture of high-performance computing and optimization within the AI/ML domain, propagating best practices and knowledge sharing.


Qualifications:

  • Ph.D. with 5+ years or MS with 8+ years of experience in ML Engineering, Data Science, or related fields.
  • Demonstrated expertise in high-performance computing with proficiency in Python, C/C++, CUDA, and kernel-level programming for AI applications.
  • Extensive experience in the optimization of training and inference for large-scale AI models, including practical knowledge of quantization, distillation, and Vision Pipelines.
  • An understanding of diffusion models (e.g., DDPM), variational autoencoders, Bayesian modeling, stochastic variational inference (SVI), and reinforcement learning is an additional benefit.
  • Experience building generative AI foundation models with tens to hundreds of billions of parameters.
  • Experience with AI training job scheduling, orchestration, and management via SLURM and Kubeflow.
  • Proven success in deploying optimized ML systems on a large scale, utilizing cloud infrastructures and GPU resources.
  • In-depth understanding and hands-on experience with advanced model optimization frameworks such as DeepSpeed, FSDP, PyTorch, TensorFlow, and corresponding MLOps tools.
  • Familiarity with contemporary MLOps frameworks such as MosaicML, Anyscale, and Terraform, and their application in production environments.
  • Strong grasp of state-of-the-art ML infrastructures, deployment strategies, and optimization methodologies.
  • An innovative problem-solver with strategic acumen and a collaborative mindset.
  • Exceptional communication and team collaboration skills, with an ability to lead and inspire.


For more details, please reach out to Jia at jia@lagomtechnologies.com to set up a time to discuss the role.


