ML Systems Optimization Expert

6 days ago


Mountain View, California, United States MatX Full time

MatX is committed to pushing the boundaries of AI by developing innovative solutions that harness the power of silicon and systems. We're seeking a talented individual to drive our ML performance engineering efforts.

Job Summary

We're looking for a highly skilled professional to design and develop performance-optimized ML solutions. The ideal candidate will have expertise in popular ML frameworks, distributed computing, and high-performance networking.

Responsibilities
  • Design and implement performance models and tooling to inform scheduling decisions for current and future ML models.
  • Develop production-grade libraries for distributed training and serving, optimized for efficiency.
  • Collaborate with cross-functional teams to drive solutions from model development to hardware implementation.
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Proficiency in Python programming.
  • Expertise in popular ML frameworks such as JAX, PyTorch, or TensorFlow.
  • In-depth knowledge of the Transformer architecture.
  • Experience with distributed computing, high-performance networking, or large-scale ML systems.
Preferred Skills
  • Hands-on experience with techniques like flash attention, quantization, pruning, or other systems performance optimizations.
  • Familiarity with parallelism strategies that balance computation, communication, and memory to optimize throughput and latency.
  • Knowledge of performance analysis tools and profilers for large-scale systems.
  • Solid understanding of computer architecture and low-level optimization techniques.
Benefits

This role offers a competitive salary range of $160,000 - $480,000 per annum, plus equity and benefits. We also offer opportunities for professional growth and collaboration with a talented team.



  • Mountain View, California, United States MatX Full time

    Drive Innovation in Machine LearningWe're on a quest to redefine AI model efficiency at MatX. As part of our mission, we're seeking an accomplished leader to guide our machine learning research team towards unparalleled success.Your responsibilities will include:Developing and implementing a comprehensive strategy for the ML research team, encompassing all...


  • Mountain View, California, United States Waymo Full time

    About the Role: We're seeking highly motivated and talented individuals to join our team as Software Engineers for Autonomous Vehicles. As an intern, you'll work on developing tools to instrument ML models and inspect their internal computations, applying optimization techniques, and analyzing their impacts on model quality.Key Responsibilities:Developing...

  • ML Performance Expert

    2 weeks ago


    Mountain View, California, United States MatX Full time

    About This RoleThis is an exciting opportunity to join MatX as an AI Systems Optimization Engineer. As a member of our team, you will contribute to the development of cutting-edge AI solutions that aim to bridge the gap between AI capabilities and real-world limitations.ResponsibilitiesYour key responsibilities will include:Building performance models and...


  • Mountain View, California, United States Waymo Full time

    At Waymo, we're pushing the boundaries of autonomous driving technology. Our mission is to make transportation safer, more efficient, and accessible to everyone.We're looking for a highly skilled Machine Learning (ML) engineer to join our team. As an ML Engineer at Waymo, you'll play a key role in developing and optimizing our ML systems for compute...


  • Mountain View, California, United States NewsBreak Full time

    We are seeking a highly skilled ML Systems Architect to join our team at NewsBreak. As a key member of our infrastructure team, you will design and develop scalable machine learning infrastructure to support our products.This role requires expertise in designing large-scale distributed backend systems, familiarity with cloud services such as AWS, GCP, Azure,...


  • Mountain View, California, United States Databricks Full time

    Journey to Data InnovationJoin Databricks as a Staff Software Engineer and embark on a journey to revolutionize data processing and analysis. Our mission is to simplify the entire data lifecycle from ingestion to ETL, BI, and ML/AI with a unified platform, leveraging the power of Lakehouse architecture.You'll work on building next-generation systems for...


  • Mountain View, California, United States MatX Full time

    At MatX, we're committed to making AI accessible and efficient. We need a skilled leader to guide our machine learning research team.About the JobThe successful candidate will have a strong background in machine learning research and prior experience in leading a team or contributing to a highly cited paper.Lead a team of experts in developing and...


  • Mountain View, California, United States MatX Full time

    About MatXWe're committed to making the world's best AI models run as efficiently as allowed by physics. Our goal is to bring the world years ahead in AI quality and availability. We're seeking a leader who is excited about systems-focused ML research to lead our team of ML researchers.Estimated Salary RangeThe estimated annual salary for this position is...


  • Mountain View, California, United States Huntington Ingalls Industries Full time

    About the RoleThis position is a fantastic opportunity to join Huntington Ingalls Industries as a Senior AI/ML Expert in our Mission Technologies Division. As a key member of our team, you will be responsible for leading and advising on AI/ML projects, ensuring they meet the highest standards of quality and efficiency.We are looking for a highly skilled...


  • Mountain View, California, United States Google Full time

    We are seeking a highly skilled Senior Software Engineer to join our Core team at Google. This role involves designing and implementing cutting-edge AI/ML solutions, leveraging large-scale system design, natural language processing, and UI development.Responsibilities:Develop and test product or system development code using software development best...


  • Mountain View, California, United States Waymo Full time

    Required Skills and QualificationsPhD in Computer Science, Robotics, Statistics, Physics, Math, or another quantitative area, or equivalent work experience2+ years of experience building productionized ML modelsStrong coding and design skills: comfort building production systems (Python/C++)Strong background and experience in applied Deep LearningA strong...

  • AI Hardware Engineer

    3 weeks ago


    Mountain View, California, United States Matx Full time

    About MatxWe are a cutting-edge technology company developing vertically integrated full-stack solutions from silicon to systems, including hardware and software, to train and run the largest Machine Learning (ML) workloads for Artificial General Intelligence (AGI).Our team primarily utilizes the Rust programming language to push the boundaries of...


  • Mountain View, California, United States Tik Tok Full time

    About TikTokTikTok is a leading short-form mobile video platform that inspires creativity and brings joy to its users. As part of the company's mission, the U.S. Data Security (USDS) division was established to enhance data protection policies and content assurance protocols for U.S. users.Job SummaryWe are seeking a highly skilled System Reliability...


  • Mountain View, California, United States Akraya Full time

    **About Akraya**Award-winning IT staffing firm with a commitment to excellence and a thriving work environment.**Job Description:**We're seeking a Senior Business Systems Analyst to drive innovation and efficiency through expert application of technology and systems analysis. You'll be the force behind optimizing operations, accelerating organizational...


  • Mountain View, California, United States Waymo Full time

    About WaymoWaymo is a pioneering autonomous driving technology company dedicated to revolutionizing the way people move. With a mission to be the most trusted driver, we have been at the forefront of this industry since its inception as the Google Self-Driving Car Project in 2009.Our journey has been marked by significant milestones, including providing over...


  • Mountain View, California, United States Intuit Full time

    About the RoleWe are hiring a highly skilled Machine Learning Expert to join our collaborative and creative group of data scientists and machine learning engineers. Your expertise in statistical techniques, programming languages, and large dataset manipulation will be invaluable in designing, building, and deploying AI systems that directly affect hundreds...


  • Mountain View, California, United States Perot Jain Full time

    Job DescriptionWe are seeking an expert in Autonomous Driving Behavior to join our team at Perot Jain. As a key member of our AI Research group, you will play a crucial role in developing machine learning models that accurately predict the behavior of surrounding agents and plan optimal trajectories for our autonomous vehicles.In this position, you...


  • Mountain View, California, United States MatX Full time

    MatX Compute Platform for AGIWe are developing vertically integrated full-stack solutions from silicon to systems, including hardware and software to train and run the largest ML workloads for AGI.The estimated salary for this position is $260,000 per year. The total compensation package includes equity and benefits.Key Responsibilities:Compiler Design:...


  • Mountain View, California, United States MatX Full time

    About the RoleAt MatX, we strive to make a meaningful impact in the field of artificial intelligence. Our mission is to push the boundaries of what is possible with AI, and we believe that starts with developing innovative machine learning models that leverage our proprietary hardware.We are looking for a seasoned expert in machine learning research to lead...


  • Mountain View, California, United States Waymo Full time

    Job DescriptionThis Hybrid role reports to our TLM of Machine Learning Training and involves:Developing the infrastructure components necessary for distributed trainingImplementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructureMonitoring system health and performing routine maintenance tasks...