AI Performance Engineer

4 days ago


Palo Alto, California, United States xAI Full time
Job Description

We are seeking a highly skilled AI Performance Engineer to join our team at xAI.

Located in the Bay Area, this role offers a unique opportunity to work on cutting-edge AI projects and contribute to the development of innovative solutions.

As an AI Performance Engineer, you will be responsible for developing and optimizing low-level CUDA kernel optimizations for state-of-the-art inference and training software stack. This involves profiling, debugging, and optimizing single and multi-GPU operations using tools such as Nsight.

Key responsibilities include:

  • Developing high-performance GeMM CUDA kernels using Tensor cores or CUDA cores from scratch or by utilizing CuTe/CUTLASS.
  • Implementing features for attention kernel by extending existing kernels or writing them from scratch.
  • Ensuring correctness while considering floating point errors.
  • Optimizing for both memory-bound and compute-bound operations.
  • Reasoning about register pressure, shared-memory usage and GPU utilization through tools such as Nsight and removing bottlenecks.

The ideal candidate will have strong technical skills and experience with C/C++ and Python binding tools. Familiarity with JAX/XLA is also a plus.

We operate with a flat organizational structure, where all employees are expected to be hands-on and contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence.

xAI offers a competitive annual salary range of $200,000 - $460,000 USD, depending on experience. All engineers and researchers are expected to have strong communication skills and be able to concisely and accurately share knowledge with their teammates.

If you are passionate about AI and performance optimization, we encourage you to apply for this exciting opportunity.



  • Palo Alto, California, United States Luma AI Full time

    Luma AI is looking for a skilled Senior Software Engineer to join our applied research team. As a key member of our team, you will design, build, and automate infrastructure for processing large-scale data across multiple clusters of thousands of GPUs. Your expertise in Backend Data Engineering will be crucial in building highly efficient, resilient systems...

  • Senior SRE Engineer

    2 days ago


    Palo Alto, California, United States Luma AI Full time

    Join Our TeamLuma AI is a fast-paced, rapidly scaling company that requires experienced professionals like you. As a Senior SRE Engineer - High-Performance Computing, you will collaborate with researchers and engineers to specify availability, performance, correctness, and efficiency requirements of our GPU infrastructure.


  • Palo Alto, California, United States Lutra AI Full time

    About Lutra AILutra AI is a pioneering technology company that empowers individuals to harness the full potential of AI and maximize their personal productivity.We are a tight-knit team based in the San Francisco Bay Area, renowned for our expertise in AI innovation. Are you passionate about learning and applying the latest AI technologies to create...


  • Palo Alto, California, United States Mistral AI Full time

    About Mistral AI">Mistral AI is a leading provider of AI solutions, dedicated to helping businesses succeed in today's fast-paced digital landscape. ">Job Summary">We are seeking an experienced Data Science Engineer to join our team, working closely with customers to understand their needs and deliver tailored AI solutions.">About the Role">This is an...


  • Palo Alto, California, United States Luma AI Full time

    **Our Approach to Multimodal AI**Luma AI's approach to multimodal AI focuses on unlocking the power of foundational models for interesting applications. We believe that multimodality is critical for intelligence and are working on training and scaling up multimodal foundation models.We are looking for an experienced researcher to join our team and contribute...


  • Palo Alto, California, United States Inflection AI Full time

    Company OverviewInflection AI is a public benefit corporation leveraging our world-class large language model to build the first AI platform focused on enterprise needs. We are an organization passionate about building innovative solutions, enjoy working together, and strive to hire individuals with diverse backgrounds and experience.We value and support our...


  • Palo Alto, California, United States Luma AI Full time

    **Job Description**We are seeking a highly skilled AI/ML System Reliability Expert to join our team at Luma AI. As a key member of our Infrastructure and Research teams, you will be responsible for ensuring the health and reliability of our GPU clusters.The ideal candidate will have a strong background in AI/ML system reliability, cloud infrastructure, and...


  • Palo Alto, California, United States Luma AI Full time

    We are looking for a skilled Senior Backend Data Engineer to join our team at Luma AI. As a key member of our applied research team, you will play a crucial role in building highly efficient systems and pipelines for large-scale data processing. You will work closely with researchers to identify and implement technical data requirements and optimize...


  • Palo Alto, California, United States Acceler8 Talent Full time

    About the RoleWe're seeking an experienced Research Engineer to optimize our advanced AI models for efficient deployment in enterprise environments. This role involves fine-tuning inference processes, reducing latency, and enhancing throughput while maintaining high model performance standards.Key Responsibilities:Fine-tune large language models (LLMs) for...


  • Palo Alto, California, United States Tesla Full time

    Company OverviewTesla is a pioneering electric vehicle and clean energy company that is revolutionizing the transportation industry. As a leader in autonomous driving technology, we are seeking highly skilled engineers to join our team.Job SummaryWe are looking for a talented Software Engineer with expertise in performance optimization of AI infrastructure...

  • AI Innovation Lead

    4 days ago


    Palo Alto, California, United States Lutra AI Full time

    Job OverviewLutra AI, a cutting-edge technology company based in the San Francisco Bay Area, is looking for an exceptional AI Innovation Lead. As a member of our small team, you will have the opportunity to deeply leverage AI in our work and lives.We are passionate about turning the latest AI technologies into innovative products that make a real impact. If...


  • Palo Alto, California, United States Luma AI Full time

    Senior Product Designer OpportunityWe are seeking a skilled Senior Product Designer to join our team at Luma AI.About the RoleIn this role, you will play a key part in shaping the future of human-machine relationships and interactions built for AI-native products. As a Senior Product Designer, you will be responsible for creating world-class experiences,...


  • Palo Alto, California, United States Latitude AI LLC Full time

    Latitude AI LLC, an automated driving technology company, is developing a hands-free, eyes-off driver assist system for next-generation Ford vehicles at scale. Our mission is to reimagine what it's like to drive and make travel safer, less stressful, and more enjoyable for everyone.We're seeking a highly skilled Simulation Software Engineer - AI Autonomy to...


  • Palo Alto, California, United States Luma AI Full time

    Unlock the Future of Human-Machine InteractionsLuma AI is a pioneering force in shaping the landscape of AI-native products. We are on the lookout for a highly skilled Senior Product Designer to join our Design team and contribute to the development of cutting-edge, world-class product experiences.About the Role:We seek an exceptional individual with a deep...


  • Palo Alto, California, United States Lightning AI Full time

    Company OverviewWe are Lightning AI, the pioneering company reimagining the way artificial intelligence is developed. Our mission is to simplify AI development, making it accessible to everyone-from solo researchers to large enterprises.Salary$175,000 per year (base salary) plus competitive stock options and benefits package.Job DescriptionWe are seeking a...

  • Senior AI Engineer

    2 weeks ago


    Palo Alto, California, United States AISERA Full time

    About AISERAAISERA is a leader in AI-powered business transformation, offering innovative AI Copilot solutions that drive revenue growth and enhance customer experiences. Our AI Copilot utilizes industry-specific Large Language Models (LLMs) to deliver personalized interactions and automate requests through sophisticated AI workflows. With 400+ integrations...


  • Palo Alto, California, United States Mistral AI Full time

    About Mistral AI">Mistral AI is a leading innovator in the field of artificial intelligence, dedicated to delivering cutting-edge solutions that meet and exceed client expectations. ">Job Summary">We are seeking an exceptional Data Science Engineer to join our team. As a key member of our Applied AI Engineering team, you will work closely with customers from...


  • Palo Alto, California, United States Inflection AI Full time

    Role OverviewAt Inflection AI, we are building the first AI platform focused on enterprise needs. We are seeking a skilled Machine Learning Software Engineer to join our team.About UsWe are a public benefit corporation leveraging our world-class large language model to drive innovation in the field of artificial intelligence. Our leadership team is comprised...


  • Palo Alto, California, United States ZipRecruiter Full time

    About Our FirmWe are a leading research and development organization in the field of artificial intelligence, pushing the boundaries of innovation and technology. Our mission is to create impactful AI solutions that address real-world challenges and improve lives.Role OverviewThis Senior Full-Stack Engineer role combines creative problem-solving with...

  • AI Engineer, Amazon

    2 weeks ago


    Palo Alto, California, United States Amazon Full time

    Job Title: AI Engineer, AmazonAbout the Job:We are seeking a talented AI Engineer to join our team at Amazon. As an AI Engineer, you will be responsible for developing and implementing cutting-edge AI technologies that drive business results.Key Responsibilities:- Develop and deploy AI models that improve customer experience and drive sales growth-...