AI Infrastructure Specialist

3 weeks ago


Palo Alto, California, United States Tesla Full time

The Tesla AI Infrastructure team is looking for an experienced AI Infrastructure Specialist. As a key member of the team, you will be responsible for maintaining and improving our AI infrastructure, which includes virtual simulations, Autopilot hardware, silicon design, and Dojo. With the rapidly growing need for more data and optimized compute resources, cluster builds are getting larger and increasingly complex.

Main Responsibilities:
  • Support the AI/ML cluster infrastructure on both GPU and Dojo platforms, focusing on systems automation, configuration management and deployment at scale
  • Improve our monitoring & self-healing pipelines, as well as security posture
  • Work with hardware and storage vendors to tune and optimize our server, storage and network performance
  • Performance tuning & OS provisioning on Linux systems
  • Manage HPC clusters, workloads and applications
  • Automation and systems engineering
  • Participate in 24x7 on-call rotation
Benefits:
  • Aetna PPO and HSA plans, including options with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental and vision plans with options that include no paycheck contribution
  • Company-paid HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)
  • LGBTQ+ care concierge services
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
  • Company-paid Basic Life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and Vacation time, and Paid Holidays
  • Back-up childcare and parenting support resources
  • Voluntary benefits, including critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
  • Weight Loss and Tobacco Cessation Programs
  • Tesla Babies program
  • Commuter benefits
  • Employee discounts and perks program

$120,000 - $300,000 annual salary + benefits package available.



  • Palo Alto, California, United States Qualified Health Full time

    **Job Overview:** Qualified Health is seeking an exceptional MLOps Engineer to design, implement, and maintain infrastructure for deploying and managing advanced gen-AI agents and workflows powered by large language models. **Estimated Salary:** $185,000 - $205,000 per year. Job Responsibilities: Create production-ready infrastructure and scalable deployment...


  • Palo Alto, California, United States Mistral AI Full time

    About Mistral AI: Mistral AI is a leading provider of AI solutions, dedicated to helping businesses succeed in today's fast-paced digital landscape. Job Summary: We are seeking an experienced Data Science Engineer to join our team, working closely with customers to understand their needs and deliver tailored AI solutions. About the Role: This is an...


  • Palo Alto, California, United States Luma AI Full time

    **Job Description** We are seeking a highly skilled AI/ML System Reliability Expert to join our team at Luma AI. As a key member of our Infrastructure and Research teams, you will be responsible for ensuring the health and reliability of our GPU clusters. The ideal candidate will have a strong background in AI/ML system reliability, cloud infrastructure, and...


  • Palo Alto, California, United States xAI Full time

    Transforming AI with Scalable Infrastructure: We are seeking a highly skilled AI Infrastructure Architect to join our team at xAI. Located in the Bay Area, this role involves designing and developing cutting-edge AI infrastructure that enables our researchers to push the boundaries of what is possible. About the Role: The successful candidate will be responsible...


  • Palo Alto, California, United States Luma AI Full time

    Backend Data Engineer: We are seeking a skilled Backend Data Engineer to design and build highly efficient, resilient systems and pipelines for large-scale data processing. As part of Luma AI's applied research team, you will work directly on mission-critical workstreams utilizing thousands of GPUs. Our goal is to develop innovative solutions that drive...


  • Palo Alto, California, United States Foundry Full time

    Foundry LLC is a pioneering company transforming the way AI companies access compute power. Our mission is to orchestrate the world's compute capacity, making it easier to use and optimized for AI workloads. We're building a new type of public cloud designed specifically for AI, where accessing high-performance compute is as simple and reliable as flipping a...


  • Palo Alto, California, United States Tesla Full time

    Job Summary: Tesla is seeking a talented Cloud Systems Architect to design and implement our AI infrastructure, ensuring seamless operations for Full Self-Driving (FSD), Tesla Bot & Dojo engineering teams. As a key member of the team, you will play a vital role in managing AI infrastructure, monitoring compute/GPU/network metrics, Linux troubleshooting &...


  • Palo Alto, California, United States Luma AI Full time

    **Developing Intelligent Systems** Luma AI is committed to developing intelligent systems that can see, understand, show, and interact with our world. Our mission is to advance human capabilities through multimodal AI. We are seeking a highly skilled researcher to join our foundation models research team. The ideal candidate will have a strong research...

  • AI Developer

    2 weeks ago


    Palo Alto, California, United States Qualified Health Full time

    Revolutionize Healthcare with AI: At Qualified Health, we're on a mission to transform the healthcare industry with cutting-edge AI technology. We're seeking an experienced AI Developer - Front End Specialist to join our early engineering team and help shape the user experience of our innovative AI SaaS platform. About the Role: We're looking for a skilled...


  • Palo Alto, California, United States Inflection AI Full time

    Company Overview: Inflection AI is a public benefit corporation leveraging our world-class large language model to build the first AI platform focused on enterprise needs. We are an organization passionate about building innovative solutions, enjoy working together, and strive to hire individuals with diverse backgrounds and experience. We value and support our...


  • Palo Alto, California, United States Luma AI Full time

    Data Infrastructure Specialist: We are seeking an experienced professional to fill the role of Senior Software Engineer - Data Infrastructure at Luma AI. In this position, you will be responsible for designing and building high-performance systems and pipelines for large-scale data processing. You will work closely with researchers to develop and implement...


  • Palo Alto, California, United States Foundry Technologies, Inc. Full time

    Job Description: This role involves leading the development of our security infrastructure from the ground up, safeguarding our AI cloud platform, ensuring customer data, AI models, and high-performance workloads are secure at every stage. You will be at the forefront of protecting an entirely new form of AI infrastructure, working with advanced hardware...


  • Palo Alto, California, United States Ai Build Limited Full time

    About the Role: We are seeking an experienced AI Sales Director to lead our team in developing and managing a small group of high-performing sales specialists. As a senior leader, you will be responsible for paving the way and developing a team that can drive customer value through AI projects. The successful candidate will have experience in technology-related...


  • Palo Alto, California, United States Lightning AI Full time

    Company Overview: We are Lightning AI, the pioneering company reimagining the way artificial intelligence is developed. Our mission is to simplify AI development, making it accessible to everyone, from solo researchers to large enterprises. Salary: $175,000 per year (base salary) plus competitive stock options and benefits package. Job Description: We are seeking a...


  • Palo Alto, California, United States Foundry Technologies, Inc. Full time

    As a Senior Infrastructure Engineer at Foundry Technologies, Inc., you will be responsible for designing, building, and deploying cutting-edge infrastructure solutions that support our mission to transform the way AI companies access high-performance computing power. We offer a competitive salary range of $170,000 - $230,000 per year, depending on...


  • Palo Alto, California, United States Tesla Full time

    Responsibilities: As an HPC Engineer, your key responsibilities will be: Spearheading AI/ML cluster infrastructure support on both GPU and Dojo platforms, emphasizing systems automation, configuration management, and large-scale deployment. Enhancing monitoring & self-healing pipelines, along with security protocols. Collaborating with hardware and storage...


  • Palo Alto, California, United States ZipRecruiter Full time

    About Our Firm: We are a pioneering organization at the forefront of artificial intelligence research and development, pushing boundaries akin to the most renowned AI labs worldwide. Our mission is to develop groundbreaking AI technologies that benefit humanity, tackling complex problems across various domains. Role Overview: This key position involves...


  • Palo Alto, California, United States Tesla Full time

    **Accelerate Innovation with Tesla's Autopilot AI Team** We are seeking a highly skilled **Software Engineer - Model Scaling, Autopilot AI** to join our team at Tesla. As a key member of our Autopilot AI team, you will play a crucial role in optimizing and scaling our neural network training infrastructure. You will work closely with a specialized team of...


  • Palo Alto, California, United States Qualified Health Full time

    Key Responsibilities: Our ideal candidate will have expertise in generative AI models, particularly those applicable to healthcare scenarios. They will also have strong problem-solving skills, excellent communication and interpersonal skills, and experience with cloud platforms, containerization technologies, and CI/CD tools. The successful candidate will ensure...


  • Palo Alto, California, United States Glean Full time

    About Glean: Glean is pioneering a new era of work where humans and AI collaborate seamlessly. Our platform combines AI and knowledge to make information instantly accessible to employees across the organization. We're a tight-knit team of passionate engineers who are committed to achieving great things. Our expertise spans Google, Facebook, and other top tech...