Staff Engineer, Infrastructure Software for AI

1 week ago


Palo Alto, United States SB Telecom America Corp. Full time

Company Description:

SB Telecom America Corp. offers innovative technology solutions to drive business growth and success. As part of the SoftBank Group, we focus on AI, IoT, Security, and Digital Marketing to create new business value for our clients. Our digital marketing services cater to the Japanese market with bilingual experts in the U.S. and U.K.


About SoftBank:

SoftBank is making significant investments in infrastructure for AI. SoftBank Corp. has recently established a new US center in Silicon Valley focused on infrastructure software for AI and AI foundations for mobile networks. Our goals are to challenge norms and build products that use our state-of-the-art infrastructure (such as NVIDIA MGX, DGX Grace & Hopper platforms, and beyond) and cloud-native software. These products are geared towards centralized AI data centers as well as distributed AI Radio Access Network (AI RAN) data centers. We are looking for expert practitioners who are inspired to innovate and build transformative products.


Minimum Qualifications:

  • Bachelor's degree in Computer Science, Electrical Engineering, or related field.
  • 12+ years of experience in software and hardware engineering, including platforms and distributed systems.
  • 7+ years in Technical Lead roles, leading high-impact projects and teams.
  • Experience building systems and systems software, AI frameworks, and applied AI.


Preferred Qualifications:

  • Master's or PhD in a relevant field.
  • Deep product experience with Kubernetes and container orchestration.
  • Experience with GPU systems and high-performance computing environments.
  • Expertise in building scalable infrastructure to support AI workloads.
  • Experience with AI developer frameworks, tools, and automation systems.


Role:

Lead the infrastructure team of Senior Engineers responsible for building foundational software on top of GPU systems that support AI workloads (training, fine-tuning, and serving). Guide the development of new AI infrastructure with a focus on Kubernetes and GPU systems. Drive innovation in systems software architecture and automation to maximize resource utilization. As a Directly Responsible Individual (DRI) for major engineering tasks, work with Principal Engineers, product management, and program management to lead execution towards commercialization.


Responsibilities:

  • Develop and lead an engineering team to build systems software that supports AI workloads on large-scale GPU systems.
  • Deliver the control plane for workloads, including scheduling and orchestration, and the management plane for the underlying platforms.
  • Provide northbound APIs for the customer portal to interact with the infrastructure.
  • Contribute to the product definition (PRD) and own the resulting product execution schedules.
  • Attract and build engineering talent.
  • Model and foster a culture of humility and innovation in product delivery.

