Senior Principal, ML Infrastructure Software Engineer

4 weeks ago


San Jose CA, United States Conductor Full time

What You’ll Do

The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance. To achieve this goal, we collaborate closely with both hardware and software engineers to identify and address the unique challenges posed by AI/ML workloads and to explore new computing abstractions that can provide a better balance between the hardware and software components of our systems. Additionally, we continuously conduct research and development in emerging technologies and trends across memory, computing, interconnect, and AI/ML, ensuring that our platforms are always equipped to handle the most demanding workloads of the future. By working together as a dedicated and passionate team, we aim to revolutionize the way AI/ML applications are deployed and executed, ultimately contributing to the advancement of AGI in an affordable and sustainable manner. Join us in our passion to shape the future of computing

Location: Hybrid, working onsite at our office 3 days per week with the flexibility to work remotely the remainder of your time

Reports to: SVP

Req: 41882

  • Stay up-to-date with the latest advancements in parallel computing, distributed systems, and ML technologies, and contribute to the development of new techniques and approaches.
  • Analyze and profile ML workloads to identify bottlenecks and inefficiencies.
  • Design and implement parallel and distributed computing systems to improve the scalability and performance of ML workloads.
  • Optimize ML algorithms and models to reduce memory usage, improve computational efficiency, and minimize communication overhead.
  • Communicate effectively with stakeholders, including users, partners, and management, to ensure that the systems are delivered on time and within budget
  • Complete other responsibilities as assigned.

What You Bring

  • BS in Computer/Electrical Engineering or Computer Science with 20+ years of working experiences in silicon development or MS in Computer/Electrical Engineering or Computer Science with 18+ years of relevant working experience or PhD and 15+ years of relevant working experience preferred.
  • Experience with deep learning techniques and architectures.
  • Strong proficiency in C++, or a similar programming language.
  • Experience with popular ML frameworks such as TensorFlow, PyTorch, or JAX.
  • Experience with ML lowering infrastructure such as MLIR.
  • Excellent problem-solving skills and ability to think critically and creatively.
  • Strong analytical and problem-solving skills
  • Excellent communication and interpersonal skills
  • Ability to work independently and as part of a team
  • You’re inclusive, adapting your style to the situation and diverse global norms of our people.
  • An avid learner, you approach challenges with curiosity and resilience, seeking data to help build understanding.
  • You’re collaborative, building relationships, humbly offering support and openly welcoming approaches.
  • Innovative and creative, you proactively explore new ideas and adapt quickly to change.

#LI-MD1

#J-18808-Ljbffr

  • San Francisco, United States Twelvelabs Full time

    Who we are We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building...


  • Foster City, CA, United States Zoox Full time

    Foster City, CASoftware – Software & Machine Learning Infrastructure /Full-time /On-siteZoox is on a mission to reimagine transportation and ground-up build autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone. We are still in the early stages of deploying our robotaxis on public roads, and it is a great time to join Zoox and...


  • San Mateo, CA, United States Snowflake Computing Full time

    Build the future of data. Join the Snowflake team.Snowflake’s App and Collaboration team builds platform infrastructure underneath Snowflake apps to enable data analysis and AI/Ml modeling, with access to powerful data and AI/ML API resources on Snowflake. AS A SENIOR SOFTWARE ENGINEER - APP AND COLLABORATION PLATFORM TEAM YOU WILL:Contribute to highly...


  • San Diego, CA, United States Shield AI Full time

    San Diego Metro AreaHivemind – Design /Full Time Employee /On-siteHivemind Design (HMD) is an innovative simulation, data science, and infrastructure team at Shield AI. We own the tools for designing, developing, testing, deploying and evaluating instances of the Hivemind AI pilot and commander. Our software products enable companies to construct and...

  • Senior ML Engineer

    4 weeks ago


    San Francisco, United States Cleanlab Full time

    At Cleanlab you willPioneer novel software systems for the rapidly growing field of data-centric AI. Our tools enable data scientists/engineers (across all industries) to effectively diagnose/fix issues in their datasets thus improving the quality of their business’s core asset.Determine how to best leverage new Generative AI advances/infrastructure for...


  • San Francisco, CA, United States Discord Full time

    Discord is about giving people the power to create space to find belonging in their lives. We want to make it easier for you to talk regularly with the people you care about. We want you to build genuine relationships with your friends and communities close to home or around the world. Original, reliable, playful, and relatable. These are the values that...

  • Sr Principal

    4 hours ago


    San Jose, CA, United States SiMa Technologies Full time

    DescriptionJob Title: Sr Principal - Solution Architect, Edge AIML and GenAI       Job Location: San Jose, CA (Onsite Only, No Remote Work)  Job ID: AI2314Job Description:    SiMa is Accelerating the Efficiency, Effectiveness and Ease of Use of AIML and GenAI applications at the Edge Generative AI is enabling AIML and Embedded developers to build...


  • San Francisco, CA, United States Discord Full time

    Discord is about giving people the power to create space to find belonging in their lives. We want to make it easier for you to talk regularly with the people you care about. We want you to build genuine relationships with your friends and communities close to home or around the world. Original, reliable, playful, and relatable. These are the values that...


  • San Francisco, United States ThinkBAC Consulting Full time

    Job DescriptionJob DescriptionThis is a remote position.Lead Energy Storage Quantitative Software Optimization Engineer - Energy Trading Location: FULLY REMOTE (Anywhere in the USA)This is an opportunity to join an industry leading renewable energy venture with strong private equity backing that is focused on the development, execution, and operations of...


  • San Diego, California, United States Tendo Full time

    The ideal candidate has full stack experience building SaaS and/or Cloud Native software for a regulated industry.Additionally, the Senior Principal Software Engineer will bring deep expertise in one or more technologies including distributed microservice architecture, Go, Ent, gRPC, Twirp, and/or AWS technologies like EventBridge and Aurora.The Senior...


  • San Jose, United States IC Resources Full time

    ML/LLVM Compiler Engineer Exciting Blockchain Compiler Role / Remote working / Token equity on offer! An ML/LLVM Compiler Engineer is required to join an exciting ML powered Blockchain company specialisingin all aspects of computer architecture relating to CPU's, GPU's and customer accelerators! My client also integrates advanced machine learning algorithms,...


  • San Francisco, United States Anthropic Limited Full time

    Anthropic is seeking an experienced engineer for our Research Infrastructure team. You'll lead initiatives supporting some of the largest, most sophisticated clusters in industry used to train, research, and ultimately serve AI models. Your work will be crucial in ensuring Anthropic is able to continue reliably and safely training frontier models! The...


  • San Francisco, United States Abnormal Security Full time

    Job DescriptionJob DescriptionAbout the RoleAbnormal Security is looking for a Senior ML Infra Engineer to join the Detection Team. The Detection Division is focused on building the world's most advanced technology for identifying and stopping email and cloud-based attacks that were previously undetectable and help make the world a safer place. As an ML...


  • San Jose, California, United States IC Resources Full time

    ML/LLVM Compiler Engineer Exciting Blockchain Compiler Role / Remote working / Token equity on offer An ML/LLVM Compiler Engineer is required to join an exciting ML powered Blockchain company specialisingin all aspects of computer architecture relating to CPU's, GPU's and customer accelerators My client also integrates advanced machine learning algorithms,...


  • San Francisco, CA, United States Nextdoor Full time

    #TeamNextdoorNextdoor is where you connect to the neighborhoods that matter to you so you can belong. Our purpose is to cultivate a kinder world where everyone has a neighborhood they can rely on.Neighbors around the world turn to Nextdoor daily to receive trusted information, give and get help, get things done, and build real-world connections with those...


  • San Jose, United States SiMa Technologies Full time

    Job Title: Principal - Solution Architect, Edge AIML and GenAI           Job Location: San Jose, CA (Onsite Only, No Remote Work)   Job ID: AI2315    Job Description:        SiMa is Accelerating the Efficiency, Effectiveness and Ease of Use of AIML and GenAI applications at the Edge Generative AI is enabling AIML and Embedded developers...


  • San Francisco, United States Voxel Full time

    Who Are We Industrial labor is incredibly dangerous work - almost 3 million people in the US per year are injured in the workplace for entirely preventable and at times, fatal or debilitating causes. Protecting these essential people who power our world is what motivates Voxelitos, and we'd love for you to join us. At Voxel, we're passionate about...


  • San Francisco, CA, United States Genentech Full time

    The PositionThe PositionAt Genentech Computational Sciences (gCS) Prescient Design, we are at the forefront of employing machine learning to revolutionize drug discovery, adopting novel methods, techniques, and infrastructure to transform the field. Our Engineering team is seeking engineers with strong skills and hands-on experience in designing,...


  • San Francisco, California, United States Fathom Full time

    Fathom is on a mission to use AI to understand and structure the world's medical data, starting by making sense of the terabytes of clinician notes contained within the electronic health records of the world's largest health systems. Our deep learning engine automates the translation of patient records into the billing codes used for healthcare provider...


  • San Francisco, California, United States Fathom Full time

    Fathom is on a mission to use AI to understand and structure the world's medical data, starting by making sense of the terabytes of clinician notes contained within the electronic health records of the world's largest health systems. Our deep learning engine automates the translation of patient records into the billing codes used for healthcare provider...