Principal AI/ML Software Architect

4 days ago


San Jose, United States Advanced Micro Devices, Inc. Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

THE ROLE:

AMD is looking for an AI/ML software architect who is passionate about improving the performance of key machine learning applications and benchmarks on NPUs. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.

THE PERSON:

We are looking for a dynamic, energetic software architect to join our growing team in the AI group. As an ML software stack architect, you will be responsible for architecting the runtime stack and defining operator mapping, dataflow, and scheduling on AMD's XDNA Neural Processing Units, which power cutting-edge generative models such as Stable Diffusion, SDXL-Turbo, and Llama 2. Your work will directly impact the efficiency, scalability, and reliability of our ML applications. If you thrive in a fast-paced environment and love working on cutting-edge machine learning inference, this role is for you.

KEY RESPONSIBILITIES:

  • Define the software stack that interfaces with open-source runtime environments such as ONNX and PyTorch, and with the NPU compiler.
  • Define runtime operator scheduling, memory management, and operator dataflow based on tensor residency.
  • Propose algorithmic optimizations for operators that are mapped to the CPU using AVX-512.
  • Interface with ONNX and PyTorch runtime engines to deploy models on CPUs.
  • Develop efficient model loading mechanisms to minimize startup latency.
  • Collaborate with kernel developers to integrate ML operators seamlessly into high-level ML frameworks.
  • Design and implement C++ runtime wrappers, APIs, and frameworks for ML model execution.
  • Architect optimized CPU fallback implementations for ML operators that are not supported on NPUs.

PREFERRED EXPERIENCE:

  • Detailed and thorough understanding of the ONNX and PyTorch runtime stacks and of open-source frameworks.
  • Strong experience in scheduling operators between NPU, GPU and CPU.
  • Experience with graph parsing and operator fusion.
  • Strong experience with the AVX and AVX-512 instruction sets and with CPU cache behavior.
  • Strong experience with managing system memory.
  • Detailed understanding of how the compiler interfaces with the runtime stack and of the JIT compilation flow.
  • Strong programming skills in C++ and Python.
  • Experience with ML frameworks (e.g., TensorFlow, PyTorch) is required.
  • Experience with ML models such as CNNs, LSTMs, LLMs, and diffusion models is a must.
  • Experience with the ONNX and PyTorch runtime stacks is a must.
  • Knowledge of parallel computing is a bonus.
  • Familiarity with containerization (Docker, Anaconda, etc.) is good to have.
  • Motivating leader with good interpersonal skills.

ACADEMIC CREDENTIALS:

  • PhD in Computer Science, Computer Engineering, or Electrical Engineering.

Location:

San Jose, CA

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

