Principal AI/ML Software Architect

3 days ago


San Jose, United States Advanced Micro Devices, Inc. Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

THE ROLE:

AMD is looking for an AI/ML software architect who is passionate about improving the performance of key Machine Learning applications and benchmarks on NPU. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.

THE PERSON:

We are looking for a dynamic, energetic software architect to join our growing team in the AI group. As an ML software stack architect, you will be responsible for architecting the runtime stack, defining the operator mapping and dataflow, and scheduling on AMD's XDNA Neural Processing Units that power cutting edge generative models like Stable diffusion, SDXL-Turbo, Llama2, etc. Your work will directly impact the efficiency, scalability, and reliability of our ML applications. If you thrive in a fast-paced environment and love working on cutting edge machine learning inference, this role is for you.

KEY RESPONSIBILITIES:

  1. Define software stack that interfaces with open source runtime env like ONNX, PyTorch and NPU compiler.
  2. Define runtime operator scheduling, memory management, operator dataflow based on tensor residency.
  3. Propose algorithmic optimization in operators that are mapped to CPU using AVX512.
  4. Interface with ONNX / Pytorch runtime engines to deploy the model on CPUs.
  5. Develop efficient model loading mechanisms to minimize startup latency.
  6. Collaborate with kernel developers to integrate ML operators seamlessly into high level ML frameworks.
  7. Design and implement C++ runtime wrappers, APIs, and frameworks for ML model execution.
  8. Architect optimized CPU alternative implementation for ML operators that are not supported on NPUs.

PREFERRED EXPERIENCE:

  1. Detailed and thorough understanding of ONNX, PyTorch runtime stack, open source frameworks.
  2. Strong experience in scheduling operators between NPU, GPU and CPU.
  3. Experience with graph parsing, operator fusion.
  4. Strong experience with AVX, AVX512 instruction set, cache behavior in CPU.
  5. Strong experience with managing system memory.
  6. Detailed understanding of compiler interfacing with runtime stack, JIT compilation flow.
  7. Strong programming skills in C++, Python.
  8. Experience with ML frameworks (e.g., TensorFlow, PyTorch) is required.
  9. Experience with ML models such as CNN, LSTM, LLMs, Diffusion is a must.
  10. Experience with ONNX, Pytorch runtime stacks is a must.
  11. Knowledge of parallel computing is a bonus.
  12. Familiarity with containerization (Docker, Anaconda, etc) is good to have.
  13. Motivating leader with good interpersonal skills.

ACADEMIC CREDENTIALS:

  1. PhD degree in Computer Science, Computer Engineering, Electrical Engineering.

Location:

San Jose, Ca

#J-18808-Ljbffr

  • San Jose, United States Cisco Full time

    The Cisco Security AI team delivers AI products and platform for all Cisco secure products and portfolios so businesses around the world can defend against threats and safeguard the most vital aspects of their business with security resilience. We are passionate about making businesses secure and simplifying security with zero compromise using AI and...

  • Software Architect:

    1 month ago


    San Jose, United States IBM Computing Full time

    IBM Software Architect: AI Technical Advocate in San Jose, CaliforniaIntroductionA career in IBM Software means you'll be part of a team that transforms our customers' challenges into industry-leading solutions. We are an infinitely curious team, always seeking new possibilities, and dedicated to creating the world's leading AI-powered, cloud-native software...


  • San Jose, CA, United States ThisWay Full time

    ThisWay Unlock Human Potential with: • Ethical AI • Business Automation • Automated Integrations • Unbiased Talent Transformation. Get started with compliant AI and automation to instantly identify top qualified candidates for every...Our partner is seeking a Principal AI/ML Engineer specializing in Security AI for their San Jose, CA location. This...


  • San Jose, California, United States Adobe Full time

    About the RoleWe're seeking a highly skilled Principal AI Services Engineer to join our team at Firefly, a family of creative generative AI models coming to Adobe products. As a key member of our team, you'll be responsible for designing and developing scalable, reliable cloud services with observability, logging and tracing to enable quick detection,...


  • San Francisco, California, United States Untether AI Full time

    Software Architect for AI InferenceWe are seeking an exceptional Software Architect to join our team at Untether AI, where you will play a key role in designing and developing software that interacts with our innovative chip. As part of our top-notch team, you will collaborate closely with hardware engineers and fellow software engineers to create software...


  • San Francisco, United States Relyance AI Full time

    As Relyance AI's Senior Software Engineer, ML, you will strategize, drive, and execute on the initiatives in NLP for information extraction from legal documents, ML/NLP for information extraction from code and general ML in code analysis, as well as overall AI backend initiatives. You will partner with cross-functional stakeholders to design and build...


  • San Francisco, United States Relyance AI Full time

    As Relyance AI's Senior Software Engineer, ML, you will strategize, drive, and execute on the initiatives in NLP for information extraction from legal documents, ML/NLP for information extraction from code and general ML in code analysis, as well as overall AI backend initiatives. You will partner with cross-functional stakeholders to design and build...


  • San Francisco, United States Relyance AI Full time

    As Relyance AI's Senior Software Engineer, ML, you will strategize, drive, and execute on the initiatives in NLP for information extraction from legal documents, ML/NLP for information extraction from code and general ML in code analysis, as well as overall AI backend initiatives. You will partner with cross-functional stakeholders to design and build...

  • AI/ML Architect

    2 days ago


    San Jose, California, United States Advanced Micro Devices , Inc. Full time

    Transforming Lives with AMD TechnologyWe are a team of innovators at Advanced Micro Devices, Inc., dedicated to enriching our industry, communities, and the world through cutting-edge technology. Our mission is to accelerate next-generation computing experiences, including data centers, artificial intelligence, PCs, gaming, and embedded solutions.Our vision...


  • San Jose, California, United States Adobe Full time

    Unlock the full potential of Firefly, Adobe's new family of creative generative AI models. As a Principal Machine Learning Services Engineer, you will play a crucial role in designing and developing scalable GenAI backed solutions for Enterprise customers.About the OpportunityWe are seeking an experienced engineer to contribute to the backend services that...


  • San Jose, California, United States Cisco Systems, Inc. Full time

    Unlock Your Potential as an AI/ML Infrastructure Architect at CiscoLocation: San Jose, California, USCompensation Range: 173100 - 241700 USDJob Type: ProfessionalJob Id:We're at the forefront of developing products that power the largest networks in the world. Our networking industry is undergoing a massive transformation to build next-generation...


  • San Antonio, United States Insight Global Full time

    Duration: 6-month contract, with possible extensionsLocation: San Antonio, TX (4 days onsite, Friday remote)Please see the job description below:Must Haves:Bachelor’s Degree in Computer Science, Engineering, or a related field.7+ years in DevOps, MLOps, or related field.Extensive experience with Azure DevOps, AzureML, and Linux-based platforms.Proficient...


  • San Francisco, United States Relyance AI Full time

    As Relyance AI’s Senior Software Engineer, ML, you will strategize, drive, and execute on the initiatives in NLP for information extraction from legal documents, ML/NLP for information extraction from code and general ML in code analysis, as well as overall AI backend initiatives. You will partner with cross-functional stakeholders to design and build...


  • San Jose, United States Cisco Full time

    The Cisco Security AI team delivers AI products and platform for all Cisco secure products and portfolios so businesses around the world can defend against threats and safeguard the most vital aspects of their business with security resilience. We are passionate about making businesses secure and simplifying security with zero compromise using AI and...


  • San Antonio, Texas, United States Insight Global Full time

    **Job Overview**We are seeking a highly skilled AI/ML Operations Architect to join our team in San Antonio, TX. As an AI/ML Op Architect, you will play a crucial role in designing and overseeing the implementation of AI/MLOps/LLMOps solutions on the Azure AI/ML platform.The successful candidate will work closely with cross-functional teams to optimize the...


  • San Jose, California, United States IBM Full time

    Company Overview: IBM Software is a leading provider of AI-powered, cloud-native software solutions.Salary: $150,000 - $200,000 per yearJob Description: As an AI Solutions Architect at IBM, you will transform clients' business challenges into industry-leading AI-powered solutions.Key Responsibilities:Architect and deploy scalable AI solutions that integrate...


  • San Francisco, California, United States Oleria Security Full time

    Company OverviewOleria Security is an innovative cybersecurity startup founded by industry leaders to tackle access risks in cloud applications. Our platform uses AI to provide unparalleled visibility, control, and defense against identity-based attacks. We believe in empowering customers to secure their cloud identities and prevent data breaches.We're a...


  • San Jose, California, United States Adobe Full time

    Elevate Digital Experiences with AdobeOur MissionAt Adobe, we're committed to transforming the world through digital experiences that inspire and captivate audiences. As a leader in creative generative AI, we're poised to revolutionize the way our Enterprise customers interact with their customers.The OpportunityWe're seeking a seasoned Principal ML Services...


  • San Jose, California, United States Capital One Full time

    Transformative AI ExpertWe are seeking a visionary Distinguished AI Solutions Architect to join our team at Capital One. As an expert in artificial intelligence, you will play a key role in designing and developing innovative AI solutions that drive business growth and customer satisfaction.About the RoleThis is a unique opportunity to work on cutting-edge...


  • San Francisco, California, United States Factory Full time

    Job OverviewWe are seeking an experienced Senior AI Software Architect to help build out our core platform, enabling our enterprise customers to seamlessly collaborate with advanced AI systems.About the RoleYou will design and develop AI-driven agentic systems that enhance Factory's core AI capabilities, focusing on retrieval systems, code generation...