Sr. AI Models Engineer, Efficient Generative AI

3 weeks ago


San Jose, United States Advanced Micro Devices , Inc. Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

SMTS SOFTWARE DEVELOPMENT ENGINEER

  • THE ROLE:

    Join our innovative team at AMD as an AI Models Software Engineer. We are seeking passionate individuals who are dedicated to optimizing, accelerating, and applying AI models. As a member of our dynamic team, you will have the opportunity to shape the future of AI model development and make a significant impact in various industries and applications.

    THE PERSON:

    We are looking for a candidate who is deeply passionate about AI models and possesses strong engineering skills to tackle complex challenges. You should have experience in optimizing and accelerating NLP/Generative AI models. Effective communication and collaboration with diverse teams across AMD are essential for success in this role.

    As an AI Models Software Engineer, you will:
    1. Drive Innovation: Work on cutting-edge projects and collaborate with a team of highly skilled industry specialists. Push the boundaries of AI model optimization and application and contribute to the advancement of AI technology.
    2. Optimize AI Models: Leverage quantization, sparsity, and architecture search methods to optimize and enhance the performance, efficiency, and accuracy of Generative AI models. Unlock the full potential of AI technology and enable groundbreaking solutions.
    3. Collaborate with Experts: Collaborate closely with software engineers, data scientists, and researchers to integrate AI models into software applications and platforms. Learn from the best in the field and work together to achieve seamless integration and optimal performance.
    4. Stay Ahead of the Curve: Stay updated with the latest advancements in AI algorithms, frameworks, and technologies. Continuously explore new techniques and approaches to stay at the forefront of AI model development and optimization.

    By joining our team, you will be part of a collaborative environment that fosters innovation and encourages professional growth. You will have access to cutting-edge technology, training programs, mentorship opportunities, and a clear career progression path.

    If you are passionate about AI models and possess the skills to drive innovation and optimization, we invite you to join our team at AMD. Apply now and be at the forefront of the AI revolution

    KEY RESPONSIBILITIES:

    • Research, design, and implement novel methods for efficient Generative AI.
    • Algorithmic model optimization methods design including quantization, sparsity, NAS, etc.
    • Publish your work.
    • Collaborate with other team members and teams.

    PREFERRED EXPERIENCE:

    • Project experiences on generative tasks. Familiar with deep learning framework, e.g., Pytorch/ONNX/TensorFlow.
    • Project experiences on model compression, quantization, and end-to-end inference optimization.
    • Strong knowledge of artificial intelligence, deep learning or machine learning.
    • Strong coding skills in Python required, C/C++ skills a plus.
    • Skilled in academic paper writing and technical innovation.
    • Additional plus if recent publications include conferences such as NeuRIPS, CVPR, ECCV/ICCV, ICML, ICLR, etc.
    • Experience with any of the following also a plus: LLMs, stable diffusion, NeRF, or text-to-video generation.

    ACADEMIC CREDENTIALS:

    • A PhD or master's degree in artificial intelligence, machine learning, or a related field.
    LOCATION:
    • Remote CA locations considered.

#LI-MV1

#HYBRID

#REMOTE

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.



  • San Jose, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Francisco, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Francisco, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Francisco, United States Untether AI Full time

    Untether AI is looking for a talented AI Applications Engineer to join our Product team to support our customers with SDK for our custom AI accelerator devices. You will be working with data scientists to ensure their AI workloads are ported and running efficiently on Untether AI products.Must be a US or Canadian citizen to apply.Ideal candidate profileYou...

  • Lead AI Engineer

    4 weeks ago


    San Francisco, United States Distyl AI Full time

    Distyl AI develops production-grade AI systems to power core operational workflows for the Fortune 500. Working in partnership with OpenAI, Distyl brings deep expertise in enterprise AI, and technical investments that support the development of production-grade AI systems with rapid time-to-value. Led by proven leaders from top companies like Palantir and...


  • San Francisco, United States Scale AI, Inc. Full time

    Scale's Generative AI Data Engine powers the most advanced LLMs and generative models in the world through RLHF/RLAIF, data generation, model evaluation, safety, and alignment.As the Manager of the Generative AI Applied ML team, you will lead a talented team of research engineers and ML engineers focused on delivering scalable, production-ready solutions to...

  • Lead AI Engineer

    3 weeks ago


    San Francisco, United States Distyl AI, Inc. Full time

    Distyl AI develops production-grade AI systems to power core operational workflows for the Fortune 500. Working in partnership with OpenAI, Distyl brings deep expertise in enterprise AI, and technical investments that support the development of production-grade AI systems with rapid time-to-value.Led by proven leaders from top companies like Palantir and...

  • AI Inference Engineer

    3 weeks ago


    San Francisco, United States Perplexity AI Full time

    Job DescriptionJob DescriptionWe are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.ResponsibilitiesDevelop APIs for AI inference that will be used by both internal and external...


  • San Francisco, California, United States Scale AI, Inc. Full time

    Scale AI, Inc. is a leading provider of training and evaluation data and end-to-end solutions for the ML lifecycle. Our Generative AI team conducts research on models, supervision, and algorithms that advance frontier models for our applied-ML teams and the broader AI community.In this role, you will work closely with our Generative AI product team focused...


  • San Francisco, United States Truva AI Full time

    Why Join Truva.aiTruva stands at the forefront of SaaS innovation, specializing in automating tasks, optimizing workflows, and delivering unparalleled operational efficiency with LLMs.Truva is backed by top VCs such as YCombinator and Fintech Collective and led by Gaurav - 2x founder and an alumnus of Stanford, and Anuja - an alumnus of Haas MBA from UC...

  • Research Scientist

    6 days ago


    San Francisco, United States techire ai Full time

    Do you want to join a research team working on cutting-edge foundational multimodal models?You’ll be an experienced Researcher / Scientist who wants to shape the future of multimodal generative AI. You’ll need to have worked on complex AI models and have relevant research publications in either:Vision generation - video, image, 3D, LVMsAudio generation -...


  • San Francisco, United States Scale AI, Inc. Full time

    Scale's Generative AI ML team conducts research on models, supervision, and algorithms that advance frontier models for Scale's applied-ML teams and the broader AI community. Scale is uniquely positioned at the heart of the field of AI as an indispensable provider of training and evaluation data and end-to-end solutions for the ML lifecycle. You will work...


  • San Francisco, United States Abridge AI Inc. Full time

    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.Our enterprise-grade technology transforms patient-clinician conversations into...


  • San Francisco, California, United States Cynch AI Full time

    About Cynch AICynch AI is a cutting-edge startup on a mission to transform the accounting industry using artificial intelligence.Job OverviewWe're seeking a seasoned Principal AI Engineer with expertise in Knowledge Representation and Reasoning (KRR) to join our team. As a key member of our engineering team, you will be responsible for designing,...


  • San Francisco, United States Scout AI Full time

    Intro Scout AIis a new hiring platform that connects software engineers to opportunities with world-class companies. On Scout, you get a more relevant and growthful interviewing experience, you receive feedback on your performance, and you also get end-to-end support to improve your chances of getting hired. If you perform well on the Scout interview, you...

  • AI Software Engineer

    3 weeks ago


    San Francisco, United States Perplexity AI Full time

    Job DescriptionJob DescriptionPerplexity is seeking an experienced Full Stack AI Software Engineer to help revolutionize the way people search and interact online. In this role, you'll translate cutting-edge AI advances into tangible products for our users.ResponsibilitiesPropose novel product features that can be built with LLMs and integrate them into...


  • san jose, United States CDRP Technologies Full time

    W2 role.Job Title: Data Engineer with Generative AI ExpertiseLocation: San Jose, CA - RemoteDuration: Long TermTechnical Skills:Looking for minimum 12-15 years of experience. -- Proficiency in data pipeline and Big Data / ETL tools-- Strong programming skills in Python, SQL, and experience with cloud services (e.g., AWS, Google Cloud, Azure).-- Familiarity...


  • San Jose, United States CDRP Technologies Full time

    W2 role.Job Title: Data Engineer with Generative AI ExpertiseLocation: San Jose, CA - RemoteDuration: Long TermTechnical Skills:Looking for minimum 12-15 years of experience. -- Proficiency in data pipeline and Big Data / ETL tools-- Strong programming skills in Python, SQL, and experience with cloud services (e.g., AWS, Google Cloud, Azure).-- Familiarity...

  • AI Research Engineer

    3 weeks ago


    San Francisco, United States Perplexity AI Full time

    Job DescriptionJob DescriptionPerplexity is seeking experienced AI Research Engineers and Scientists to continue to improve our in house Online LLMs, the Sonar models. Your job is to take advantage of our rich query/answer dataset to continue to scale our Sonar model performance and provide the SOTA Online LLM experience to our...


  • San Francisco, United States Abridge AI Inc. Full time

    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.Our enterprise-grade technology transforms patient-clinician conversations into...