Current jobs related to AI/ML Server Performance Architect - Austin - AMD


  • Austin, Texas, United States Amazon Full time

    Join Our Team as a Senior AI Solutions ArchitectAre you enthusiastic about the realms of Artificial Intelligence, Machine Learning, and Deep Learning? Do you aspire to assist clients in crafting innovative solutions utilizing cutting-edge AI/ML/DL technologies on the Amazon Web Services (AWS) platform? If so, we invite you to explore this opportunity.About...

  • AI Architect

    2 months ago


    Austin, Texas, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • Austin, Texas, United States Amazon Full time

    Are you driven by a passion for Artificial Intelligence and Generative Technologies? Do you enjoy collaborating with clients to develop innovative solutions utilizing cutting-edge AI/ML tools on the Amazon Web Services (AWS) platform? If so, we invite you to explore this opportunity.About Amazon: With over two decades of investment in artificial...


  • Austin, Texas, United States augmentjobs Full time

    Job DescriptionOverview: We are seeking a skilled and forward-thinking AI Solutions Architect to join our team at AugmentJobs. In this role, you will be tasked with designing and implementing AI-driven solutions that enhance operational efficiency and improve user engagement. Collaboration with diverse teams will be essential as you create and deploy AI...

  • AI/ML Engineer

    4 weeks ago


    Austin, United States SRI Tech Solutions Inc. Full time

    AI/ML EngineerLocation – Raleigh, NC (Day 1 Onsite)Job Description –We are looking for an NLP/Gen Ai Engineer who is Specialize in Natural Language Processing (NLP) and generative AI techniques, focusing on enabling machines to comprehend and generate human language.Play a crucial part in creating solutions that involve unstructured data. Independently...


  • Austin, Texas, United States SambaNova Systems Full time

    The age of ubiquitous AI is upon us. Organizations are increasingly leveraging generative AI to uncover latent value within their data, streamline operations, minimize expenses, enhance efficiency, and foster innovation, thereby fundamentally reshaping their business landscapes.SambaNova SuiteTM stands as the pioneering full-stack generative AI platform,...

  • Gen AI Engineer

    5 days ago


    Austin, United States Tror - AI for everyone Full time

    Job Title - : Gen AI EngineerLocation: Austin, TXContact: W2Visa: H1B Only Requirement:Years of experience- 10+Job Description:Build scalable software solutions using LLM's and other ML models to solve challenges in healthcareBuild enterprise grade AI solutions with focus on privacy, security, fairness.Work with Product Development as a Generative Artificial...

  • Gen AI Engineer

    1 week ago


    Austin, United States Tror - AI for everyone Full time

    Job Title - : Gen AI EngineerLocation: Austin, TXContact: W2Visa: H1B Only Requirement:Years of experience- 10+Job Description:Build scalable software solutions using LLM's and other ML models to solve challenges in healthcareBuild enterprise grade AI solutions with focus on privacy, security, fairness.Work with Product Development as a Generative Artificial...

  • Senior AI Architect

    3 months ago


    Austin, United States SADA Full time

    Job DescriptionJob DescriptionJoin SADA as a Senior Artificial Intelligence ArchitectYour Mission As a Senior Artificial Intelligence Architect at SADA, you will work in a fast-paced, highly-skilled team developing state-of-the-art enterprise solutions using a wide variety of artificial intelligence technologies. You will work directly with customers,...

  • Sr. AI/ML Engineer

    3 months ago


    Austin, United States Quilr Full time

    Job DescriptionJob DescriptionQuilr is a start up in stealth mode based out of Austin. We are aiming to solve a problem in the Cybersecurity space that has been untouched till now. The founder has a 12 year history in Cybersecurity and has exited a company for $1B+. We are looking for a Senior Data Scientist to join our team to build a core engine of the...


  • Austin, United States Advanced Micro Devices , Inc. Full time

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Austin, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...

  • Staff AI Researcher

    6 days ago


    Austin, United States Aledade Full time

    As a Staff AI Researcher, you will develop ML and AI solutions that will improve health for millions of people. Here at Aledade we empower primary care physicians with technology to keep their patients healthy and prevent unnecessary hospitalizations. You will partner with other engineering and analytics teams, bringing AI technology into existing products...


  • Austin, Texas, United States Apple Full time

    SummaryPosted: Jul 3, 2024Role Number: Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish Join us to help deliver system design of next generation groundbreaking AI/ML based...

  • AI Systems Architect

    2 weeks ago


    Austin, Texas, United States Eidon AI Full time

    Location: Bay Area candidates are also welcome About Eidon AI At Eidon AI, we are a collective of AI and blockchain innovators, comprised of experts from leading tech firms and investment institutions. Our mission is to collaboratively build a fully decentralized AI infrastructure that operates in a trustless and permissionless manner, ensuring equitable...


  • Austin, Texas, United States Advanced Micro Devices, Inc Full time

    Job SummaryWe are seeking a highly skilled Server Performance Architect to join our team at Advanced Micro Devices, Inc. As a key member of our organization, you will be responsible for designing and developing high-performance server systems that accelerate next-generation computing experiences.Key ResponsibilitiesDesign and develop server systems that...


  • Austin, Texas, United States Advanced Micro Devices, Inc Full time

    Job SummaryWe are seeking a highly skilled Server Performance Architect to join our team at Advanced Micro Devices, Inc. This is a unique opportunity to work on cutting-edge server performance optimization and simulation.Key ResponsibilitiesDevelop and implement advanced server performance simulation methodologies to analyze and optimize high-performance...

  • Sr. AI/ML Engineer

    4 weeks ago


    Austin, United States Siri InfoSolutions Inc Full time

    Job DescriptionJob DescriptionTitle: Sr. AI/ML Engineer Location: Austin, TX (Day 1 Onsite)Job Description:Responsibilities:Design, develop, and implement Retrieval-Augmented Generation (RAG) systems utilizing Large Language Models (LLMs) to enhance our AI capabilities.Work with vector stores, semantic search, and other relevant technologies to optimize the...

  • AI/ ML Engineer

    1 month ago


    Austin, United States Infosys Full time

    Infosys is seeking a Lead AI/ML Engineer. This position will interface with key stakeholders and apply your technical proficiency across different stages of the Software Development Life Cycle including Requirements Elicitation, Application Architecture definition and Design. You will play an important role in creating the high level design artifacts. You...

  • AI/ ML Engineer

    2 months ago


    Austin, United States Infosys Full time

    Infosys is seeking a Lead AI/ML Engineer. This position will interface with key stakeholders and apply your technical proficiency across different stages of the Software Development Life Cycle including Requirements Elicitation, Application Architecture definition and Design. You will play an important role in creating the high level design artifacts. You...

AI/ML Server Performance Architect

2 months ago


Austin, United States AMD Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ AI/ML Server Performance Architect THE ROLE: In the Data center Server Solutions Group AI Performance Architect who will be engaged in data driven design and analysis to advance application performance of AI/ML on server systems THE PERSON: As an AI/ML performance architect you will work on new trends and requirements in datacenter AI/ML areas and identify performance bottlenecks, translate them into hardware requirements for future products. You will have deep subject matter expertise in key AI domains and successfully influence data center solutions. KEY RESPONSIBILITIES: Lead and analyze cutting-edge trends and requirements of DL/ML models in the data center segment Design/explore system level trade-offs on compute/ memory/ capacity and network on current and future generation AI HW/SW products Project AI application performance from proposed design changes Implement different optimized solutions and compare against expected “roofline” behavior. Optimize models to deploy on heterogeneous platform through quantization, partitioning as well as accuracy trade-offs to hit target KPIs Analyze performance of AI workloads on silicon and architecture models to identify performance limiters Build performance tools and infrastructure for automation and accelerating analysis of application performance PREFERRED EXPERIENCE: Proven experience in AI workload development and subject matter expertise at least in one area (LLMs, Vision or Recommendation systems) Experience in performance analysis, performance tool development and performance projections Experience programming and porting models in DL frameworks (such as Pytorch and Tensorflow) Experience in Quantization flows such as QAT, PTQ, Fine tuning/ transfer learning Prefer background course work in computer architecture and system performance analyses Optional experience in CUDA programming as well as any Huggingface pipeline development Quickly grasp new concepts and technologies Strong communication, interpersonal team work skills and data organization skills through presentations, excel and document organization/automation Can operate with focus and be results-oriented with ability to handle ambiguity ACADEMIC CREDENTIALS: Masters or higher degree in relevant fields, i.e. Computer Science, Electrical Engineer, Computer Engineering Publications in AI and Architecture domains a plus #LI-Hybrid #LI-RW1 At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AI/ML Server Performance Architect THE ROLE: In the Data center Server Solutions Group AI Performance Architect who will be engaged in data driven design and analysis to advance application performance of AI/ML on server systems THE PERSON: As an AI/ML performance architect you will work on new trends and requirements in datacenter AI/ML areas and identify performance bottlenecks, translate them into hardware requirements for future products. You will have deep subject matter expertise in key AI domains and successfully influence data center solutions. KEY RESPONSIBILITIES: Lead and analyze cutting-edge trends and requirements of DL/ML models in the data center segment Design/explore system level trade-offs on compute/ memory/ capacity and network on current and future generation AI HW/SW products Project AI application performance from proposed design changes Implement different optimized solutions and compare against expected “roofline” behavior. Optimize models to deploy on heterogeneous platform through quantization, partitioning as well as accuracy trade-offs to hit target KPIs Analyze performance of AI workloads on silicon and architecture models to identify performance limiters Build performance tools and infrastructure for automation and accelerating analysis of application performance PREFERRED EXPERIENCE: Proven experience in AI workload development and subject matter expertise at least in one area (LLMs, Vision or Recommendation systems) Experience in performance analysis, performance tool development and performance projections Experience programming and porting models in DL frameworks (such as Pytorch and Tensorflow) Experience in Quantization flows such as QAT, PTQ, Fine tuning/ transfer learning Prefer background course work in computer architecture and system performance analyses Optional experience in CUDA programming as well as any Huggingface pipeline development Quickly grasp new concepts and technologies Strong communication, interpersonal team work skills and data organization skills through presentations, excel and document organization/automation Can operate with focus and be results-oriented with ability to handle ambiguity ACADEMIC CREDENTIALS: Masters or higher degree in relevant fields, i.e. Computer Science, Electrical Engineer, Computer Engineering Publications in AI and Architecture domains a plus #LI-Hybrid #LI-RW1 At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan. You’ll also be eligible for competitive benefits described in more detail here. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. Tags: No, USD $165,060.00/Yr., USD $235,800.00/Yr., US Careers (External)