AI Inference Solutions Engineer

3 weeks ago

San Jose, California, United States Advanced Micro Devices , Inc. Full time

Job Summary

The Advanced Micro Devices , Inc. is seeking a highly skilled Ai Application Engineer to join our team in San Jose, CA or Santa Clara, CA.

We are looking for an innovative and hands-on individual to work on cutting-edge inference solutions and ensure the stability, performance, and usability of AI software before it is released to the public.

About the Role

Collaborate with cross-functional teams (software engineering, marketing, competitive analysis) to improve AI inference solutions.
Ensure stability and usability of AI software, performing rigorous testing and profiling before public release.
Participate in software & solution analysis and contribute to strategic planning.
Develop, deploy, and optimize AI applications with a focus on inference efficiency.

What We Offer

At AMD, your base pay will be around $140,000 per year, depending on experience. You may also be eligible for incentives based upon your role such as an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

Requirements

A team player with good communication skills and willingness to work with cross-functional teams.
Hands-on experience with AI application development, with a focus on inference solutions.
Strong coding skills in Python, C++, or similar languages.
Experience with AI frameworks like TensorFlow, PyTorch, or similar.

About Us

At AMD, we care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

Machine Learning Engineer

4 weeks ago

San Francisco, California, United States Perplexity AI Full time

Job DescriptionWe are seeking an AI Inference Engineer to join our growing team. As a key member of our engineering team, you will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.Benchmark and address bottlenecks throughout our inference stackImprove the reliability and observability of our systems...
AI Inference Specialist

3 weeks ago

San Jose, California, United States AMD Full time

ResponsibilitiesCollaborate with cross-functional teams (software engineering, marketing, competitive analysis) to improve AI inference solutions.Ensure stability and usability of AI software by performing rigorous testing and profiling before public release.Participate in software & solution analysis and contribute to strategic planning.Develop, deploy, and...
Technical Architect

3 weeks ago

San Jose, California, United States Recogni Full time

Job DescriptionWe are seeking a highly experienced Principal Software Engineer to join our world-class engineering team at Recogni. This position requires a strong technical background in software engineering and a passion for developing innovative solutions.Key ResponsibilitiesDevelop multi-disciplinary end-to-end system development for our cutting-edge,...
AI Inference Solutions Architect

4 days ago

San Jose, California, United States Recogni Full time

About UsAt Recogni, we believe that people come first. We prioritize our employees' well-being and their families, aiming for a healthier, happier life inside and outside work.We value their contributions and offer tailored benefits for health and financial security, catering to different life stages.We are an equal opportunity employer and believe that a...
Machine Learning Engineer

3 weeks ago

San Francisco, California, United States Together AI Full time

About the Role">We are looking for a talented Machine Learning Engineer to join our team at Together AI. As an MLOps engineer, you will develop systems and APIs that enable our customers to perform inference and fine-tune LLMs.">Responsibilities">Develop and deploy systems and APIs that enable customers to perform inference and fine-tune LLMs.Work closely...
Senior Engineering Manager

2 days ago

San Jose, California, United States Adobe Inc. Full time

The Opportunity">We're seeking an experienced Senior Engineering Manager to lead our AI Inference Platform team. As a key member of our organization, you'll be responsible for developing and maintaining a scalable and efficient platform that enables our customers to leverage the power of AI.Your primary goal will be to design, develop, and deploy the AI...
AI Engineer for Enterprise AI Solutions

3 weeks ago

San Jose, California, United States Adobe Full time

About the RoleWe are seeking a highly skilled Principal Machine Learning Services Engineer to join our team at Adobe. In this role, you will contribute to the backend services for Firefly that power the Generative AI features on various Adobe applications and surfaces for Enterprise customers.The OpportunityFirefly is Adobe's new family of creative...
Principal AI Engineer

3 weeks ago

San Jose, California, United States Recogni Full time

**Recogni: Unlocking the Potential of AI Inference**Recogni is a pioneering system solution company that empowers businesses to harness the full potential of AI inference. Our expertise lies in designing high-performance, low-power AI inferencing solutions that accelerate multimodal Generative AI inference at scale. By leveraging the latest research and...
Data Inference Specialist

2 months ago

San Francisco, California, United States Perplexity AI Full time

We are seeking an experienced Data Inference Specialist to join our team at Perplexity AI.OverviewAt Perplexity AI, we've achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with mobile apps installed over 1...
High-Performance AI Model Engineer

3 hours ago

San Francisco, California, United States Perplexity AI Full time

We're revolutionizing information access and knowledge synthesis with our cutting-edge question-answering and information retrieval systems.As an experienced AI Inference Engineer, you'll join our team to work on the internal workings of our AI inference stack, running neural networks that power our systems. Collaborate closely with AI Model Engineers and...
Inference Stack Architect

3 weeks ago

San Francisco, California, United States Liquid AI Full time

Harness Machine Learning Potential: As a key member of our team, you'll play a vital role in shaping the future of machine learning at Liquid AI. With a competitive salary range of $150,000 - $170,000 per annum, depending on experience and qualifications, you'll have the opportunity to grow professionally and make a meaningful impact. Job Description: Our...
Machine Learning Inference Specialist

4 weeks ago

San Francisco, California, United States Perplexity AI Full time

Company OverviewPerplexity AI is a leading innovator in the field of conversational answer engines, boasting 10 million monthly active users and serving over 500 million queries worldwide.We've experienced tremendous growth since publicly launching our fully functional search assistant just over a year ago and have raised significant funding from top...
AI Inference Performance Optimization Expert

2 days ago

San Jose, California, United States Untether AI Full time

At Untether AI, we're pushing the boundaries of AI performance and efficiency. We're looking for a skilled NPI Product/Test Engineer to join our team and help us bring innovative hardware solutions to market.About the TeamOur team is comprised of talented engineers and scientists who are passionate about creating groundbreaking technology. We're a...
AI Optimization Engineer

2 weeks ago

San Francisco, California, United States Naptha AI Full time

About the Role:Naptha AI seeks a skilled AI Optimization Engineer to drive advancements in test time compute optimization for large language models. This role requires researching and developing novel approaches to improve inference efficiency, reduce computational requirements, and enhance model performance at deployment.Key Responsibilities:Design and...
Staff Software Engineer

4 days ago

San Francisco, California, United States Crusoe Full time

A Day in the LifeAs a Staff Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform.You will lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling...
Inference Performance Specialist

3 weeks ago

San Francisco, California, United States Liquid AI Full time

Job DescriptionWe are looking for a talented Senior Optimization Engineer to join our team and help us develop highly optimized ML inference stacks for various hardware platforms. The successful candidate will have extensive experience in coding, with expertise in Python, PyTorch, CUDA, and C++. They should be able to work independently, taking ownership of...
AI Infrastructure Engineer

2 days ago

San Francisco, California, United States Magic AI Full time

Magic AI is dedicated to building safe AGI that accelerates humanity's progress on the world's most important problems. Our approach combines frontier-scale pre-training, domain-specific reinforcement learning, ultra-long context, and inference-time compute to achieve this goal.About the Role:As a Distributed Systems Engineer, you will build the data and...
AI Systems Architect for Enterprise AI Solutions

1 month ago

San Jose, California, United States Adobe Full time

Unlock the full potential of Firefly, Adobe's new family of creative generative AI models. As a Principal Machine Learning Services Engineer, you will play a crucial role in designing and developing scalable GenAI backed solutions for Enterprise customers.About the OpportunityWe are seeking an experienced engineer to contribute to the backend services that...
Advanced AI Infrastructure Engineer

1 month ago

San Francisco, California, United States Together AI Full time

About the RoleWe are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform.Company OverviewTogether AI is committed to creating open and transparent AI systems that drive...
Software Engineer

3 days ago

San Francisco, California, United States Virtue AI Full time

Virtue AI is a leading San Francisco-based company at the forefront of AI technology. This Full Stack Engineer position will be based onsite in San Francisco.The future of AI depends on our ability to keep it safe and responsible. We're seeking an experienced Full Stack Engineer to champion our efforts in doing so.This role involves joining our engineering...

Americas

Europe

Asia / Oceania

Africa

AI Inference Solutions Engineer