Current jobs related to AI Inference Engineer for Embedded Platforms - San Francisco, California - Invisible AI Inc.

AI Inference Systems Engineer

4 days ago

San Francisco, California, United States Perplexity AI Full time

We are seeking an experienced AI Inference Systems Engineer to join our growing team at Perplexity AI. Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes, providing a unique opportunity to work on large-scale deployment of machine learning models for real-time inference. Key Responsibilities: Develop APIs for AI inference that will be used by...
Senior Engineering Manager for AI Inference Platform

4 weeks ago

San Jose, California, United States Adobe Full time

Job Summary: We are seeking a highly skilled Senior Engineering Manager to lead the development of our AI Inference Platform at Adobe. As a key member of our team, you will be responsible for driving the architecture, design, development, and testing of the platform. Your primary goal will be to enable the Firefly Product Team to easily run and deploy ML...
Senior Inference Optimization Engineer

4 days ago

San Francisco, California, United States Liquid AI Full time

At Liquid AI, we're seeking a highly skilled engineer to optimize inference stacks tailored to various hardware platforms. The ideal candidate has extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures. They should be self-motivated, capable of working independently, and driven by a passion for...
Senior Inference Optimization Engineer

2 weeks ago

San Francisco, California, United States Liquid AI Full time

Job Title: Member of Technical Staff. At Liquid AI, we're seeking a highly skilled engineer to optimize inference stacks for our models across various device types, including GPUs, CPUs, and NPUs. Key Responsibilities: Collaborate with ML Teams: Work with machine learning staff to effectively interface with our technical team. Hardware Awareness: Understand...
Senior Inference Optimization Engineer

1 month ago

San Francisco, California, United States Liquid AI Full time

Optimize Inference Stacks for Liquid AI: As we prepare to deploy our models across various device types, including GPUs, CPUs, and NPUs, we're seeking an expert who can optimize inference stacks tailored to each platform. We're looking for someone who can take our models, dive deep into the task, and return with a highly optimized inference stack, leveraging...
Senior Inference Optimization Engineer

2 weeks ago

San Francisco, California, United States Liquid AI Full time

About the Role: We're seeking a highly skilled engineer to join our team at Liquid AI, where you'll play a critical role in optimizing inference stacks for our AI models. As a key member of our team, you'll be responsible for taking our models and delivering highly optimized inference stacks that leverage existing frameworks like ggml, vllm, and DeepSpeed to...
Platform ML Engineering Manager, Inference

2 weeks ago

San Francisco, California, United States OpenAI Full time

About the Team: The Platform ML team is responsible for building the ML side of our internal training framework, which is used to train cutting-edge models. We work on distributed model execution, as well as the interfaces and implementation for model code, training, and inference. Our priorities are to maximize training throughput and researcher throughput,...
AI Platform Engineer

3 days ago

San Francisco, California, United States Labelbox Full time

About the Role: Labelbox is seeking a skilled AI Platform Engineer to join our team. As a key member of our engineering organization, you will be responsible for building and maintaining a scalable AI platform that utilizes foundation models for real-world applications. Your Day to Day: Enhance and improve Labelbox's core machine learning capabilities, including...
AI Infrastructure Architect

1 week ago

San Francisco, California, United States Together AI Full time

AI Infrastructure Expertise: Design and implement high-performance AI/ML infrastructure, ensuring scalability, availability, and efficient resource utilization. Automation and Optimization: Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks. Collaboration and...
Senior Software Engineer

2 weeks ago

San Francisco, California, United States Altana AI Full time

About the Role: We are seeking a highly skilled Senior Software Engineer to join our team at Altana AI. As a key member of our engineering team, you will be responsible for designing and developing our cloud-based AI platform. Key Responsibilities: Ingest product requirements and define technical requirements for our AI platform. Design and develop scalable,...
Software Engineer, Inference

2 weeks ago

San Francisco, California, United States Anthropic Full time

About Anthropic: Anthropic is a public benefit corporation headquartered in San Francisco, dedicated to creating reliable, interpretable, and steerable AI systems. Our mission is to develop AI that is safe and beneficial for users and society as a whole. Job Description: We are seeking a skilled Software Engineer, Inference to join our Inference team. As a key...
Machine Learning Operations Engineer

4 days ago

San Francisco, California, United States Together AI Full time

Together AI is seeking an experienced MLOps engineer to develop and deploy scalable AI/ML systems. The ideal candidate will have a strong understanding of machine learning, particularly large language models, and experience with DevOps practices like CI/CD, automation, and containerization. Key Responsibilities: Design and implement runtime systems for...
Site Reliability Engineering Manager, AI Platform

3 weeks ago

San Jose, California, United States Adobe Full time

About the Role: We are seeking an exceptional Site Reliability Engineering Manager to lead our team in driving reliability for Adobe's AI Inference Platform, Adobe Firefly. As a key member of our Engineering organization, you will be responsible for developing a team of Site Reliability Engineers who will work closely with our Engineering teams to build,...
Site Reliability Engineering Manager, AI Platform

4 days ago

San Jose, California, United States Adobe Full time

Job Title: Site Reliability Engineering Manager, AI Platform. About the Role: We are seeking an experienced Site Reliability Engineering Manager to lead our AI Inference Platform team at Adobe. As a key member of our Engineering organization, you will be responsible for developing and implementing strategies to ensure the reliability, scalability, and security...
Site Reliability Engineering Manager, AI Platform Expert

1 week ago

San Jose, California, United States Adobe Full time

Transforming Digital Experiences with Adobe: We're a company that's passionate about empowering people to create beautiful and powerful digital experiences. Our mission is to give everyone the tools they need to design and deliver exceptional experiences across every screen. The Opportunity: We're seeking an exceptional Site Reliability Engineering Manager to...
Senior Embedded Software Engineer

3 days ago

San Francisco, California, United States MoTek Technologies Full time

Senior Embedded Software Engineer - Computer Vision Expert: We are seeking a highly skilled Senior Embedded Software Engineer to develop and deploy advanced robot perception systems on integrated hardware. You will collaborate with a team of leading researchers and engineers in robotics and AI to build the next generation of robotics vision systems. The role...
AI Researcher

1 month ago

San Francisco, California, United States Inflection AI Full time

About Inflection AI: Inflection AI is a public benefit corporation that leverages its world-class large language model to build the first AI platform focused on the needs of the enterprise. Our Mission: We are passionate about building enterprise AI solutions that make a positive impact. Our team is dedicated to creating innovative products that empower...
Software Engineer, AI/ML Infrastructure Specialist

4 days ago

San Francisco, California, United States Together AI Full time

Job Responsibilities: Infrastructure Development: Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable AI/ML solutions. AI/ML Solutions: Develop advanced AI/ML infrastructure solutions to enhance the efficiency of our ML teams, leveraging expertise in distributed systems and large-scale data processing. System Design: Design and...
Software Engineer, Inference

7 days ago

San Francisco, California, United States Anthropic Full time

About Anthropic: Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: Our...
Research Scientist

2 weeks ago

San Francisco, California, United States Techire AI Full time

Research Scientist - Multimodal AI: We're seeking an experienced Research Scientist to join our team working on cutting-edge foundational multimodal models. As a key member of our research team, you'll be responsible for shaping the future of multimodal generative AI. About the Role: You'll need to have a strong research focus on generative models, including...

AI Inference Engineer for Embedded Platforms

2 months ago

San Francisco, California, United States Invisible AI Inc. Full time

About Invisible AI Inc.

At Invisible AI, we are pioneering advancements in computer vision technology. Our primary mission is to create a comprehensive platform that transforms manufacturing processes. By utilizing edge AI cameras, we aim to enhance the accuracy, reliability, and safety of manual assembly tasks, thereby revolutionizing people-driven manufacturing.

Our founders, who have extensive experience in the self-driving car industry, are committed to building and deploying robust AI and Machine Learning pipelines. We invite you to join our team and contribute to a company that is unlocking the vast potential of computer vision for real-world applications.

Role Overview

As an Embedded Machine Learning Engineer, you will engage with state-of-the-art technologies to assess the performance of our machine learning infrastructure across various hardware accelerators and deep learning inference platforms. Your work will be instrumental in shaping the future of our hardware solutions.

You will address the challenges associated with deploying machine learning models developed in diverse libraries on edge computing platforms, focusing on feasibility, computational efficiency, and runtime performance. Collaborating with a talented team of engineers, you will help launch innovative AI products that are ready for immediate use across multiple domains.

Key Responsibilities

Deploy PyTorch models on NVIDIA Jetson platforms, applying the appropriate TensorRT optimizations.
Connect off-the-shelf hardware accelerators with single-board computers such as Orange Pi and Raspberry Pi.
Work with various hardware accelerators (e.g., GPUs), troubleshoot issues, and refine C++ code for peak performance.
Investigate power consumption issues related to SSDs, USB cameras, AI boards, and CPU/GPU configurations.
Examine the compatibility of different machine learning operations with various computing platforms.

Qualifications

Graduate students specializing in Electrical Engineering with a focus on Machine Learning/Deep Learning for Computer Vision, or undergraduate students with relevant experience.
Strong proficiency in C++ and practical experience with embedded Linux systems.
Experience in developing and deploying machine learning algorithms.
Solid understanding of PCIe interfaces for NVMe drives, hardware accelerators, and WiFi modules.
Comprehensive knowledge of machine learning concepts, including convolutions, encoders, decoders, optimizers, and loss functions, particularly in embedded environments.
Familiarity with the complete Linux stack and debugging techniques.
Experience with NVIDIA Jetson platforms and a good grasp of their hardware components (tensor cores, DLA, video encoders, and decoders).
Knowledge of various digital communication interfaces (I2C, SPI, USB, CAN, HDMI, DDR3/4).
Proficiency in scripting languages such as Python or Bash.
Experience with arm64-based platforms.

Compensation

The estimated hourly pay range for this position is between $45.00, subject to adjustments based on market conditions and individual qualifications assessed during the interview process. Invisible AI is an equal opportunity employer and values diversity in the workplace.
