AI Inference Engineer for Embedded Platforms

2 months ago


San Francisco, California, United States Invisible AI Inc. Full time

About Invisible AI Inc.

At Invisible AI, we are pioneering advancements in computer vision technology. Our primary mission is to create a comprehensive platform that transforms manufacturing processes. By utilizing edge AI cameras, we aim to enhance the accuracy, reliability, and safety of manual assembly tasks, thereby revolutionizing people-driven manufacturing.

Our founders, who have extensive experience in the self-driving car industry, are committed to building and deploying robust AI and Machine Learning pipelines. We invite you to join our team and contribute to a company that is unlocking the vast potential of computer vision for real-world applications.

Role Overview

As an Embedded Machine Learning Engineer, you will engage with state-of-the-art technologies to assess the performance of our machine learning infrastructure across various hardware accelerators and deep learning inference platforms. Your work will be instrumental in shaping the future of our hardware solutions.

You will address the challenges associated with deploying machine learning models developed in diverse libraries on edge computing platforms, focusing on feasibility, computational efficiency, and runtime performance. Collaborating with a talented team of engineers, you will help launch innovative AI products that are ready for immediate use across multiple domains.

Key Responsibilities

  • Implement PyTorch models on NVIDIA Jetson platforms and optimize them with TensorRT.
  • Connect off-the-shelf hardware accelerators with single-board computers such as Orange Pi and Raspberry Pi.
  • Work with various hardware accelerators (e.g., GPUs), troubleshoot issues, and refine C++ code for peak performance.
  • Investigate power consumption issues related to SSDs, USB cameras, AI boards, and CPU/GPU configurations.
  • Examine the compatibility of different machine learning operations with various computing platforms.

Qualifications

  • Graduate students specializing in Electrical Engineering with a focus on Machine Learning/Deep Learning for Computer Vision, or undergraduate students with relevant experience.
  • Strong proficiency in C++ and practical experience with embedded Linux systems.
  • Experience in developing and deploying machine learning algorithms.
  • Solid understanding of PCIe interfaces for NVMe drives, hardware accelerators, and WiFi modules.
  • Comprehensive knowledge of machine learning concepts, including convolutions, encoders, decoders, optimizers, and loss functions, particularly in embedded environments.
  • Familiarity with the complete Linux stack and debugging techniques.
  • Experience with NVIDIA Jetson platforms and a good grasp of their hardware components (Tensor Cores, DLA, video encoders, and decoders).
  • Knowledge of various digital communication interfaces (I2C, SPI, USB, CAN, HDMI, DDR3/4).
  • Proficiency in scripting languages such as Python or Bash.
  • Experience with arm64-based platforms.

Compensation

The estimated hourly pay for this position is $45.00, subject to adjustments based on market conditions and individual qualifications assessed during the interview process. Invisible AI is an equal opportunity employer and values diversity in the workplace.