Forward-Deployed AI Inference Engineer on Kubernetes

7 days ago


Boston, United States Red Hat, Inc. Full time

A leading enterprise open source software provider is looking for a Forward Deployed Engineer in Boston, MA. This role requires an experienced engineer to deploy and optimize advanced Large Language Model solutions in complex environments. The ideal candidate will have strong Kubernetes expertise, a background in backend systems, and proficiency in Python and Go. This position offers a competitive salary range of $189,600.00 - $312,730.00, based on qualifications.
#J-18808-Ljbffr



  • Boston, United States Red Hat, Inc. Full time

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. In this role, you will not just build software; you will be the bridge between our cutting‑edge inference platform (LLM-D https://llm‑d.ai/, and vLLM https://github.com/vllm-project/vllm) and our customers’ most...


  • Boston, MA, United States Red Hat, Inc. Full time

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting‑edge inference platform (LLM-D ‑ /, and vLLM ) and our customers’ most critical production environments. If your skills,...


  • Boston, MA, United States Red Hat Full time

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D (https://llm-d.ai/) , and vLLM (https://github.com/vllm-project/vllm) ) and our customers' most...


  • Boston, MA, United States Red Hat Full time

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D (https://llm-d.ai/) , and vLLM (https://github.com/vllm-project/vllm) ) and our customers' most...


  • Boston, MA, United States Red Hat Full time

    The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D (https://llm-d.ai/) , and vLLM (https://github.com/vllm-project/vllm) ) and our customers' most...


  • Boston, MA, United States Red Hat, Inc. Full time

    A leading enterprise open source software provider is looking for a Forward Deployed Engineer in Boston, MA. Find out if this opportunity is a good fit by reading all of the information that follows below. This role requires an experienced engineer to deploy and optimize advanced Large Language Model solutions in complex environments. The ideal candidate...

  • Senior ML Engineer

    7 days ago


    Boston, United States Red Hat, Inc. Full time

    A leading software solutions provider is seeking a Senior Machine Learning Engineer to join the AI Inference Engineering team in Boston. You will design and develop innovative features for distributed vLLM infrastructure, enhance resource utilization, and collaborate closely with cross-functional teams. The role demands strong proficiency in Python and Go,...


  • Boston, United States Liquid AI Full time

    About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, minimal memory usage, privacy, and reliability. We partner with enterprises across consumer electronics, automotive, life sciences, and financial services. We are...


  • Boston, United States Blitzy AI Full time

    Blitzy is a Boston-based Generative AI startup on a mission to automate custom software creation to unlock the next industrial revolution. We're transforming how enterprises build software—turning human ideas into production-ready applications with AI that can autonomously generate up to 80% of enterprise-grade code. We're backed by tier 1 investors and...


  • Boston, United States Blitzy AI Full time

    Blitzy is a Boston-based Generative AI startup on a mission to automate custom software creation to unlock the next industrial revolution. We're transforming how enterprises build software—turning human ideas into production-ready applications with AI that can autonomously generate up to 80% of enterprise-grade code. We're backed by tier 1 investors and...