AI Platform Engineer

1 day ago


Boston, Massachusetts, United States Axiomatic_AI Full time $120,000 - $200,000 per year

Position Overview
As an AI Platform Engineer, you are the bridge between AI research and production software. You will:

  • Build and maintain AI infrastructure: model serving, vector databases, embedding pipelines
  • Enable AI developers to deploy their work reproducibly and safely
  • Design APIs for AI inference, prompt management, and evaluation
  • Implement MLOps pipelines: versioning, monitoring, logging, experimentation tracking
  • Optimize performance: latency, cost, throughput, reliability
  • Collaborate with backend engineers to integrate AI capabilities into the product

Key Responsibilities

  • AI Infrastructure

  • Deploy and serve LLMs (OpenAI, Anthropic, HuggingFace, fine-tuned models)

  • Optimize inference latency and costs
  • Implement caching, rate limiting, and retry strategies

  • MLOps & Pipelines

  • Version models, prompts, datasets, and evaluation results

  • Implement experiment tracking (Weights & Biases)
  • Build CI/CD pipelines for model deployment
  • Monitor model performance and drift
  • Set up logging and observability for AI services

  • API Development

  • Design and implement APIs (FastAPI)

  • Create endpoints for prompt testing, model selection, and evaluation
  • Integrate AI services with backend application
  • Ensure API reliability, security, and performance

  • Collaboration & Enablement

  • Work with AI Developers to productionize their experiments regarding improving user workflows

  • Define workflows: notebook/test repository → PR → staging → production
  • Document AI infrastructure and best practices
  • Review code and mentor AI developers on software practices

Required Skills & Experience
Must-Have

  • 7+ years of software engineering experience (Python preferred)
  • Experience with LLMs and AI/ML in production: OpenAI API, HuggingFace, LangChain, or similar
  • Understanding of vector databases (Pinecone, Chroma, Weaviate, FAISS)
  • Cloud infrastructure experience: GCP (Vertex AI preferred) or AWS (SageMaker)
  • API development: FastAPI, REST, async programming
  • CI/CD and DevOps: Docker, Terraform, GitHub Actions
  • Monitoring and observability
  • Problem-solving mindset: comfortable debugging complex distributed systems
  • Operating experience with AI deployment in enterprise environment

Nice-to-Have

  • Experience fine-tuning or training models
  • Familiarity with LangChain, Pydantic AI or similar frameworks
  • Knowledge of prompt engineering and evaluation techniques
  • Experience with real-time inference and streaming responses
  • Background in data engineering or ML engineering
  • Understanding of RAG architectures
  • Contributions to open-source AI/ML projects

Tech Stack
Current Stack:

  • Languages:
    Python (primary), Bash
  • AI/ML:
    OpenAI API, Anthropic, HuggingFace, LangChain, Pydantic AI
  • Vector DBs:
    Pinecone, Chroma, Weaviate, or FAISS
  • Backend:
    FastAPI, SQLAlchemy, Pydantic
  • Cloud:
    GCP (Vertex AI, Cloud Run), Terraform
  • CI/CD:
    GitHub Actions
  • Experiment Tracking:
    MLflow, Weights & Biases, or custom
  • Containers:
    Docker, Kubernetes (optional)

What we offer:
Competitive compensation

  • Stock Options Plan:
    Empowering you to share in our success and growth.
  • Cutting-Edge Tools:
    Access to state-of-the-art tools and collaborative opportunities with leading experts in artificial intelligence, physics, hardware and electronic design automation.
  • Work-Life Balance:
    Flexible work arrangements in one of our offices with potential options for remote work.
  • Professional Growth:
    Opportunities to attend industry conferences, present research findings, and engage with the global AI research community.
  • Impact-Driven Culture:
    Join a passionate team focused on solving some of the most challenging problems at the intersection of AI and hardware.


  • Boston, Massachusetts, United States Red Hat Full time $120,000 - $180,000 per year

    Red Hat's AI Engineering team is seeking a Product Owner to support the PyTorch function within AI Platform Core Components (AIPCC).In this role, you'll serve as the key connection point between our PyTorch engineering team and its stakeholders, including Product Management, Engineering leadership, the upstream PyTorch community and more.You'll manage and...


  • Boston, Massachusetts, United States Manifold AI Full time $25 - $40

    Our CultureAt Manifold, we value intellectual rigor, humility, and mission-driven collaboration. We believe that technology is only as powerful as the people behind it, and we're building a culture that supports growth, inclusion, and curiosity. We work fast, think deeply, and strive to make a lasting impact on patients' lives.About ManifoldAs the amount of...


  • Boston, Massachusetts, United States InterSystems Full time $127,000 - $167,000 per year

    We are seeking anAI Engineerto join our Managed Services team. You will design, integrate, and deploy AI solutions—including agentic applications, LLM inference, similarity search, guardrails, and model evaluation— to improve workflows and scalability, and to optimize the deployment, operation, and upgrade of InterSystems cloud solutions.This role is...


  • Boston, Massachusetts, United States InterSystems Full time $127,000 - $167,000 per year

    We are seeking an AI Engineer to join our Managed Services team. You will design, integrate, and deploy AI solutions—including agentic applications, LLM inference, similarity search, guardrails, and model evaluation— to improve workflows and scalability, and to optimize the deployment, operation, and upgrade of InterSystems cloud solutions.This role is...


  • Boston, Massachusetts, United States Mastercard Full time $138,000 - $265,000

    Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...


  • Boston, Massachusetts, United States Smartcat Full time $120,000 - $200,000 per year

    About SmartcatSmartcat is building the future of work, where human expertise meets digital teammates to drive 10x to 1000x productivity gains for the world's leading enterprises.We're on the frontier of an entirely new category: Agentic AI. We enable enterprises to build high-performing hybrid workforces made up of both humans and AI agents. These AI agents...


  • Boston, Massachusetts, United States Shield AI Full time $166,990 - $250,486 per year

    Founded in 2015, Shield AI is a venture-backed deep-tech company with the mission of protecting service members and civilians with intelligent systems. Its products include the V-BAT and X-BAT aircraft, Hivemind Enterprise, and the Hivemind Vision product lines. With nine offices and facilities across the U.S., Europe, the Middle East, and the Asia-Pacific,...


  • Boston, Massachusetts, United States ServiceNow Full time $120,000 - $180,000 per year

    Company Description At ServiceNow, our technology makes the world work for everyone, and our people make it possible. We move fast because the world can't wait, and we innovate in ways no one else can for our customers and communities. By joining ServiceNow, you are part of an ambitious team of change makers who have a restless curiosity and a drive for...


  • Boston, Massachusetts, United States AI Jobs Full time $174,000 - $234,250 per year

    Role : Machine Learning EngineerRole Overview :Join the team building large-scale infrastructure empowering ML engineers to develop next-generation self-driving technology. As a Principal Engineer, you will define the technical vision, lead complex architecture, and design systems that process massive data, run advanced simulations, and train cutting-edge AI...


  • Boston, Massachusetts, United States Scion Staffing Full time $163,040 - $244,560 per year

    About The OpportunityScion Technology has been engaged to lead a search for aPrincipal AI Engineeron behalf of our client — a pioneering creative technology studio launching an innovative AI platform for interactive experiences.This is an exciting opportunity to serve as the founding technical hire for a new AI initiative, working directly with the CEO to...