AI Platform Engineer
1 day ago
Position Overview
As an AI Platform Engineer, you are the bridge between AI research and production software. You will:
- Build and maintain AI infrastructure: model serving, vector databases, embedding pipelines
- Enable AI developers to deploy their work reproducibly and safely
- Design APIs for AI inference, prompt management, and evaluation
- Implement MLOps pipelines: versioning, monitoring, logging, experimentation tracking
- Optimize performance: latency, cost, throughput, reliability
- Collaborate with backend engineers to integrate AI capabilities into the product
Key Responsibilities
AI Infrastructure
Deploy and serve LLMs (OpenAI, Anthropic, HuggingFace, fine-tuned models)
- Optimize inference latency and costs
Implement caching, rate limiting, and retry strategies
MLOps & Pipelines
Version models, prompts, datasets, and evaluation results
- Implement experiment tracking (Weights & Biases)
- Build CI/CD pipelines for model deployment
- Monitor model performance and drift
Set up logging and observability for AI services
API Development
Design and implement APIs (FastAPI)
- Create endpoints for prompt testing, model selection, and evaluation
- Integrate AI services with backend application
Ensure API reliability, security, and performance
Collaboration & Enablement
Work with AI Developers to productionize their experiments regarding improving user workflows
- Define workflows: notebook/test repository → PR → staging → production
- Document AI infrastructure and best practices
- Review code and mentor AI developers on software practices
Required Skills & Experience
Must-Have
- 7+ years of software engineering experience (Python preferred)
- Experience with LLMs and AI/ML in production: OpenAI API, HuggingFace, LangChain, or similar
- Understanding of vector databases (Pinecone, Chroma, Weaviate, FAISS)
- Cloud infrastructure experience: GCP (Vertex AI preferred) or AWS (SageMaker)
- API development: FastAPI, REST, async programming
- CI/CD and DevOps: Docker, Terraform, GitHub Actions
- Monitoring and observability
- Problem-solving mindset: comfortable debugging complex distributed systems
- Operating experience with AI deployment in enterprise environment
Nice-to-Have
- Experience fine-tuning or training models
- Familiarity with LangChain, Pydantic AI or similar frameworks
- Knowledge of prompt engineering and evaluation techniques
- Experience with real-time inference and streaming responses
- Background in data engineering or ML engineering
- Understanding of RAG architectures
- Contributions to open-source AI/ML projects
Tech Stack
Current Stack:
- Languages:
Python (primary), Bash - AI/ML:
OpenAI API, Anthropic, HuggingFace, LangChain, Pydantic AI - Vector DBs:
Pinecone, Chroma, Weaviate, or FAISS - Backend:
FastAPI, SQLAlchemy, Pydantic - Cloud:
GCP (Vertex AI, Cloud Run), Terraform - CI/CD:
GitHub Actions - Experiment Tracking:
MLflow, Weights & Biases, or custom - Containers:
Docker, Kubernetes (optional)
What we offer:
Competitive compensation
- Stock Options Plan:
Empowering you to share in our success and growth. - Cutting-Edge Tools:
Access to state-of-the-art tools and collaborative opportunities with leading experts in artificial intelligence, physics, hardware and electronic design automation. - Work-Life Balance:
Flexible work arrangements in one of our offices with potential options for remote work. - Professional Growth:
Opportunities to attend industry conferences, present research findings, and engage with the global AI research community. - Impact-Driven Culture:
Join a passionate team focused on solving some of the most challenging problems at the intersection of AI and hardware.
-
AI Platform Core Components
1 day ago
Boston, Massachusetts, United States Red Hat Full time $120,000 - $180,000 per yearRed Hat's AI Engineering team is seeking a Product Owner to support the PyTorch function within AI Platform Core Components (AIPCC).In this role, you'll serve as the key connection point between our PyTorch engineering team and its stakeholders, including Product Management, Engineering leadership, the upstream PyTorch community and more.You'll manage and...
-
A.I. Engineering Intern/Fellow
2 days ago
Boston, Massachusetts, United States Manifold AI Full time $25 - $40Our CultureAt Manifold, we value intellectual rigor, humility, and mission-driven collaboration. We believe that technology is only as powerful as the people behind it, and we're building a culture that supports growth, inclusion, and curiosity. We work fast, think deeply, and strive to make a lasting impact on patients' lives.About ManifoldAs the amount of...
-
AI Application Engineer
6 days ago
Boston, Massachusetts, United States InterSystems Full time $127,000 - $167,000 per yearWe are seeking anAI Engineerto join our Managed Services team. You will design, integrate, and deploy AI solutions—including agentic applications, LLM inference, similarity search, guardrails, and model evaluation— to improve workflows and scalability, and to optimize the deployment, operation, and upgrade of InterSystems cloud solutions.This role is...
-
AI Application Engineer
1 day ago
Boston, Massachusetts, United States InterSystems Full time $127,000 - $167,000 per yearWe are seeking an AI Engineer to join our Managed Services team. You will design, integrate, and deploy AI solutions—including agentic applications, LLM inference, similarity search, guardrails, and model evaluation— to improve workflows and scalability, and to optimize the deployment, operation, and upgrade of InterSystems cloud solutions.This role is...
-
AI Engineering Manager
5 days ago
Boston, Massachusetts, United States Mastercard Full time $138,000 - $265,000Our PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Customer AI Engineer
5 days ago
Boston, Massachusetts, United States Smartcat Full time $120,000 - $200,000 per yearAbout SmartcatSmartcat is building the future of work, where human expertise meets digital teammates to drive 10x to 1000x productivity gains for the world's leading enterprises.We're on the frontier of an entirely new category: Agentic AI. We enable enterprises to build high-performing hybrid workforces made up of both humans and AI agents. These AI agents...
-
Staff Engineer, FPGA
1 day ago
Boston, Massachusetts, United States Shield AI Full time $166,990 - $250,486 per yearFounded in 2015, Shield AI is a venture-backed deep-tech company with the mission of protecting service members and civilians with intelligent systems. Its products include the V-BAT and X-BAT aircraft, Hivemind Enterprise, and the Hivemind Vision product lines. With nine offices and facilities across the U.S., Europe, the Middle East, and the Asia-Pacific,...
-
Sr. Solution Consultant – Platform, AI
1 day ago
Boston, Massachusetts, United States ServiceNow Full time $120,000 - $180,000 per yearCompany Description At ServiceNow, our technology makes the world work for everyone, and our people make it possible. We move fast because the world can't wait, and we innovate in ways no one else can for our customers and communities. By joining ServiceNow, you are part of an ambitious team of change makers who have a restless curiosity and a drive for...
-
Machine Learning Engineer
2 days ago
Boston, Massachusetts, United States AI Jobs Full time $174,000 - $234,250 per yearRole : Machine Learning EngineerRole Overview :Join the team building large-scale infrastructure empowering ML engineers to develop next-generation self-driving technology. As a Principal Engineer, you will define the technical vision, lead complex architecture, and design systems that process massive data, run advanced simulations, and train cutting-edge AI...
-
Principal AI Engineer
4 days ago
Boston, Massachusetts, United States Scion Staffing Full time $163,040 - $244,560 per yearAbout The OpportunityScion Technology has been engaged to lead a search for aPrincipal AI Engineeron behalf of our client — a pioneering creative technology studio launching an innovative AI platform for interactive experiences.This is an exciting opportunity to serve as the founding technical hire for a new AI initiative, working directly with the CEO to...