Senior Platform Engineer, AI Evaluation Research Systems

5 days ago


Seattle, Washington, United States Apple Full time

Do you want to help shape the future of AI at Apple? Our team, part of Apple Services Engineering's (ASE) Human Centered AI Research organization, pioneers new methods and tools for AI evaluation. We are seeking a Senior or Staff Platform Engineer to develop systems that will transform evaluation research and methods into compounding advantages for teams across our division. You will help build tools that accelerate our team's research and empower the entire organization to build and evaluate AI more effectively. As a technical leader, you'll help set engineering standards for evaluation systems across ASE and mentor researchers and engineers on best practices for AI tooling.

Description

In this role, you will architect and build core evaluation frameworks, systems, and tools that empower applied research and accelerate AI development. Your work will help researchers and developers close the gap between prototypes and production-grade capabilities, giving them a stable foundation to build on and enabling them iterate more quickly and produce more durable work. The tools you will develop will serve as the primary delivery mechanism for evaluation research innovations, distributing them to partner teams across the organization. This will involve deep collaboration with researchers, developers, engineers, and other users to understand their needs and ensure that solutions are easy to adopt and scalable. This is an opportunity to help build foundational systems and tools that translate cutting-edge methods and proven best practices into reliable, production-ready tools for development teams across ASE and partner organizations.

Preferred Qualifications

Practical experience building or evaluating applications powered by large language models (e.g., using agentic frameworks, rag, or similar techniques)

Experience designing platforms or frameworks that shorten the path from prototype to production

Familiarity with distributed data processing (e.g., Spark, Dask, PySpark) and large-scale compute for AI workloads

Experience with inference optimization frameworks (e.g. vLLM, TensorRT-LLM)

Experience with LLM orchestration frameworks (e.g. LangChain, LangGraph)

Contributions to or deep engagement with open-source developer tooling or ML frameworks

Experience mentoring other engineers and raising a team's engineering standards

Minimum Qualifications

Demonstrated mastery in engineering robust, maintainable, and operable software systems

Deep expertise in Python, with demonstrated excellence in designing high-quality, extensible APIs and SDKs

Deep experience (5+ years) in platform engineering or adjacent roles, with a proven track record of designing, building, and scaling internal or developer-facing platforms

Demonstrated track record of driving adoption of developer tools, with experience gathering user feedback and iterating on developer experience

Experience with ML development lifecycle and ML platform development, including understanding of model training, evaluation, and deployment workflows

Understanding of standard AI stack components such as retrieval systems (vector databases, hybrid search, etc.), model serving platforms, and LLM application frameworks

Proven ability to collaborate with cross-functional teams including researchers, engineers, product owners, and leadership

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .



  • Seattle, Washington, United States Apple Full time

    Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking machine learning platform engineers at multiple levels (Mid-Level to Principal) to architect and build high-availability services and internal tools that enable self-service evaluation at scale. You will partner with researchers to operationalize their...


  • Seattle, Washington, United States Apple Full time

    Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a hands-on Engineering Manager to architect high-availability services and internal tools that enable self-service evaluation at scale. You will partner with researchers to operationalize their innovations, transforming complex workflows into intuitive,...


  • Seattle, Washington, United States Scale AI Full time

    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and operators for fast and automatic training and evaluation of LLM's, as well as evaluation of data quality.Scale is uniquely positioned at the heart of the field of AI...


  • Seattle, Washington, United States Scale AI Full time

    Scale is the leading data and evaluation partner for frontier AI companies, playing an integral role in advancing the science of evaluating and characterizing large language models (LLMs). Our research focuses on tackling the hardest problems in scalable oversight and the evaluation of advanced AI capabilities. We collaborate broadly across industry and...


  • Seattle, Washington, United States Apple Full time

    Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the...


  • Seattle, Washington, United States Pryon Full time

    About Pryon:We're a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. Now we're building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform. Our proprietary, cutting-edge natural language processing capabilities...


  • Seattle, Washington, United States Cobot Full time

    Are you passionate about advancing the state of artificial intelligence and machine learning? Our rapidly growing startup is seeking an AI Research Engineer to join our Foundational Models AI team. This role is ideal for researchers and builders who thrive at the intersection of machine learning research and software engineering—those who can turn...


  • Seattle, Washington, United States HackerOne Full time

    HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world's largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. Through solutions like bug bounty,...

  • AI Research Engineer

    2 weeks ago


    Seattle, Washington, United States Cobot Full time

    Are you passionate about advancing the state of artificial intelligence and machine learning? Our rapidly growing startup is seeking an AI Research Engineer to join our Foundational Models AI team. This role is ideal for researchers and builders who thrive at the intersection of machine learning research and software engineering—those who can turn...


  • Seattle, Washington, United States Meta Full time

    Reality Labs at Meta is building products that make it easier for people to connect with the ones they love most, enjoy top-notch, wire-free VR, and push the future of computing platforms. We are a team of world-class experts developing and shipping products at the intersection of hardware, software and content.We are seeking a Research Engineer to join our...