Senior ML Infrastructure Engineer

18 hours ago


San Francisco, CA, United States Gridware Full time
About Gridware

Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response (AGR), focused on monitoring the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware's advanced Active Grid Response platform uses high-precision sensors to detect potential issues early, enabling proactive maintenance and fault mitigation. This comprehensive approach helps improve safety, reduce outages, and ensure the grid operates efficiently. The company is backed by climate-tech and Silicon Valley investors. For more information, please visit www.Gridware.io.

Role Description

As a Senior ML Infrastructure Engineer, you will work directly in the Automation org with the core ML, Ops, and Analytics teams to help improve and build out the infrastructure around model deployment and monitoring. This role is essential to helping scale out the amount of time saving's Gridware brings to customers.

Responsibilities
    • Design, build, and maintain the infrastructure, tooling, and workflows that enable reliable, scalable deployment of ML models to production.
    • Develop monitoring and observability systems to track model performance, data drift, data quality, and overall system health.
    • Create and maintain end-to-end testing frameworks and simulation environments to validate models and pipelines prior to deployment.
    • Work closely with Data Engineering and Platform Engineering teams to ensure ML systems integrate cleanly with broader Gridware infrastructure and operational standards.
    • Improve CI/CD pipelines for ML workloads, ensuring reproducibility, safe rollout, and automated rollback strategies.
Required Skills
    • 5+ years of experience building production ML infrastructure
    • Strong software engineering skills and proficiency in Python
    • Experience with cloud platforms (AWS) and container orchestration (Kubernetes)
    • Familiarity with feature stores, model registries, or centralized metadata systems (i.e. MLFlow)


$190,000 - $210,000 a year

This describes the ideal candidate; many of us have picked up this expertise along the way. Even if you meet only part of this list, we encourage you to apply

Benefits

Health, Dental & Vision (Gold and Platinum with some providers plans fully covered)

Paid parental leave

Alternating day off (every other Monday)

"Off the Grid", a two week per year paid break for all employees.

Commuter allowance

Company-paid training

  • San Francisco, CA, United States Plaid Full time

    Senior Software Engineer - ML Infrastructure Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid's vast financial data network. The Machine Learning Infrastructure team sits at the center of this transformation. We build the platforms that enable model...


  • San Francisco, CA, United States Plaid Full time

    Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid's vast financial data network. The Machine Learning Infrastructure team sits at the center of this transformation. We build the platforms that enable model developers to experiment, train, deploy, and...


  • San Francisco, CA, United States Plaid Full time

    Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid's vast financial data network. The Machine Learning Infrastructure team sits at the center of this transformation. We build the platforms that enable model developers to experiment, train, deploy, and...


  • San Francisco, CA, United States Plaid Full time

    Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid's vast financial data network. The Machine Learning Infrastructure team sits at the center of this transformation. We build the platforms that enable model developers to experiment, train, deploy, and...


  • San Francisco, CA, United States LMArena Full time

    Senior Software Engineer, ML Infrastructure LMArena is seeking a Senior Software Engineer (Infrastructure) to lead the design and development of scalable, high-performance real-time data and API infrastructure. In this role, you'll architect systems that capture and process large volumes of serving requests in real time, powering the insights that help...


  • San Francisco, CA, United States LMArena Full time

    Senior Software Engineer, ML Infrastructure LMArena is seeking a Senior Software Engineer (Infrastructure) to lead the design and development of scalable, high-performance real-time data and API infrastructure. In this role, you'll architect systems that capture and process large volumes of serving requests in real time, powering the insights that help...


  • San Francisco, CA, United States LMArena Full time

    Senior Software Engineer, ML Infrastructure LMArena is seeking a Senior Software Engineer (Infrastructure) to lead the design and development of scalable, high-performance real-time data and API infrastructure. In this role, you'll architect systems that capture and process large volumes of serving requests in real time, powering the insights that help...


  • San Francisco, CA, United States LMArena Full time

    Senior Software Engineer, ML Infrastructure LMArena is seeking a Senior Software Engineer (Infrastructure) to lead the design and development of scalable, high-performance real-time data and API infrastructure. In this role, you'll architect systems that capture and process large volumes of serving requests in real time, powering the insights that help...


  • San Francisco, CA, United States LMArena Full time

    Senior Software Engineer, ML Infrastructure LMArena is seeking a Senior Software Engineer (Infrastructure) to lead the design and development of scalable, high-performance real-time data and API infrastructure. In this role, you'll architect systems that capture and process large volumes of serving requests in real time, powering the insights that help...


  • San Francisco, CA, United States Top Engineer Full time

    TOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Social Commerce / AI Technology Degree: BS in Computer Science or Mathematics from Top 40 University Experience: 4-8 years in Production ML Systems AI-POWERED SOCIAL COMMERCE REVOLUTION Role: Senior Machine Learning Engineer - Multimodal AI Join a leading partner in social...