Reinforcement Learning Engineer

7 days ago


Redwood City, California, United States Hammerhead AI Full time
About Hammerhead

We're unleashing AI with intelligent orchestration while addressing one of the most pressing bottlenecks for AI access to Power. Our cutting-edge platform optimizes data center power infrastructure to maximize AI token generation within existing electrical limits, without requiring new power plants or grid expansions. Our team has optimized over 8 gigawatts of mission-critical power globally, and we're addressing a $64 billion-per-year market opportunity while dramatically reducing the environmental footprint of AI infrastructure.

At Hammerhead, you will:

Work at the intersection of AI, energy, and compute creating the next generation AI infrastructure

Collaborate with colleagues that are experts in modern RL and AI, IoT and IIoT software, and infrastructure technologies

Contribute to building a more efficient and sustainable future for AI compute.

Join a company at the cutting edge of modern data center design and operation

Receive competitive compensation, equity, and benefits in a high-growth, mission-driven environment.

Learn from an experienced team that has built and sold startups before

Learn more about Hammerhead
  • These AutoGrid alums want to change how data centers use power
  • How Hammerhead Wants to Rewrite the Economics of AI
  • News & Blogs
Role Description

As a Reinforcement Learning Engineer, you will be the architect of the core intelligence for Hammerhead's ORCA platform. Reporting to the Head of AI / Reinforcement Learning Engineering, you will design, train, and deploy the Orchestrated RL Control Agents that form the brain of our system, making real-time decisions to optimize power and compute resources across physical data centers. This role is for a hands-on expert who is passionate about applying cutting-edge RL research to complex, real-world industrial systems. You will be instrumental in developing the models that control physical assets like cooling systems and power distribution units to unlock massive efficiency gains in AI workloads.

Key Responsibilities
  • RL Model Development: Design and implement advanced reinforcement learning algorithms (e.g., multi-agent RL, model-based RL, deep RL) for real-time control of data center infrastructure.
  • Simulation and Training: Build and train RL agents that can generalize to real-world, physical systems.
  • From Lab to Production: Lead the transition of RL models from research and simulation to live deployment within the ORCA platform, ensuring stability and performance on mission-critical hardware.
  • System Optimization: Analyze agent performance to continuously improve control strategies for tasks like peak shaving, workload shifting, and thermal management.
  • Cross-Functional Collaboration: Partner with platform engineers to define the APIs, data telemetry, and infrastructure needed to support and scale our RL agents across a global portfolio of data centers.
Qualifications
  • RL Expertise: Proven experience developing and implementing reinforcement learning algorithms, demonstrated through publications in top conferences (e.g., NeurIPS, ICML, ICLR), open-source contributions, or shipped products.
  • Industry Experience: 3+ years of experience applying RL to real-world problems, preferably in industrial automation, robotics, autonomous vehicles, energy systems, or other physical systems. Experience from a leading industrial or academic RL lab is highly desirable.
  • Technical Skills: Deep proficiency in Python and modern ML frameworks such as PyTorch, Jax, or TensorFlow. Experience with simulation platforms and RL libraries (e.g., Ray RLlib, Isaac Gym) is a plus.
  • Educational Background: MS or PhD in Computer Science, Robotics, Operations Research, or a related field with a focus on machine learning or control theory.
  • Problem Solver: You possess a strong theoretical background but are driven by practical application, with an ability to bridge the gap between RL theory and the constraints of physical, real-world systems.
What We Offer
  • Competitive salary, bonus, 401(k) plan and equity in a rapidly growing startup
  • Comprehensive health, dental, and vision coverage
  • Opportunity to apply the latest AI technologies working with an experienced team

Join our team to shape the foundation of tomorrow's AI infrastructure



  • Redwood City, California, United States Poshmark Full time

    About PoshmarkPoshmark is a leading fashion resale marketplace powered by a vibrant, highly engaged community of buyers and sellers and real-time social experiences. Designed to make online selling fun, more social and easier than ever, Poshmark empowers its sellers to turn their closet into a thriving business and share their style with the world. Since its...


  • Redwood City, California, United States Moloco Full time

    About Moloco:Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced AI that has historically been reserved for tech giants. Led by machine learning pioneers who built some of the most successful ad systems at Google,...

  • AI/ML Engineer

    1 week ago


    Redwood City, California, United States AHUM AI Full time

    Job Title: AI/ML EngineerLocation: Redwood City, CAJob Type: Full-TimeWho we areWe're a stealth-mode startup reimagining how modern enterprises secure and manage their workspace. We're building anAI-native platformthat delivers autonomous control, deep observability, and resilient device operations. Backed by top-tier investors and led by repeat founders,...


  • Redwood City, California, United States Equinix Full time $163,000 - $245,000

    Who are we? Equinix is the world's digital infrastructure company, shortening the path to connectivity to enable the innovations that enrich our work, life and planet. A place where tech thinkers and future builders turn bold ideas into breakthrough experiences, we welcome your unique perspective.Help us challenge assumptions, uncover bias, and remove...

  • Platform Engineer

    7 days ago


    Redwood City, California, United States Hammerhead AI Full time

    About HammerheadWe're unleashing AI with intelligent orchestration while addressing one of the most pressing bottlenecks for AI access to Power. Our cutting-edge platform optimizes data center power infrastructure to maximize AI token generation within existing electrical limits, without requiring new power plants or grid expansions. Our team has optimized...

  • Simulation Engineer

    1 week ago


    Redwood City, California, United States Hammerhead AI Full time

    About HammerheadWe're unleashing AI with intelligent orchestration while addressing one of the most pressing bottlenecks for AI access to Power. Our cutting-edge platform optimizes data center power infrastructure to maximize AI token generation within existing electrical limits, without requiring new power plants or grid expansions. Our team has optimized...


  • Redwood City, California, United States Learning Commons Full time

    Learning Commons is Mark Zuckerberg and Priscilla Chan's education initiative, which aims to scale proven teaching and learning practices to benefit every learner. Learning Commons became the name of our education efforts in 2025 to build on the Chan Zuckerberg Initiative's work over the past decade to advance learning science and help translate that...


  • Redwood City, California, United States Moloco Full time

    About Moloco:Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced AI that has historically been reserved for tech giants. Led by machine learning pioneers who built some of the most successful ad systems at Google,...

  • Solutions Engineer

    1 week ago


    Redwood City, California, United States DatologyAI Full time $230,000 - $300,000

    About the CompanyModels are what they eat. But a large portion of training compute is wasted training on data that are already learned, irrelevant, or even harmful, leading to worse models that cost more to train and deploy.At DatologyAI, we've built a state of the art data curation suite to automatically curate and optimize petabytes of data to create the...


  • Redwood City, California, United States Learning Commons Full time $190,000 - $261,800

    Learning Commons is Mark Zuckerberg and Priscilla Chan's education initiative, which aims to scale proven teaching and learning practices to benefit every learner. Learning Commons became the name of our education efforts in 2025 to build on the Chan Zuckerberg Initiative's work over the past decade to advance learning science and help translate that...