Founding Engineer, ML Performance

2 weeks ago


Los Angeles, California, United States | Isotron AI | Full time | $175,000 - $250,000 per year
About the Role

We're an early-stage stealth startup building a new kind of platform for generative media. Our mission is to enable the future of real-time generative applications: we're building the foundational tools and infrastructure that make entirely new categories of generative experiences and applications finally possible.

We're a small, focused team of ex-YC and unicorn founders and senior engineers with deep experience across 3D, generative video, developer platforms, and creative tools. We're backed by top-tier investors and angels, and we're building a new technical foundation purpose-built for the next era of generative media.

We're operating at the edge of what's technically possible: high-performance inference and real-time orchestration of multimodal models. As one of our founding engineers, you'll play a key role in architecting the core platform, shaping system design decisions, and owning critical infrastructure from day one.

If you're excited about architecting and building high-performance infrastructure that empowers the next generation of developers and unlocks entirely new product categories, we'd love to talk.

About the Role

We're looking for a Founding Engineer, ML Performance & Systems with deep expertise in high-performance ML infrastructure. This is a highly technical, high-impact role focused on squeezing every drop of performance from real-time generative media models.

You'll work across the model-serving stack, designing novel architectures, optimizing inference performance, and shaping Reactor's competitive edge in ultra-low-latency, high-throughput environments.

What You'll Do
  • Drive our frontier position on real-time model performance for diffusion models
  • Design and implement a high-performance in-house inference engine
  • Focus on maximizing throughput and minimizing latency and resource usage
  • Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities

Requirements

About You
  • Strong foundation in systems programming, with a track record of identifying and resolving bottlenecks
  • Deep expertise in the ML infrastructure stack:
    • PyTorch, TensorRT, TransformerEngine, Nsight
    • Model compilation, quantization, and advanced serving architectures
  • Working knowledge of GPU hardware (NVIDIA) and the ability to dive deep into the stack as needed (e.g., writing custom GEMM kernels with CUTLASS)
  • Proficient in CUDA, or willing to learn it and bringing comparable experience in low-level accelerator programming
  • Excited by the frontier of multi-dimensional model parallelism (e.g., combining tensor, context, and sequence parallelism)
  • Familiarity with the internals of cutting-edge techniques such as Ring Attention, FlashAttention-3 (FA3), and FusedMLP implementations
Minimum Qualifications
  • Expertise in systems programming (C++, CUDA)
  • Experience optimizing ML inference on GPUs
  • Proficient with PyTorch and tools like TensorRT
  • Deep understanding of NVIDIA GPU architecture
  • Familiar with model serving, compilation, and quantization

Benefits

  • Competitive SF salary and foundational team equity

