Lead DevOps Engineer
2 days ago
Lead DevOps Engineer - AI and Machine Learning
Join a dynamic and innovative Series A AI Lab, backed by leading investors and advised by luminaries in generative and interactive media. We are seeking a talented Lead DevOps Engineer.
This lab, supported by top global VCs and experts from OpenAI, DeepMind, Meta, and more, represents a fusion of advanced AI research and startup agility, led by a team with deep expertise in both cutting-edge AI technology and large-scale distributed systems.
As the Lead DevOps Engineer, you will engage directly with the founders to architect, develop, and scale the computing infrastructure that underpins the next generation of AI solutions. Your responsibilities will include designing and optimizing our inference platform, managing GPU-based training clusters, and refining data processing pipelines that fuel real-time creativity and discovery. You will be instrumental in scaling systems for research and production, ensuring low-latency performance, high availability, and efficient utilization of petabyte-scale data and model-serving workloads.
Key Experience Required
- At least 5 years of experience in Software or ML Infrastructure Engineering.
- Extensive experience with distributed systems and GPU orchestration tailored for high-performance Machine Learning workloads.
- Strong proficiency in Python, Go, or similar programming languages, along with a solid understanding of software engineering best practices.
- Hands-on experience with Kubernetes, Docker, and Infrastructure as Code (IaC) using Terraform.
- Demonstrated ability to optimize model serving and data pipelines for improved latency and scalability.
- A passion for building and innovating – thriving in situations with ambiguity, selecting the best tools for the task, and delivering results.
This is an exciting opportunity to be part of a team at the forefront of real-time AI systems. If you're ready to make an impact, please apply as soon as possible.
-
Principal DevOps Engineer
2 days ago
Menlo Park, CA, United States Strativ Group Full time
Principal DevOps Engineer - AI/ML
We are partnered with a Series A AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Principal DevOps Engineer. They're backed by leading global VCs and AI research leaders (from OpenAI, DeepMind, Meta, and others), and guided by renowned figures in...
-
Principal DevOps Engineer
11 hours ago
Menlo Park, CA, United States Strativ Group Full time
Principal DevOps Engineer - AI/ML
We are partnered with a Series A AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Principal DevOps Engineer. They're backed by leading global VCs and AI research leaders (from OpenAI, DeepMind, Meta, and others), and guided by renowned figures in...
-
Principal DevOps Engineer
16 hours ago
Menlo Park, CA, United States Strativ Group Full time
Principal DevOps Engineer - AI/ML
We are partnered with a Series A AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Principal DevOps Engineer. They're backed by leading global VCs and AI research leaders (from OpenAI, DeepMind, Meta, and others), and guided by renowned figures in...
-
Senior DevOps Engineer/SRE
2 weeks ago
Menlo Park, CA, United States Saxon Global Full time
- 7+ years in DevOps/SRE/Platform Engineering
- Strong experience with Kubernetes (EKS), Helm, networking, and security
- Expertise in CI/CD pipelines using Harness and Git
- Proficient in AWS services (EKS, EC2, S3, IAM, RDS, etc.)
- Strong scripting skills in Python or Golang
- Experience with Linux systems, performance tuning, and troubleshooting
- Familiarity with...
-
Lead Infra Engineer
1 week ago
Menlo Park, CA, United States Develop Health Full time
Develop Health is on a mission to use AI to radically accelerate access to life-saving medications. By automating complex, manual healthcare processes (like benefit verification and prior authorization), we've grown from $0 to >$10M in annual recurring revenue in less than 2 years, and currently help more than 400,000 new patients every month. We're partnering...
-
Software Engineer
2 weeks ago
Menlo Park, CA, United States Reconstruct Full time
About the job: Software Engineer - SaaS Platform
How often do you get the chance to make a global impact developing the latest AI inside of the "built world"? Reconstruct's Visual Command Center (VCC) uses AI and Machine Learning inside of computer vision to track the lifecycle of large capital assets like data centers, airports, hospitals, water treatment...
-
Software Engineer SaaS Platform
1 week ago
Menlo Park, CA, United States Reconstruct Full time
About the job: Software Engineer – SaaS Platform
How often do you get the chance to make a global impact developing the latest AI inside of the “built world”? Reconstruct's Visual Command Center (VCC) uses AI and Machine Learning inside of computer vision to track the lifecycle of large capital assets like data centers, airports, hospitals, water...
-
Software Engineer
1 hour ago
Menlo Park, CA, United States Reconstruct Full time
About the job: Software Engineer - SaaS Platform
How often do you get the chance to make a global impact developing the latest AI inside of the "built world"? Reconstruct's Visual Command Center (VCC) uses AI and Machine Learning inside of computer vision to track the lifecycle of large capital assets like data centers, airports, hospitals, water treatment...
-
Lead Mechanical Engineer
2 weeks ago
Menlo Park, CA, United States SLAC National Accelerator Laboratory Full time
Lead Mechanical Engineer - LCLS
Job ID: 6542
Location: SLAC - Menlo Park, CA
Full-Time Regular
Position Overview: The Linac Coherent Light Source (LCLS) is an internationally preeminent science facility, operated by Stanford University at the SLAC National Accelerator Laboratory on behalf of the Department of Energy, Office of Science. This...