ML Ops Engineer
13 hours ago
Luminary Cloud is transforming how the world's most innovative companies generate vast amounts of CFD simulation data for Physics AI, design exploration, and optimization. Backed by Sutter Hill Ventures, our Series B startup is at the forefront of the transition to Physics-based AI through our scalable cloud platform.
Key Duties, Responsibilities, and Deliverables
- Build and maintain robust MLOps infrastructure enabling ML engineers and data scientists to train, track, and deploy models seamlessly without managing low-level Kubernetes infrastructure.
- Design and implement automated training pipelines and experiment tracking systems using modern MLOps frameworks including Kubeflow, MLflow, and Argo Workflows.
- Develop scalable data pipelines for large volumes of unstructured data, with particular focus on 3D geometric data using VTK and physics simulation outputs.
- Deploy machine learning models and set up production inference pipelines with focus on performance and reliability.
- Manage model registries and integrate them with automated workflows for seamless model lifecycle management.
- Implement comprehensive monitoring systems for continuous performance tracking of ML models in production environments.
- Collaborate with cross-functional teams to ensure MLOps infrastructure meets the evolving needs of physics-based AI applications.
- Write production-level code with velocity, maintaining high standards for performance and scalability.
- Optimize cloud infrastructure on Google Cloud Platform, leveraging Docker, Kubernetes, and Vertex AI for efficient resource utilization.
- Bachelor's degree or higher in Computer Science, Data Science, Statistics, Applied Mathematics, or related fields.
- 5+ years of industry experience in machine learning operations (MLOps), including model development, deployment, monitoring, and scaling ML systems in production environments.
- Proficiency in Python with demonstrated experience writing production-level code. Familiarity with BASH and SQL required.
- Solid experience with Google Cloud Platform (GCP), Docker, Kubernetes, and Vertex AI for cloud-based ML infrastructure.
- Hands-on experience with modern MLOps frameworks including Kubeflow, MLflow, and Argo Workflows.
- Strong experience building scalable data pipelines, particularly with large volumes of unstructured data and familiarity with VTK for 3D geometric data.
- Proven ability to independently deploy ML models and set up inference pipelines with monitoring capabilities.
- Experience managing model registries and integrating them with automated workflows.
- Strong problem-solving skills and ability to troubleshoot complex distributed systems.
- Physics Interest: Heavy interest in simulation technology and/or High-Performance Computing (HPC) with some exposure preferred. Understanding of how ML applies to physics simulations and scientific computing is valuable.
- MLOps at Scale: Demonstrated experience building and maintaining MLOps infrastructure at scale, with focus on reliability and performance in production environments.
- Technical Excellence: Hands-on approach with ability to write code with velocity while maintaining high quality standards. Experience with additional programming languages such as Go and C++ is a plus.
- Startup Environment: Curious and quick learner who thrives in a fast-paced environment. Clear communicator with a collaborative approach to working with diverse technical teams.
- In-Office Commitment: Enthusiastic about being in-office 5 days a week, contributing to our hands-on, collaborative engineering culture.
- Infrastructure Focus: Passionate about building systems that enable other engineers and scientists to be more productive with an understanding that great infrastructure should be invisible to end users.
-
ml ops
1 week ago
San Jose, California, United States Raas Infotek Full timePosition: ML OPSLocation: San Jose, CA(Onsite)ContractJob Description:We are looking for a skilled MLOps Engineer to join our team and help us build, deploy, and maintain robust and scalable machine learning systems. You will be responsible for the full lifecycle of our ML pipelines, from data ingestion to model serving. This is a hands-on role where you...
-
Manager, ML Ops
2 weeks ago
San Diego, California, United States ICW Group Full timeAre you looking to make an impactful difference in your work, yourself, and your community? Why settle for just a job when you can land a career? At ICW Group, we are hiring team members who are ready to use their skills, curiosity, and drive to be part of our journey as we strive to transform the insurance carrier space. We're proud to be in business for...
-
Senior ML Infrastructure Engineer
5 days ago
San Francisco, California, United States Gridware Full time $190,000 - $210,000About GridwareGridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response (AGR), focused on monitoring the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware's...
-
Software Engineer
5 days ago
San Diego, California, United States G2 Ops Full time Quick Position FactsLocation: San Diego, CA at our wonderful G2 Ops office and customer site.Work Setting: In person, some remote opportunity, and/or flexible working hours, not a fully remote position.Salary Range: $100,000+ plus comprehensive benefits package.Years of Industry Experience: 3+ years of relevant experience.Security Clearance...
-
Machine Learning Engineer III
6 days ago
San Mateo, California, United States Guidewire Software Full time $128,000 - $192,000SummaryJoin Guidewire's Product Strategy team in San Mateo, where we drive operational excellence and transformative innovation by embedding AI and GenAI across our product portfolio. Our mission is to deliver secure, scalable, and efficient solutions that create measurable value for customers worldwide. You'll collaborate in a culture that values curiosity,...
-
Senior Software Engineer, ML Platform
5 days ago
San Mateo, California, United States PlayStation Global Full timeWhy PlayStation?PlayStation isn't just the Best Place to Play — it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and...
-
Senior Software Engineer, ML Platform
3 days ago
San Mateo, California, United States Sony Interactive Entertainment Full timeWhy PlayStation?PlayStation isn't just the Best Place to Play — it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation5, PlayStation4, PlayStationVR, PlayStationPlus, acclaimed PlayStation software titles from PlayStation Studios, and...
-
Staff ML Engineer, ML Foundations
5 days ago
San Francisco, California, United States Stripe Full timeWho we areAbout StripeStripe's mission is to accelerate global economic and technological development. We offer financial infrastructure and a variety of services to serve the needs of a wide range of users, from startups to enterprises, with global scale and industry-leading reliability and product quality. All financial services businesses face a trade-off...
-
AI/ML Engineer/SME
1 week ago
San Diego, California, United States The Marlin Alliance, Inc. Full timeThe Marlin Alliance is seeking a forward-thinkingAI/ML Engineer/SMEin San Diego, CAto provide client support to our Navy client. This is an on-site role and applicants must have the ability to obtain a DoD Secret Clearance.Established in 2002, The Marlin Alliance is seeking to hire highly skilled individuals to support mission critical projects within the...
-
AI/ML engineer
3 days ago
San Francisco, California, United States Orbofi Full timeAbout Orbofi Orbofi is a pioneering AI-generated content engine, built specifically for Web3, games, and online communities. Our mission is to revolutionize the way content is created and consumed by harnessing the power of AI. We enable creators, developers, and businesses to generate high-quality, engaging content that captivates their audiences. We're a...