Platform ML Engineering Manager, Model Graph
3 weeks ago
The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference.
Our priorities are to maximize training throughput (how quickly we can train a new model) and researcher throughput (how quickly we can develop new models) with the goal of accelerating progress towards AGI. We frequently collaborate with other teams to speed up the development of new capabilities.
About the Role
We are looking for an experienced engineering manager to help lead critical work on model definition and efficient distributed execution within our shared internal training stack. Our internal training stack is used by Research for large scale and small scale runs.
In this role, you will:
- Reduce the time it takes to try out new architecture ideas for training new models and increase the robustness of model code.
- Collaborate closely with researchers and other systems engineers to maximize the benefits of our shared internal training stack.
- Make it feasible to get SOTA throughput for our most important research models.
- Hire world-class AI systems engineers in one of the most competitive hiring markets.
- Coordinate the training needs of OpenAI's research teams.
- Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think.
- Have 3+ years of experience in engineering management and 7+ years as an IC working with high scale distributed systems and ML systems.
- Have experience with ML systems, particularly high scale distributed training or inference for modern LLMs.
- Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.
- Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
-
Platform ML Engineering Manager, Training
1 month ago
San Francisco, United States OpenAI Full timeAbout the Team The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. Our priorities are to maximize training throughput (how quickly we can train a new model)...
-
Platform ML Engineering Manager, Training
7 days ago
San Francisco, United States OpenAI Full timeAbout the Team The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. Our priorities are to maximize training throughput (how quickly we can train a new model)...
-
Software Engineer, ML Infrastructure
4 weeks ago
San Francisco, United States Scale AI, Inc. Full timeAs a software engineer on the ML Infrastructure team, you will work on developing the platform for orchestrating post-training and model evaluation jobs. At Scale, we are constantly developing new data sources and running experiments to understand their impact on ML models. To support this effort, we are looking for engineers who are comfortable navigating...
-
Machine Learning Architect
14 hours ago
San Francisco, United States Salesforce.Com Inc Full timeTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
Machine Learning Architect
3 weeks ago
San Francisco, United States Salesforce, Inc. Full timeMachine Learning Architect - Search & Knowledge GraphsAbout SalesforceWe’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your...
-
ML Platform Engineer
1 month ago
San Francisco, United States Abridge Al, Inc Full timeAbridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most-their patients. Our enterprise-grade technology transforms patient-clinician conversations into...
-
Machine Learning Architect
3 weeks ago
San Francisco, United States salesforce.com, inc. Full timeTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category: Software EngineeringAbout SalesforceWe're Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
ML Platform Engineer
7 days ago
San Francisco, United States Abridge Al, Inc Full timeAbridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most-their patients. Our enterprise-grade technology transforms patient-clinician conversations into...
-
Senior Manager, AI/ML Platform
4 weeks ago
San Jose, United States PayPal Full timeThe CompanyPayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy.We operate a global, two-sided network at scale that...
-
Principal Product Manager, ML Platform
3 weeks ago
San Francisco, United States The Product Folks Full timeAdobe is the global leader in digital media and digital marketing solutions. Our creative, marketing and document solutions empower everyone – from emerging artists to global brands – to bring digital creations to life and deliver immersive, compelling experiences to the right person at the right moment for the best results. In short, Adobe is...
-
San Francisco, California, United States Capital One Full timeCapital One is seeking an experienced engineering leader to lead our AI and ML platform. This role will involve managing and growing a team of software engineers, defining strategy and roadmap, and driving delivery of converged interaction patterns for our enterprise AI and ML platforms. The ideal candidate will have strong technical acumen, excellent...
-
San Francisco, California, United States Oleria Corp. Full timeJob SummaryWe are seeking an exceptional Principal AI/ML Engineer to join our creative team at Oleria Corp. as part of our mission to revolutionize access control solutions for enterprise cloud applications. As a key member of our engineering team, you will play a crucial role in building a data-driven, autonomous identity security platform that leverages AI...
-
Founding ML Engineer
17 hours ago
San Francisco, United States HealthLeap Inc. Full timeMake a difference in the future of healthcare Join an early stage team working to better diagnose and treat patients Location Type Full time Department HealthLeap is pioneering AI-driven healthcare solutions, starting with malnutrition - one of medicine's most under diagnosed conditions.We are developing tools to identify, treat, and prevent malnutrition,...
-
Senior Data Science Lead
5 days ago
San Francisco, California, United States Programmers Full timeJob OverviewWe are seeking an experienced Senior Data Science Lead to oversee the development of our AI/ML platform.Key Responsibilities:Design and develop a robust AI/ML platform that prioritizes accuracy, security, and efficiencyLead agile workstreams from requirement gathering to creating actionable task plans for the teamProvide coaching and mentorship...
-
Principal Applied AI/ML Engineer
7 days ago
San Francisco, United States Oleria Security Full timeCompany Overview We're seeking an exceptional Principal AI/ML Engineer to join our creative team. Oleria is an enterprise cybersecurity startup founded by notable industry senior leaders Jim Alkove and Jagadeesh Kunda, with deep security, data, and SaaS experience building and securing some of the world's largest platforms and products used by billions of...
-
Data Scientist
1 week ago
San Francisco, United States NovumTech Partners Full timeResponsibilitiesWorking as part of our team researching and develop machine learning modelsArchitecting ML training, validation and inference pipelinesDesigning and implementing approaches to maximizing the potential of data in AI modelsDefining creative solutions to deep problems, and communicating your ideas to the teamRequirementsPhD or masters in a...
-
Machine Learning Model Architect
5 days ago
San Francisco, California, United States NovumTech Partners Full timeJob SummaryWe are seeking a highly skilled Machine Learning Model Architect to join our team at NovumTech Partners. As a key member of our research and development team, you will be responsible for designing and implementing AI models that drive business growth and innovation.About the RoleThe successful candidate will have a strong background in machine...
-
Principal Applied AI/ML Engineer
4 weeks ago
San Francisco, United States Oleria Security Full timeCompany Overview We're seeking an exceptional Principal AI/ML Engineer to join our creative team. Oleria is an enterprise cybersecurity startup founded by notable industry senior leaders Jim Alkove and Jagadeesh Kunda, with deep security, data, and SaaS experience building and securing some of the world's largest platforms and products used by billions of...
-
Principal Applied AI/ML Engineer
4 weeks ago
San Francisco, United States Oleria Corp. Full timeCompany OverviewWe’re seeking an exceptional Principal AI/ML Engineer to join our creative team. Oleria is an enterprise cybersecurity startup founded by notable industry senior leaders Jim Alkove and Jagadeesh Kunda, with deep security, data, and SaaS experience building and securing some of the world’s largest platforms and products used by billions of...
-
ML Operations Engineer
7 days ago
San Francisco, United States RemoteWorker CA Full timeCompany Overview: Welcome to the forefront of machine learning operations! At our company, we're driving the next wave of AI revolution through cutting-edge ML operations technologies. Our mission is to develop scalable and reliable ML systems that empower businesses and revolutionize industries. Join us and be part of a dynamic team committed to pushing the...