Frontiers Infrastructure Engineer
3 weeks ago
The Frontiers Infrastructure team builds the low level framework components to power our ML training systems. We work on building robust, debuggable, high performance libraries to support our distributed training workloads. Our priorities are to maximize the productivity of our researchers and our hardware, with the goal of accelerating progress towards AGI.
About the Role
As an Infrastructure Engineer, you will work to deliver powerful APIs orchestrating thousands of computers moving/persisting vast amounts of data. This requires both providing easy to use, introspectible systems that can promote a fast debugging/development cycle, while also enabling that experience to scale to our newest supercomputers maintaining stability and performance throughout. We’re looking for people who love working with an end to end system distributed across our supercomputers. We want someone excited by the rapid pace of responding to the dynamic and evolving needs of our training systems architectures.
This role is based in San Francisco, CA.
We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
- Work across our Python stack
- Profile and optimize and help design for scale our compute and data capabilities
- Build and maintain tools used by researchers
- Work on deploying our training framework to our latest supercomputers rapidly responding to the changing shapes and needs of the ML systems
You might thrive in this role if you:
- Have worked on large distributed systems
- Love figuring out how systems work and continuously come up with ideas for how to make them faster while minimizing complexity and maintenance burden
- Have strong software engineering skills and are proficient in Python
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
#J-18808-Ljbffr-
Frontiers Infrastructure Engineer
6 days ago
San Francisco, United States OpenAI Full timeThe Frontiers Infrastructure team builds the low level framework components to power our ML training systems. We work on building robust, debuggable, high performance libraries to support our distributed training workloads. Our priorities are to maximize the productivity of our researchers and our hardware, with the goal of accelerating progress towards AGI....
-
AI Infrastructure Engineer
3 days ago
San Francisco, California, United States Naptha AI Full timeAbout Naptha AIWe are seeking exceptional Software Engineering interns to join Naptha AI and contribute to building the future of AI agent infrastructure.This internship offers hands-on experience working with frontier AI technology, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.As...
-
Senior/Lead/Principal Software Engineer
4 weeks ago
San Francisco, United States salesforce Full timeTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category: Software EngineeringAbout Salesforce:We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
AI Data Infrastructure Engineer
6 days ago
San Francisco, California, United States Magic AI Full timeCompany OverviewMagic AI is a cutting-edge technology company dedicated to building safe Artificial General Intelligence (AGI) that accelerates humanity's progress on the world's most important problems.We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than...
-
Senior/Lead/Principal Software Engineer
4 weeks ago
San Francisco, United States salesforce Full timeTo get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job Category: Software EngineeringAbout SalesforceWe’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every...
-
Infrastructure Engineer
3 weeks ago
San Francisco, United States Recruiting From Scratch Full timeWho is Recruiting from Scratch: Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. Our team is 100% remote and we work with teams across North America, South America, and Europe to help them hire. Why Us We are bringing general intelligence to government contracting. As an early member of the team, you’ll...
-
San Francisco, California, United States E-Frontiers Full timeAbout E-FrontiersWe are a global leader in financial technology managed services and IT infrastructure products, providing cutting-edge solutions to Capital Markets firms worldwide.Our team is committed to empowering our clients with innovative technologies, enabling them to make informed investment decisions and optimize their trading strategies.This role...
-
Enterprise Desktop Support Specialist
7 days ago
San Francisco, California, United States E-Frontiers Full timeAbout the RoleWe are seeking an experienced Enterprise Desktop Support Specialist to join our team at E-Frontiers. In this role, you will provide critical technical support and maintenance for our cutting-edge systems at a financial services client located in San Francisco.You will collaborate closely with both internal teams and the client's technical staff...
-
San Francisco, United States Scale AI, Inc. Full timeAbout ScaleAt Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including: generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we're accelerating the abundance of frontier data to pave...
-
San Francisco, United States Scale AI, Inc. Full timeAbout ScaleAt Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including: generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we're accelerating the abundance of frontier data to pave...
-
Senior Software Architect, Frontier Data
17 hours ago
San Francisco, California, United States Scale AI, Inc. Full timeAbout Scale AI, Inc.At Scale AI, Inc., our mission is to accelerate the development of AI applications. For years, Scale has been the leading AI data foundry, helping fuel advancements in AI, including generative AI, defense applications, and autonomous vehicles. With recent investments, we're accelerating the abundance of frontier data to pave the road to...
-
Senior DevOps Engineer
1 day ago
San Francisco, California, United States Together AI Full timeJob SummaryWe are seeking a highly skilled Senior DevOps Engineer to join our cloud engineering organization. As a key member of our team, you will be responsible for developing and maintaining the infrastructure for our AI workloads, ensuring scalability, reliability, and high performance. Key Responsibilities- Design and implement automated infrastructure...
-
Software Engineer
2 weeks ago
San Francisco, United States ZipRecruiter Full timeJob DescriptionMagic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific...
-
Infrastructure Engineer
4 weeks ago
San Francisco, United States Factory Full timeFactory is seeking a seasoned Infrastructure Engineer to architect, build, and maintain our advanced cloud infrastructure.What you will do and achieve:Lead the design and implementation of a robust, secure, and highly scalable cloud infrastructure, utilizing cutting-edge tools like Docker and Terraform.Work in close collaboration with product teams and...
-
Infrastructure Engineer
4 weeks ago
San Francisco, United States Resolve Full timeAbout Resolve AIResolve is building AI that operates as a Production Engineer. It investigates and resolves incidents, and handles operational tasks enhancing system reliability, and making on-call stress-free.Our founders (Spiros Xanthos and Mayank Agarwal) are the core creators of OpenTelemetry and led Splunk Observability. They have 2 successful exits to...
-
Infrastructure Engineer
4 weeks ago
San Francisco, United States Rollbar, Inc. Full timeInngest is solving long-standing developer problems related to queueing, event-driven systems, and step functions in a novel way — which means we’re creating first-of-its-kind solutions.Infrastructure engineering is a critical part of Inngest. It involves everything from K8S, Terraform, and Ansible playbooks (for bare metal) to developing high-throughput...
-
Research Engineer, Preparedness
3 weeks ago
San Francisco, United States OpenAI Full timeResearch Engineer, Preparedness | OpenAI | OpenAICareersResearch Engineer, PreparednessSafety Systems - San FranciscoAbout the teamThe Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit society and is at the forefront of OpenAI's mission to build and deploy safe AGI,...
-
Infrastructure Talent Pool
4 weeks ago
San Francisco, United States Cohere Full timeWho are we?Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.We obsess over what we...
-
Mainframe Infrastructure Engineer
1 week ago
San Francisco, United States Tekfortune Inc Full timeRole: Mainframe Infrastructure Engineer Location: Remote Job Description: We are looking for a versatile and highly skilled Mainframe Infrastructure Engineer with expertise in observability tooling across various channels to enhance availability and expedite incident resolution. The ideal candidate will design, implement, and lead infrastructure solutions,...
-
Technical Infrastructure Engineer
5 days ago
San Francisco, California, United States ZipRecruiter Full timeAbout Us:At Parafin, our mission is to empower small businesses. Small businesses are the backbone of our economy, yet traditional financial institutions often overlook their needs. Parafin is a technology company that builds innovative infrastructure which enables small businesses to access financial services seamlessly via platforms they sell on. Our first...