Current jobs related to DevOps Infrastructure Engineer for AI Workload Orchestration - San Francisco, California - Together AI
-
AI Infrastructure Systems Developer
1 week ago
San Francisco, California, United States ZipRecruiter Full timeJob OverviewZipRecruiter is seeking an experienced American I.A. Infrastructure Systems Developer to join our team in the United States, with a salary range of $170,000 - $200,000 per year.The successful candidate will be responsible for designing and building infrastructure that supports cutting-edge AI solutions, working closely with data scientists and...
-
Senior DevOps Engineer
4 weeks ago
San Francisco, California, United States Together AI Full timeJob SummaryWe are seeking a highly skilled Senior DevOps Engineer to join our cloud engineering organization. As a key member of our team, you will be responsible for developing and maintaining the infrastructure for our AI workloads, ensuring scalability, reliability, and high performance. Key Responsibilities- Design and implement automated infrastructure...
-
AI Infrastructure Architect
2 weeks ago
San Francisco, California, United States Together AI Full timeAbout the Role">We are seeking a highly skilled DevOps Engineer to join our team at Together AI. As an MLOps engineer, you will develop systems and APIs that enable our customers to perform inference and fine-tune LLMs.">Key Responsibilities">Implement runtime systems that perform inference at scale using AI/ML models from simple models up to the largest...
-
AI Infrastructure Specialist
3 weeks ago
San Francisco, California, United States Together AI Full timeCompany Overview:At Together AI, we believe open and transparent AI systems will drive innovation and create the best outcomes for society. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama.Job Description:We are seeking an experienced MLOps engineer to develop systems and APIs that enable our customers...
-
San Francisco, California, United States Perplexity AI Full timeAI-Driven Search Solutions: Technical Lead PositionWe're looking for an experienced Senior DevOps Engineer to join our team at Perplexity AI. As a key member of our infrastructure team, you'll play a crucial role in shaping the technical direction and implementing scalable solutions for our rapidly growing search platform.Technical RequirementsYou will be...
-
AI Infrastructure Specialist
1 week ago
San Francisco, California, United States Unreal Gigs Full timeUnreal Gigs is seeking an experienced AI Infrastructure Specialist to design, automate, and manage robust machine learning pipelines. Job OverviewThis role involves building scalable infrastructure for AI workloads, automating workflows, and developing tools that enable continuous integration and continuous delivery (CI/CD) of ML...
-
AI Infrastructure Software Engineer
6 days ago
San Francisco, California, United States Magic AI Full timeMagic AI is a pioneering company building safe Artificial General Intelligence (AGI) to accelerate humanity's progress on the world's most pressing challenges. Our mission is to develop AGI that complements human capabilities, rather than replacing them.The Supercomputing Platform & Infrastructure team at Magic AI is responsible for designing and...
-
DevOps Engineer for AI Solutions
2 weeks ago
San Francisco, California, United States WEX Full timeOverview:WEX is an innovative global commerce platform and payments technology company looking to forge the way in a rapidly changing environment. We are journeying to build a consistent world-class user experience across our products and services and leverage customer-focused innovations across all our strategic initiatives, including big data, AI, and...
-
AI Infrastructure Solutions Architect
1 week ago
San Francisco, California, United States Unreal Gigs Full timeUnreal Gigs: AI Infrastructure Solutions ArchitectWe are seeking an experienced AI Infrastructure Solutions Architect to join our team at Unreal Gigs.About the Role:The successful candidate will design, deploy, and maintain the infrastructure that powers AI innovation. This role involves collaborating with data scientists, software engineers, and DevOps...
-
AI Infrastructure Engineer
4 weeks ago
San Francisco, California, United States Naptha AI Full timeAbout Naptha AIWe are seeking exceptional Software Engineering interns to join Naptha AI and contribute to building the future of AI agent infrastructure.This internship offers hands-on experience working with frontier AI technology, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.As...
-
AI Infrastructure Systems Architect
3 weeks ago
San Francisco, California, United States ZipRecruiter Full timeJob Title:AI Infrastructure Systems ArchitectAbout the Role:We are seeking an experienced AI Infrastructure Systems Architect to design and build scalable infrastructure that supports AI workloads. The ideal candidate will have a deep understanding of cloud and on-premise infrastructure solutions and be able to optimize them for AI.Key...
-
AI Infrastructure Specialist
2 weeks ago
San Francisco, California, United States Unreal Gigs Full timeJob Summary">We are seeking an experienced AI Infrastructure Specialist to join our team at Unreal Gigs. As an AI Infrastructure Specialist, you will play a critical role in designing, building, and maintaining the infrastructure that supports machine learning and AI workloads.The ideal candidate will have deep expertise in cloud platforms, containerization,...
-
Cloud AI Infrastructure Architect
2 weeks ago
San Francisco, California, United States WEX Full timeOverview:Achieve technical excellence in AI infrastructure development with WEX, a leading global commerce platform and payments technology company. We're seeking an experienced Staff Cloud Engineer to spearhead our AI infrastructure initiatives, leveraging cloud-based solutions and cutting-edge technologies.About the Role:This is an exceptional opportunity...
-
San Francisco, California, United States WEX, Inc. Full timeAbout WEX, Inc.WEX is an innovative global commerce platform and payments technology company that aims to simplify the business of doing business for customers. We are on a mission to create a consistent world-class user experience across our products and services, leveraging customer-focused innovations in big data, AI, and Risk.We are looking for a highly...
-
AI Engineering Team Member
2 weeks ago
San Francisco, California, United States WEX Full timeWe are seeking a talented AI Engineering Team Member to join our team at WEX. As a key member of our AI infrastructure team, you will work closely with data scientists, ML engineers, and stakeholders to understand the requirements and challenges of AI/ML workloads. Your primary responsibility will be to design, implement, and maintain highly scalable and...
-
Technical Leader for AI Infrastructure
7 days ago
San Francisco, California, United States Naptha AI Full timeCompany OverviewNaptha AI is a pre-seed company that aims to revolutionize AI agent infrastructure. Our team has deep expertise in AI and distributed systems, and we are looking for experienced technical leaders to help shape our technical strategy.SalaryWe offer a highly competitive salary, with the amount based on your experience and qualifications. The...
-
AI Infrastructure Specialist
2 weeks ago
San Francisco, California, United States ZipRecruiter Full timeJob DescriptionWe're looking for a highly skilled Ai Infrastructure Specialist to join our team of engineers and data scientists. As an AI Infrastructure Specialist, you'll play a key role in designing, building, and optimizing our AI infrastructure to support the needs of our organization.About the RoleDesign and Build Infrastructure: Design and build...
-
AI Infrastructure Specialist
1 month ago
San Francisco, California, United States Unreal Gigs Full timeJob OverviewWe are seeking a highly skilled AI Infrastructure Specialist to join our team at Unreal Gigs. As an AI Infrastructure Specialist, you will be responsible for designing, building, and managing scalable infrastructure for machine learning workloads.The ideal candidate will have strong experience with cloud platforms such as AWS, GCP, or Azure, and...
-
AI Engineering Specialist
2 weeks ago
San Francisco, California, United States WEX Full timeAbout WEX and the RoleAt WEX, we are forging the way in a rapidly changing environment by simplifying the business of doing business for customers. As a member of our AI Infrastructure team, you will play a pivotal role in enabling these advancements. You will join a team of highly talented and skillful engineers and leaders who will support, guide, and...
-
Visionary DevOps Engineer
1 week ago
San Francisco, California, United States Phonely Full timeAbout PhonelyWe're revolutionizing customer support with cutting-edge voice AI technology, and we need a talented DevOps engineer to join our mission. Our platform enables businesses to build, simulate, and deploy voice agents seamlessly.To achieve 99.9% reliability, we've developed fine-tuned LLMs, rigorous simulation testing, and automated monitoring...
DevOps Infrastructure Engineer for AI Workload Orchestration
1 month ago
Are you a skilled DevOps engineer looking to take your career to the next level? Do you have a passion for designing and building automated infrastructure pipelines? We are seeking a talented Senior DevOps Engineer to join our cloud engineering team at Together AI.
About the RoleWe are hiring a highly experienced Senior DevOps Engineer to lead the development of software and processes for orchestrating AI workloads over large fleets of distributed GPU hardware. In this role, you will be part of a dynamic team that aims to automate everything and build failure-resistant and horizontally scalable cloud infrastructure for GPU-resident applications.
Key Responsibilities- Create highly automated infrastructure pipelines for deploying and scaling distributed and multi-tenant GPU-resident compute to new cloud and data center environments
- Design and build advanced CI/CD pipeline frameworks to ensure efficient and reliable deployment of AI models
- Implement tools to facilitate greater automation and operability of services, ensuring seamless integration with existing infrastructure
- Architect, deploy, and scale observability infrastructure to monitor and analyze system performance
- Collaborate with cross-functional teams to identify areas for improvement and implement best practices to enhance overall system reliability and efficiency
- Minimum of 5 years of relevant experience in DevOps, cloud computing, data center operations, SRE, and Linux systems administration
- Experience in programming languages such as Java, Python, Go, or C++
- Familiarity with cloud computing toolsets like Terraform, Vault, and Packer
- Strong understanding of configuration management tools like Ansible, Pulumi, Chef, and Puppet
- Experience with Kubernetes, containerization, and VPNs
Together AI is a research-driven artificial intelligence company dedicated to advancing the field of AI through open and transparent systems. We believe that by co-designing software, hardware, algorithms, and models, we can significantly lower the cost of modern AI systems and drive innovation for society. Our team has made significant contributions to leading open-source research, models, and datasets, and we invite you to join us on our mission to build the next generation AI infrastructure.
CompensationWe offer competitive compensation, including a base salary range of $160,000 - $230,000 + equity + benefits. Individual compensation will be determined by experience, skills, and job-related knowledge.