ML Infrastructure Engineer
3 weeks ago
- Writing code rather than documents
- Shipping products rather than talking about roadmaps
- Big features rather than changing button colors
- Enabling teams to quickly test and iterate on their ML hypotheses via ML training capabilities, reliable GPU compute infrastructure and experimentation tools such as distributed deep learning libraries and Python notebooks
- Integrating X's GPU compute environment with large scale data and inference pipelines
- Collaborating with cross-functional teams to integrate machine learning models into our platform
- Ensuring scalability and efficiency of machine learning systems
- Work across the stack to solve problems independently
- Mentoring junior engineers and contributing to the team's growth
- Bachelor, Master, Post-graduate or PhD in computer science, computing engineering, machine learning, information retrieval, recommendation systems, natural language processing, statistics, math, engineering, operations research, or other quantitative discipline; or equivalent work experience
- 2+ years of industry experience (4+ for Senior) working with high traffic or large data production environments, distributed systems, backend infrastructure, recommender systems and/or deep learning applications
- 2+ years experience (4+ for Senior) with ML problems and platform tools either through first-hand modeling or close collaboration with modeling engineers or data scientists
- Working knowledge of Jupyter notebooks and Python, plus experience with a compiled language, such as Scala, Java or C++
- You stay up-to-date on Machine Learning and Deep Learning industry trends
- You have low level understanding of compute systems such as distributed storage, NVIDIA drivers and CUDA toolkits
- You are comfortable with Linux systems
- You have worked with Slurm scheduler, Puppet or Ansible
-
ML infrastructure engineer
3 weeks ago
San Francisco, CA, United States Replicate, Inc. Full timeYou're an infrastructure engineer, ideally with ML experience. We're growing fast and need your help scaling. We serve machine learning models. We deal with GPUs, optimize models, write prediction servers, set up clusters, and so on. All the hard stuff that companies doing ML would rather not deal with. Instead of being an ML infrastructure engineer at a...
-
ML infrastructure engineer
3 weeks ago
San Francisco, CA, United States Replicate, Inc. Full timeYou're an infrastructure engineer, ideally with ML experience. We're growing fast and need your help scaling. We serve machine learning models. We deal with GPUs, optimize models, write prediction servers, set up clusters, and so on. All the hard stuff that companies doing ML would rather not deal with. Instead of being an ML infrastructure...
-
ML/AI Engineer, Infrastructure
2 months ago
San Francisco, CA, United States Figma Full timeWe’re looking for engineers with a Machine Learning and Artificial Intelligence background to improve our products and build new capabilities. You will be building the core infrastructure to serve and deploy models efficiently, as well as world-class tooling that enables us to iterate on models quickly. . You will be combining industry best practices and...
-
Software Engineer
3 weeks ago
San Francisco, CA, United States Karkidi Full timeYou’ll help in executing the roadmap for data infrastructure and systems to power the world’s first AI recruiter built by Moonhub. You'll play a pivotal role in the development of tools and infrastructure that democratize data access and enable core capabilities across the organization You’ll architect offline and online data pipelines to...
-
Software Engineer, ML Infrastructure
1 month ago
San Francisco, United States Twelvelabs Full timeWho we are We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building...
-
Software Engineer, ML Infrastructure
3 weeks ago
San Francisco, United States Instabase Full timeAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...
-
Software Engineer, ML Infrastructure
2 weeks ago
San Francisco, United States Instabase Full timeAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...
-
Software Engineer, ML Infrastructure
3 weeks ago
San Francisco, United States Instabase Full timeAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...
-
Infrastructure Engineering
3 weeks ago
San Francisco, CA, United States X Corp. Full timeAre you prepared to join the X team and help build the ultimate real-time information-sharing app, revolutionizing how people connect? At X, we're on a mission to become a trusted global digital public square, committed to minimal censorship within legal boundaries. Our goal is to empower every user to freely create and share ideas, fostering open public...
-
Staff Engineer, ML Infrastructure
7 days ago
San Francisco, United States Stripe Full timeWho we are About Stripe Stripe is a financial infrastructure platform for businesses. Millions of companies-from the world's largest enterprises to the most ambitious startups-use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of...
-
Director of Engineering, ML
2 months ago
San Francisco, CA, United States Twelve Labs Full timeWho we are We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building...
-
ML Operations Engineer
3 weeks ago
San Francisco, CA, United States Unreal Gigs Full timeCompany Overview: Welcome to the forefront of machine learning operations! At our company, we're driving the next wave of AI revolution through cutting-edge ML operations technologies. Our mission is to develop scalable and reliable ML systems that empower businesses and revolutionize industries. Join us and be part of a dynamic team committed to...
-
Senior Software Engineer, Infrastructure
3 weeks ago
San Francisco, CA, United States CentML Full timeAbout Us We believe AI will fundamentally transform how people live and work. CentML's mission is to massively reduce the cost of developing and deploying ML models so we can enable anyone to harness the power of AI and everyone to benefit from its potential. Our founding team is made up of experts in AI, compilers, and ML hardware and has led efforts...
-
San Francisco, CA, United States Twelve Labs Full timeWho we are We're a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world's most powerful video understanding infrastructure to market. As a part of achieving this mission, we are...
-
San Jose, CA, United States Conductor Full timeWhat You’ll Do The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while...
-
Forward Deployed ML Engineer
3 weeks ago
San Francisco, CA, United States Baseten Full timeABOUT BASETEN We're a growing team of builders backed by top-tier investors including IVP, Spark Capital, and Sarah Guo at Conviction. ML teams at enterprises and category-defining AI-native companies like Descript, Bland, and Patreon use Baseten to power their core production workloads with best in class performance, security, and reliability. While...
-
Engineering Manager, Data and ML Infrastructure
2 weeks ago
San Francisco, United States Genai Works Full timeWe are advancing AI to power the future of medicine. At Unlearn, our purpose is to advance artificial intelligence (AI) to eliminate trial and error in medicine. We are innovating advanced machine learning methods to leveragegenerative AI in forecasting patient outcomes, starting with the domain ofclinical trials. We produce AI-generated digital twins of...
-
Principal Infrastructure Engineer
2 months ago
San Francisco, CA, United States Nextdata Technologies Inc Full timeThe company Decentralized data is the future. Data mesh is the right idea. We’re here to make it a reality. Nextdata OS is a data-mesh-native platform built to meet the challenge of decentralizing data at scale. We are inventing a new way for developers to work with data and share it responsibly via data product containers. Our vision is to build a...
-
Staff Engineer, ML Foundations
3 weeks ago
San Francisco, CA, United States Stripe Full timeWho we are About Stripe Stripe is a financial infrastructure platform for businesses. Millions of companies-from the world's largest enterprises to the most ambitious startups-use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount...
-
Engineering Manager, Data and ML Infrastructure
2 weeks ago
San Francisco, California, United States Unlearn Full timeWe are advancing AI to power the future of medicine.At Unlearn, our purpose is to advance artificial intelligence (AI) to eliminate trial and error in medicine. We are innovating advanced machine learning methods to leverage generative AI in forecasting patient outcomes, starting with the domain of clinical trials. We produce AI-generated digital twins of...