Current jobs related to Software Engineer, Inference - San Francisco, California - Anthropic
-
Software Engineer, Model Inference Specialist
4 weeks ago
San Francisco, California, United States OpenAI Full timeKey Role: We're seeking a skilled Software Engineer to join our team at OpenAI and contribute to the development of our critical inference infrastructure.About the Job: As an Inference Infrastructure Engineer, you will work alongside machine learning researchers, engineers, and product managers to bring our latest technologies into production. Your primary...
-
Software Engineer, Inference
4 weeks ago
San Francisco, California, United States Anthropic Full timeAbout AnthropicAnthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.About the role:Our...
-
Senior Inference Systems Engineer
4 days ago
San Francisco, California, United States Genmo Inc. Full timeAt Genmo Inc., we are a research lab dedicated to building state-of-the-art models for video generation. Our goal is to unlock the potential of Artificial General Intelligence (AGI).Job OverviewWe are seeking a senior/staff software engineer to join our inference team. This role involves designing and scaling our inference systems to support millions of...
-
AI Inference Systems Engineer
4 weeks ago
San Francisco, California, United States Perplexity AI Full timeWe are seeking an experienced AI Inference Systems Engineer to join our growing team at Perplexity AI. Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes, providing a unique opportunity to work on large-scale deployment of machine learning models for real-time inference.Key Responsibilities:Develop APIs for AI inference that will be used by...
-
Senior Inference Optimization Engineer
4 weeks ago
San Francisco, California, United States Liquid AI Full timeAt Liquid AI, we're seeking a highly skilled engineer to optimize inference stacks tailored to various hardware platforms.The ideal candidate has extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures.They should be self-motivated, capable of working independently, and driven by a passion for...
-
AI Inference Deployment Specialist
6 days ago
San Francisco, California, United States Tbwa ChiatDay Inc Full timeWe are seeking an experienced AI Inference Deployment Specialist to join our team at Skild AI. As a key member of our robotics team, you will be responsible for deploying cutting-edge AI models and optimizing their performance in real-world environments.Role OverviewIn this role, you will work closely with our cross-functional team to design and develop...
-
Backend Software Engineer
4 weeks ago
San Francisco, California, United States Hyperbolic Labs Full timeAbout Us:At Hyperbolic Labs, we're on a mission to democratize AI by leveraging idle computing resources worldwide. Our Open-Access AI Cloud offers an innovative GPU marketplace and AI inference service, making AI more accessible, affordable, and secure for all.We're a team of pioneers at the intersection of AI and open-source technology, driven by a passion...
-
Senior Data Scientist, Causal Inference Expert
4 weeks ago
San Francisco, California, United States Discord Full timeAt Discord, we're revolutionizing the way people connect and engage with each other through gaming and shared interests. Our Experimentation Platform plays a vital role in driving business decisions and growth, and we're seeking an experienced Senior Data Scientist to join our team.The ideal candidate will have a strong background in causal inference and...
-
Senior Java Software Engineer
4 weeks ago
San Francisco, California, United States Triunity Software Full timeJob Title: Senior Java Software EngineerWe are seeking a highly skilled Senior Java Software Engineer to join our team at Triunity Software.Key Responsibilities:* Design, develop, and test complex software applications using Java* Collaborate with cross-functional teams to identify and prioritize project requirements* Develop and maintain high-quality,...
-
Senior Software Engineer for AI Infrastructure
4 weeks ago
San Francisco, California, United States Crusoe Full timeAbout the RoleAs a Senior/Staff Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform.You will lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling...
-
Senior Data Scientist, Causal Inference Expert
4 weeks ago
San Francisco, California, United States Discord Full timeRole OverviewWe are seeking an experienced Senior Data Scientist to join our Experimentation Platform team at Discord. As a Causal Inference expert, you will play a crucial role in ensuring the statistical underpinnings of our platform rebuild are sound, and experimenters can design experiments with high rigor.Key ResponsibilitiesProvide statistical...
-
Senior Data Scientist, Causal Inference Expert
4 weeks ago
San Francisco, California, United States Discord Full timeRole OverviewWe are seeking an experienced Senior Data Scientist to join our Experimentation Platform team at Discord. As a Causal Inference expert, you will play a critical role in ensuring the statistical underpinnings of our platform rebuild are sound, and that experimenters can design experiments with high rigor.Our team directly impacts the strategy and...
-
Java Software Engineer
4 weeks ago
San Francisco, California, United States Triunity Software Full timeJob Title : Java Developer Focused on Core Java Spring/Spring Boot/Spring BatchAt Triunity Software, we are seeking a skilled Java Developer to join our team. As a Java Developer, you will be responsible for designing, developing, testing, and deploying Java-based software applications using the Java Spring and Spring Batch frameworks.Key Responsibilities:...
-
Staff AI Infrastructure Engineer
4 weeks ago
San Francisco, California, United States Genmo Full timeRole OverviewWe are seeking a senior software engineer to join our inference team at Genmo, a research lab dedicated to building open, state-of-the-art models for video generation. The successful candidate will be responsible for designing and scaling our inference systems to support millions of users across multiple data centers.Key ResponsibilitiesDevelop...
-
Software Engineer
1 month ago
San Francisco, California, United States OpenAI Full timeAbout the Applied TeamThe Applied team at OpenAI is dedicated to safely bringing cutting-edge technology to the world. Our team has released groundbreaking products such as ChatGPT, Plugins, DALL·E, and APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also manage large-scale inference infrastructure.Our customers create fast-growing businesses using...
-
San Francisco, California, United States Snap Full timeJob Description:At Snap, we're looking for a skilled Software Engineering Manager to join our Model Serving Infrastructure Team. As a key member of our team, you'll be responsible for evaluating, testing, and debugging your work to ensure high-quality results. Key Responsibilities:Evaluate, test, and debug your work to ensure high-quality results.Experience...
-
Software Engineer, Machine Learning
4 weeks ago
San Francisco, California, United States Unreal Gigs Full timeJob Title: Software Engineer, Machine LearningUnreal Gigs is seeking a highly skilled Software Engineer, Machine Learning to join our team. As a key member of our research team, you will work closely with a senior member to develop cutting-edge deep learning projects, infrastructure, and tooling.Responsibilities:Implement research-based improvements to...
-
San Francisco, California, United States Snapchat Full timeAt Snap Inc., we're looking for a talented Staff Software Engineer to join our team and help us build innovative products that improve the way people live and communicate. As a key member of our ML Infrastructure team, you'll design, implement, and operate our most critical ML Inference / Feature Store services that power Snapchat's recommendation systems...
-
Senior Embedded Software Engineer
4 weeks ago
San Francisco, California, United States MoTek Technologies Full timeSenior Embedded Software Engineer - Computer Vision ExpertWe are seeking a highly skilled Senior Embedded Software Engineer to develop and deploy advanced robot perception systems on integrated hardware. You will collaborate with a team of leading researchers and engineers in robotics and AI to build the next generation of robotics vision systems. The role...
-
Staff Software Engineer
4 weeks ago
San Francisco, California, United States Aurora Innovation Full timeAbout the RoleAurora Innovation is seeking a highly skilled Staff Software Engineer to join our team in the development of our self-driving technology. As a Staff Software Engineer, you will be responsible for improving our dataset quality by establishing semi-automated evaluation mechanisms leveraging state-of-the-art models as well as RLHF techniques.Key...
Software Engineer, Inference
1 month ago
Anthropic is a public benefit corporation headquartered in San Francisco, dedicated to creating reliable, interpretable, and steerable AI systems. Our mission is to develop AI that is safe and beneficial for users and society as a whole.
Job DescriptionWe are seeking a skilled Software Engineer, Inference to join our Inference team. As a key member of this team, you will play a crucial role in building the service that generates outputs from our models in production. This service is the backbone of our efficiency, latency, and reliability.
Your primary responsibilities will include:
- Improving the efficiency and performance of our inference service
- Developing and implementing distributed systems solutions across our stack
- Collaborating with the team to design and implement new features and improvements
- Ensuring the reliability and scalability of our inference service
To succeed in this role, you will need:
- Significant software engineering experience
- A results-oriented approach with a bias towards flexibility and impact
- The ability to pick up slack and take on new challenges
- Experience with high-performance, large-scale distributed systems
- Knowledge of machine learning and AI systems
We offer a competitive compensation package, including salary, equity, and benefits. Our benefits include comprehensive health insurance, 401(k) plan, paid parental leave, and unlimited PTO. We also offer relocation support and a stipend for education and home office improvements.
At Anthropic, we value impact and collaboration. We believe that the highest-impact AI research will be big science, and we work as a single cohesive team on just a few large-scale research efforts. We strive to create a diverse and inclusive team and encourage applications from underrepresented groups.