Current jobs related to Software Engineer, Inference - San Francisco, California - Anthropic


  • San Francisco, California, United States OpenAI Full time

    Key Role: We're seeking a skilled Software Engineer to join our team at OpenAI and contribute to the development of our critical inference infrastructure.About the Job: As an Inference Infrastructure Engineer, you will work alongside machine learning researchers, engineers, and product managers to bring our latest technologies into production. Your primary...


  • San Francisco, California, United States Anthropic Full time

    About AnthropicAnthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.About the role:Our...


  • San Francisco, California, United States Genmo Inc. Full time

    At Genmo Inc., we are a research lab dedicated to building state-of-the-art models for video generation. Our goal is to unlock the potential of Artificial General Intelligence (AGI).Job OverviewWe are seeking a senior/staff software engineer to join our inference team. This role involves designing and scaling our inference systems to support millions of...


  • San Francisco, California, United States Perplexity AI Full time

    We are seeking an experienced AI Inference Systems Engineer to join our growing team at Perplexity AI. Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes, providing a unique opportunity to work on large-scale deployment of machine learning models for real-time inference.Key Responsibilities:Develop APIs for AI inference that will be used by...


  • San Francisco, California, United States Liquid AI Full time

    At Liquid AI, we're seeking a highly skilled engineer to optimize inference stacks tailored to various hardware platforms.The ideal candidate has extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures.They should be self-motivated, capable of working independently, and driven by a passion for...


  • San Francisco, California, United States Tbwa ChiatDay Inc Full time

    We are seeking an experienced AI Inference Deployment Specialist to join our team at Skild AI. As a key member of our robotics team, you will be responsible for deploying cutting-edge AI models and optimizing their performance in real-world environments.Role OverviewIn this role, you will work closely with our cross-functional team to design and develop...


  • San Francisco, California, United States Hyperbolic Labs Full time

    About Us:At Hyperbolic Labs, we're on a mission to democratize AI by leveraging idle computing resources worldwide. Our Open-Access AI Cloud offers an innovative GPU marketplace and AI inference service, making AI more accessible, affordable, and secure for all.We're a team of pioneers at the intersection of AI and open-source technology, driven by a passion...


  • San Francisco, California, United States Discord Full time

    At Discord, we're revolutionizing the way people connect and engage with each other through gaming and shared interests. Our Experimentation Platform plays a vital role in driving business decisions and growth, and we're seeking an experienced Senior Data Scientist to join our team.The ideal candidate will have a strong background in causal inference and...


  • San Francisco, California, United States Triunity Software Full time

    Job Title: Senior Java Software EngineerWe are seeking a highly skilled Senior Java Software Engineer to join our team at Triunity Software.Key Responsibilities:* Design, develop, and test complex software applications using Java* Collaborate with cross-functional teams to identify and prioritize project requirements* Develop and maintain high-quality,...


  • San Francisco, California, United States Crusoe Full time

    About the RoleAs a Senior/Staff Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform.You will lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling...


  • San Francisco, California, United States Discord Full time

    Role OverviewWe are seeking an experienced Senior Data Scientist to join our Experimentation Platform team at Discord. As a Causal Inference expert, you will play a crucial role in ensuring the statistical underpinnings of our platform rebuild are sound, and experimenters can design experiments with high rigor.Key ResponsibilitiesProvide statistical...


  • San Francisco, California, United States Discord Full time

    Role OverviewWe are seeking an experienced Senior Data Scientist to join our Experimentation Platform team at Discord. As a Causal Inference expert, you will play a critical role in ensuring the statistical underpinnings of our platform rebuild are sound, and that experimenters can design experiments with high rigor.Our team directly impacts the strategy and...


  • San Francisco, California, United States Triunity Software Full time

    Job Title : Java Developer Focused on Core Java Spring/Spring Boot/Spring BatchAt Triunity Software, we are seeking a skilled Java Developer to join our team. As a Java Developer, you will be responsible for designing, developing, testing, and deploying Java-based software applications using the Java Spring and Spring Batch frameworks.Key Responsibilities:...


  • San Francisco, California, United States Genmo Full time

    Role OverviewWe are seeking a senior software engineer to join our inference team at Genmo, a research lab dedicated to building open, state-of-the-art models for video generation. The successful candidate will be responsible for designing and scaling our inference systems to support millions of users across multiple data centers.Key ResponsibilitiesDevelop...

  • Software Engineer

    1 month ago


    San Francisco, California, United States OpenAI Full time

    About the Applied TeamThe Applied team at OpenAI is dedicated to safely bringing cutting-edge technology to the world. Our team has released groundbreaking products such as ChatGPT, Plugins, DALL·E, and APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also manage large-scale inference infrastructure.Our customers create fast-growing businesses using...


  • San Francisco, California, United States Snap Full time

    Job Description:At Snap, we're looking for a skilled Software Engineering Manager to join our Model Serving Infrastructure Team. As a key member of our team, you'll be responsible for evaluating, testing, and debugging your work to ensure high-quality results. Key Responsibilities:Evaluate, test, and debug your work to ensure high-quality results.Experience...


  • San Francisco, California, United States Unreal Gigs Full time

    Job Title: Software Engineer, Machine LearningUnreal Gigs is seeking a highly skilled Software Engineer, Machine Learning to join our team. As a key member of our research team, you will work closely with a senior member to develop cutting-edge deep learning projects, infrastructure, and tooling.Responsibilities:Implement research-based improvements to...


  • San Francisco, California, United States Snapchat Full time

    At Snap Inc., we're looking for a talented Staff Software Engineer to join our team and help us build innovative products that improve the way people live and communicate. As a key member of our ML Infrastructure team, you'll design, implement, and operate our most critical ML Inference / Feature Store services that power Snapchat's recommendation systems...


  • San Francisco, California, United States MoTek Technologies Full time

    Senior Embedded Software Engineer - Computer Vision ExpertWe are seeking a highly skilled Senior Embedded Software Engineer to develop and deploy advanced robot perception systems on integrated hardware. You will collaborate with a team of leading researchers and engineers in robotics and AI to build the next generation of robotics vision systems. The role...


  • San Francisco, California, United States Aurora Innovation Full time

    About the RoleAurora Innovation is seeking a highly skilled Staff Software Engineer to join our team in the development of our self-driving technology. As a Staff Software Engineer, you will be responsible for improving our dataset quality by establishing semi-automated evaluation mechanisms leveraging state-of-the-art models as well as RLHF techniques.Key...

Software Engineer, Inference

1 month ago


San Francisco, California, United States Anthropic Full time
About Anthropic

Anthropic is a public benefit corporation headquartered in San Francisco, dedicated to creating reliable, interpretable, and steerable AI systems. Our mission is to develop AI that is safe and beneficial for users and society as a whole.

Job Description

We are seeking a skilled Software Engineer, Inference to join our Inference team. As a key member of this team, you will play a crucial role in building the service that generates outputs from our models in production. This service is the backbone of our efficiency, latency, and reliability.

Your primary responsibilities will include:

  • Improving the efficiency and performance of our inference service
  • Developing and implementing distributed systems solutions across our stack
  • Collaborating with the team to design and implement new features and improvements
  • Ensuring the reliability and scalability of our inference service

To succeed in this role, you will need:

  • Significant software engineering experience
  • A results-oriented approach with a bias towards flexibility and impact
  • The ability to pick up slack and take on new challenges
  • Experience with high-performance, large-scale distributed systems
  • Knowledge of machine learning and AI systems

We offer a competitive compensation package, including salary, equity, and benefits. Our benefits include comprehensive health insurance, 401(k) plan, paid parental leave, and unlimited PTO. We also offer relocation support and a stipend for education and home office improvements.

At Anthropic, we value impact and collaboration. We believe that the highest-impact AI research will be big science, and we work as a single cohesive team on just a few large-scale research efforts. We strive to create a diverse and inclusive team and encourage applications from underrepresented groups.