Cloud-Based LLM APIs Developer

1 week ago


San Francisco, California, United States ZipRecruiter Full time
The Role
We are seeking a Staff or Senior Full Stack Engineer to join our growing team. As a key member of our team, you will help us build internal UI products to support LLM Model orchestration, evaluation, and dataset curation. You will work closely with our Applied AI and product teams to build innovative ML-powered solutions to push the state of the art in Healthcare AI technology.

Responsibilities
• Design, build, and maintain highly scalable cloud-based services using TypeScript, Python, and React
• Create tooling to support the ML model lifecycle, including model evaluations, dataset curation, and training infrastructure
• Build scalable infrastructure for LLM model orchestration, powered by an intuitive user experience
• Make architecture and technology decisions that align with business needs, balancing innovation with security and reliability
• Enhance our platform's performance and scalability, focusing on creating seamless user experiences
• Work cross-functionally with teams like product, machine learning, and design to ensure the success of new features from conception to deployment

What You'll Bring
• 5+ years of experience building and scaling distributed systems using modern technologies like TypeScript, Python, React, and Node.js
• Hands-on experience integrating and working with Large Models (LLMs) and APIs, with familiarity in fine-tuning and prompt engineering
• Proficiency in working with cloud services (AWS, GCP), infrastructure as code (Terraform), and containerization (Docker, Kubernetes)
• Proven experience leading engineering teams or owning major software components from development to production
• A background in optimizing and scaling inference systems for LLMs or similar AI-based systems is highly desirable

Our Offerings
• Estimated salary range: $185,000 - $265,000 per year, depending on location, experience, skills, qualifications, and other job-related factors
• Participation in a company stock option plan as part of the total compensation package
• Relocation assistance available for candidates willing to move to San Francisco
• A flexible hybrid work model, with a minimum of 3 days per week spent in the office
• Up to 10% travel required for occasional business meetings and events

Life at Abridge
Abridge is an experiment in alchemy, transforming the healthcare industry through AI. Our culture is founded on doing things the 'inverse' way – focusing on patients, outcomes, and end-user experiences instead of legacy systems. We believe in strong ideas loosely held and prioritize growth and knowledge-sharing among team members. Join us in pushing the boundaries of Healthcare AI technology

  • San Francisco, California, United States ShiftCode Analytics Full time

    Cloud-Based API Development SpecialistShiftCode Analytics is seeking a skilled Cloud-Based API Development Specialist to join our team. In this role, you will design, develop, and deploy cloud-based APIs using MuleSoft.About the Position:Design and develop cloud-based APIs using MuleSoft.Collaborate with clients to understand their business requirements and...

  • LLM Agent Developer

    1 week ago


    San Francisco, California, United States Letta Full time

    Join Our MissionLetta is a cutting-edge AI startup dedicated to empowering developers with powerful LLM agents. We're looking for an exceptional LLM Agent Developer to join our founding team and help shape the future of LLM technology. In this role, you'll be responsible for developing our open-source Agents API standard and contributing to the growth of our...


  • San Francisco, California, United States Kapwing Full time

    Transformative Video Creation ExperienceKapwing is at the forefront of modernizing content creation tools for cloud-based video editors. We're on a mission to empower creators by making video editing faster, more accessible, and collaborative.Our next-generation platform aims to democratize creative tools and shape the future of video creation. To achieve...


  • San Francisco, California, United States Letta Cloud Full time

    About Letta CloudAs a pioneering technology company, Letta Cloud is revolutionizing the field of Large Language Models (LLMs) with its innovative approach. Our team comes from a renowned research lab and PhD advisors at Berkeley, where they produced influential projects like Spark and Ray. This collective expertise enables us to develop cutting-edge LLM...


  • San Francisco, California, United States Letta Cloud Full time

    Company OverviewWe are Letta Cloud, a pioneering startup revolutionizing the field of Large Language Model (LLM) technology. Founded by experts from the renowned research lab at Berkeley, we have deep expertise in both AI and systems.Our mission is to empower developers to build state-of-the-art LLM agents that power their applications. We're currently...


  • San Francisco, California, United States AccrueTalent Full time

    Job SummaryWe're looking for a talented Full Stack Engineer to join our team in San Francisco.This is an exciting opportunity to work with cutting-edge technologies like AI and LLMs, and make a real impact on our company's growth.Key ResponsibilitiesDevelop scalable, high-performance solutions using Python, JavaScript, React, React Native, Flask, PSQL, and...


  • San Francisco, California, United States Kapwing Full time

    OverviewThe company Kapwing is revolutionizing video editing by moving it to the cloud. Our mission is to make content creation fast, accessible, and collaborative. As a talented Full Stack Software Engineer, you will join our Repurpose team and build new creative applications with LLMs and generative AI to make video creation easier. This role offers a...


  • San Francisco, California, United States ZipRecruiter Full time

    Unlock Deeper Understanding in Healthcare with AbridgeAbridge is a pioneering organization that empowers healthcare professionals to make informed decisions through AI-powered clinical documentation. As a Staff or Senior Full Stack Engineer, you'll join our mission-driven team to build innovative solutions for Large Model (LLM) orchestration and...


  • San Francisco, California, United States Letta Cloud Full time

    About Letta CloudWe are a pioneering company that emerged from the MemGPT project with over 12,000 GitHub stars. Our founding team comes from the renowned research lab at Berkeley, where they worked alongside PhD advisors who produced Spark and Ray. With deep expertise in both AI and systems, we are now hiring exceptional engineers to join us in developing...


  • San Francisco, California, United States Abridge Al, Inc Full time

    Job DescriptionThe role involves creating tooling to support the ML model lifecycle, including model evaluations, dataset curation, and training infrastructure. You will also build scalable infrastructure for LLM model orchestration powered by an intuitive and easy-to-use user experience.You will work closely with our Applied AI and product teams to build...


  • San Francisco, California, United States ZipRecruiter Full time

    Empower Deeper Understanding in HealthcareWe are seeking a skilled Cloud-Based Healthcare AI Engineer to join our growing team. As a key member of our engineering team, you will play a crucial role in designing and building innovative ML-powered solutions to transform the healthcare industry.About the RoleYou will work closely with our Applied AI and product...


  • San Francisco, California, United States Abridge Full time

    About the RoleWe're looking for a highly skilled Full Stack Engineer to join our team at Abridge. As a Staff Software Engineer, you will be responsible for developing and maintaining cloud-based services using modern technologies like TypeScript, Python, and React. You will work closely with our Applied AI and product teams to build innovative ML-powered...


  • San Francisco, California, United States Kapwing Full time

    About KapwingKapwing is a leading provider of cloud-based video editing solutions. Our platform empowers creators to produce high-quality content efficiently and effectively.Salary: $125,000 - $185,000 per yearJob Overview:The Repurpose team at Kapwing is seeking a skilled full-stack software engineer to help drive innovation in AI-powered video creation. As...

  • Cloud Engineer

    3 days ago


    San Francisco, California, United States Abridge Al, Inc Full time

    Transforming Healthcare with AI: Join Our TeamWe are Abridge Al, Inc., a mission-driven organization committed to powering deeper understanding in healthcare through AI. As a leading innovator in the field, we are seeking an experienced Cloud Engineer to join our team in San Francisco or New York City.The successful candidate will have 5+ years of experience...


  • San Francisco, California, United States Crusoe Full time

    About CrusoeCrusoe is a pioneering company in the field of AI-first cloud infrastructure. Our mission is to align the future of computing with the future of the climate, creating a sustainable and reliable platform for businesses to power their most advanced AI applications.Job SummaryWe are seeking an experienced API Integration Developer to join our team...


  • San Francisco, California, United States Infinitus LLC Full time

    About the Role:We are seeking a Senior Engineer to join our platform engineering team at Infinitus LLC, a leading provider of AI solutions for healthcare. As a Senior Engineer, you will play a key role in developing cutting-edge end-to-end solutions using Large Language Models (LLMs) and other approaches.Your primary responsibility will be designing and...


  • San Francisco, California, United States Kapwing Full time

    Company OverviewKapwing is a cloud-based video editing platform that enables creators to share their stories online. Our mission is to make content creation fast, accessible, and collaborative.SalaryThe estimated salary for this role is $160,000 - $200,000 per year, based on the location in San Francisco.Job DescriptionWe are looking for a talented software...


  • San Francisco, California, United States Kapwing Full time

    Career OpportunityWe are seeking a skilled software engineer to join our Repurpose team at Kapwing, a cloud-based video editing platform.Job SummaryThis role involves building new creative applications with LLMs and generative AI to make video creation faster and easier.Duties and ResponsibilitiesDevelop full-stack features (React, Node, and Python) for one...


  • San Francisco, California, United States Recruiting from Scratch Full time

    Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. We are seeking a Cloud Native Systems Architect to join our team. As a key member of our engineering team, you will be responsible for designing and implementing robust systems that power our AI-driven marketing platform.Our ideal candidate has a strong...


  • San Francisco, California, United States ZipRecruiter Full time

    Job Title: Cloud-Based AI EngineerWe are seeking a highly skilled Cloud-Based AI Engineer to join our team. As a Cloud-Based AI Engineer, you will be responsible for designing and implementing cloud-based AI solutions that meet the needs of our clients.Responsibilities:Design and implement scalable, secure, and high-performance architectures for our...