Machine Learning Engineer

2 weeks ago


San Francisco, California, United States Inception Full time
About Us Inception is a generative AI startup. Leveraging breakthrough AI research, we are training next-generation large language models (LLM) powered by diffusion. Unlike existing auto-regressive models, which only output one token at a time, diffusion LLMs can output many tokens in parallel. This means that they are several times faster and can leverage their additional test-time compute to improve quality. They also enable fine-grained control over their outputs to adhere to specific schema and semantic constraints, and they provide a unified paradigm for combining language with other data modalities, including audio, images, and videos.
Our team is led by Stefano Ermon (co-inventor of diffusion models, flash attention, and DPO; faculty at Stanford), Aditya Grover (co-inventor of node2vec and decision transformers; faculty at UCLA), and Volodymyr Kuleshov (prev. co-founder and CTO at Afresh Technologies; faculty at Cornell), and includes engineers from Google Deepmind, Meta AI, Microsoft AI, and OpenAI. We are currently deploying large-scale diffusion LLMs at Fortune 500 companies.
Role Overview We seek experienced Machine Learning Engineers passionate about bringing cutting-edge AI to production. In this role, you will bridge the gap between research and real-world applications, working to train and deploy our diffusion large language models while collaborating with a cross-functional team of researchers and engineers. You'll be instrumental in building our core product offerings while ensuring models perform reliably at scale in production environments.
Key ResponsibilitiesDesign, develop, and optimize LLM architectures and models.Partner with customers to understand their use cases and translate business requirements into technical solutionsImplement innovative approaches for training, fine-tuning, and scaling generative AI models.Work on data preprocessing pipelines, model evaluation, and alignment to enterprise use cases.Contribute to the deployment and maintenance of models in production environments.Collaborate with product teams to design and implement customer-facing ML features
QualificationsBS/MS/PhD in Computer Science, Machine Learning, or related field (or equivalent experience)At least 2 years of experience working on ML projects in PyTorch (or equivalent DL framework), preferably in a research lab or engineering role.Excellent familiarity with transformers and fundamental LLM concepts (e.g., autoregressive pretraining, instruction tuning, in-context learning, LoRA, and KV caching).Experience with training LLM, including fine-tuning.Familiarity with large-scale systems and high-performance computing, including GPU/TPU utilization.Experience with version control (Git) and containerization (Docker).Excellent communication skills with the ability to explain technical concepts to non-technical stakeholders
Preferred SkillsExpertise in data engineering and synthetic data generation for LLMs.Knowledge of MLOps and production-level deployment workflows.Experience with LLMs serving frameworks like vLLM, SGLang, or TensorRT.Experience with cloud platforms (AWS, GCP, Azure)Experience with model quantization and optimization techniques
Why Join UsImpact: Deploy LLMs that transform how millions of users work, create, and solve real-world problems.Innovation: Pioneer novel architectures and training techniques for diffusion LLMs.Growth: Enjoy a fast-paced, collaborative environment where your contributions will directly shape the future of generative AI.
Perks & BenefitsCompetitive salary and equity in a rapidly growing startup.Flexible vacation and paid time off (PTO).Health, dental, and vision insurance.Professional development opportunities (conferences, courses, etc.).
This is an exciting opportunity to join a startup at the forefront of AI development If you're ready to make a tangible impact in the world of generative AI, apply today.
We are an equal opportunity employer and encourage candidates of all backgrounds to apply.

  • San Francisco, California, United States Delty Full time

    About UsDelty is building the healthcare's AI operating system. We create voice-based and computer-based assistants that streamline clinical workflows, reduce administrative burden, and help providers focus on patient care. Our system learns from real healthcare environments to deliver reliable, context-aware support that improves efficiency and elevates the...


  • San Francisco, California, United States Ema Full time $135,000 - $200,000 per year

    Who We AreEma is building the next generation AI technology to empower every employee in the enterprise to be their most creative and productive. Our proprietary tech allows enterprises to delegate most repetitive tasks to Ema, the AI employee. We are founded by ex-Google, Coinbase, Okta executives and serial entrepreneurs. We've raised capital from notable...


  • San Francisco, California, United States Pivotal Solutions Full time

    ResponsibilitiesCollaborate with global teams to deliver high -impact data products for worldwide deployment.Develop and maintain ML pipelines to optimize critical processes for global lending products, including anti -fraud systems, credit strategy, and marketing optimization.Enhance and maintain scalable machine learning infrastructure to support robust...


  • San Francisco, California, United States Taskrabbit Full time $148,000 - $200,000

    About Taskrabbit:Taskrabbit is a marketplace platform that conveniently connects people with Taskers to handle everyday home to-do's, such as furniture assembly, handyman work, moving help, and much more.At Taskrabbit, we want to transform lives one task at a time. As a company we celebrate innovation, inclusion and hard work. Our culture is collaborative,...


  • San Francisco, California, United States Apple Full time

    Do you think Computer Vision and Machine Learning can change the world? Do you think it can transform the way millions of people collect, discover and share the most special moments of their lives? We truly believe it can The System Intelligence Machine Learning (SIML) organization is looking for a Machine Learning Research Engineer with a strong foundation...


  • San Francisco, California, United States University of California - San Francisco Full time

    The Machine Learning and Data engineer role will lead the development, implementation, and maintenance of data pipelines and infrastructure to support the deployment and continuous monitoring of Machine Learning (ML) and generative Artificial Intelligence (AI) tools within UCSF's APeX Enabled Research (AER) team. Most projects will be in partnership with...


  • San Francisco, California, United States Block Full time

    It all started with an idea at Block in 2013. Initially built to take the pain out of peer-to-peer payments, Cash App has gone from a simple product with a single purpose to a dynamic ecosystem, developing unique financial products, including Afterpay/Clearpay, to provide a better way to send, spend, invest, borrow and save to our 50+ million monthly active...


  • San Francisco, California, United States Philo Full time

    At Philo, we're a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented — in short, we're building the TV experience that we've always wanted for ourselves. In practice this means leveraging cloud delivery, modern tech stacks, machine...


  • San Francisco, California, United States Baselayer Full time

    About Baselayer:Trusted by 2,200+ financial institutions, Baselayer is the intelligent business identity platform that helps verify any business, automate KYB, and monitor real-time risk. Baselayer's B2B risk solutions & identity graph network leverage state & federal government filings and proprietary data sources to prevent fraud, accelerate onboarding,...


  • San Francisco, California, United States Sephora Full time

    Job ID: 278398Location Name: CA-FSC SF Off (0174)Address: 350 Mission St, 20th Floor, San Francisco, CA 94105, United States (US)Job Type:Position Type: RegularJob Function: Information TechnologyRemote Eligible:Hybrid ScheduleCompany Overview:At Sephora we inspire our customers, empower our teams, and help them become the best versions of themselves. We...