Senior Software Engineer, AI Inference

2 months ago


Remote, Oregon, United States Deepgram Full time

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform including access to models for speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

Opportunity

We are seeking a backend engineer focused on AI inference to join the team powering Deepgram's core speech inference APIs. You'll implement and optimize inference code, experiment with cutting-edge technologies, and develop, maintain, and deploy the stack of services behind our blazing-fast, massive-throughput inference system. This role blends work on backend services and systems with domain specialty in neural networks and GPU programming. Our team owns the applications that serve and empowers builders of innovative speech products by focusing on a world-class combination of reliability, efficiency, and latency.


What You'll Do


  • Implement inference for novel model architectures developed by Deepgram's trailblazing research team
  • Develop, test, and deploy application code for massive-scale production services
  • Debug complex system issues that include networking, scheduling, and high-performance computing interactions
  • Build tooling for internal analysis and benchmarking to identify opportunities for efficiency improvements
  • Experiment with optimization techniques for ML workloads on NVIDIA GPUs and ship the key wins to prod


You'll Love This Role If You


  • Think of yourself as a generalist while enjoying learning deeply in specific areas, causing you to go from debugging a customer issue one day to designing an algorithm the next
  • Like sipping piña coladas and getting caught in the rain
  • Enjoy taking ownership of features from early collaborations with researchers through testing in production
  • Love getting nitty-gritty with profilers, hardware architectures, and inference algorithms
  • Want to work within the context of a humble, collaborative team that collectively owns mission-critical production services


It's Important to Us That You Have


  • The ability to work collaboratively in a fast-paced environment and adapt to changing priorities
  • Proven industry experience building and shipping production services
  • Strong confidence in a lower-level language like C, C++, or Rust
  • Experience slicing large projects or initiatives into smaller experiments or incremental improvements
  • Expertise in a ML framework like Torch or Tensorflow
  • Experience with GPU programming using tools like CUDA or libraries like cuDNN, cuBLAS, etc.


It Would Be Great If You Also Had


  • Extensive professional experience with Rust and C++
  • Experience optimizing ML workloads in production
  • Familiarity with GPU hardware architecture and its impact on inference pipelines

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.

We are happy to provide accommodations for applicants who need them.

Compensation Range: $165K - $220K


  • AI Software Engineer

    1 month ago


    Remote, Oregon, United States Zoom Video Communications, Inc. Full time

    AI Software EngineerZoom - Remote$136,800 a yearWhat you can expectAs an AI Engineer, you will collaborate to design, implement, and optimize AI algorithms and software applications. You will ensure AI training, inference, deployment, and operation are functional, reliable, and scalable in your role.About the TeamZoom is seeking a highly passionate AI...

  • Software Engineer

    2 months ago


    Remote, Oregon, United States AI Fund Full time

    Landing AI, led by globally recognized AI leader Dr. Andrew Ng, is building cutting-edge vision products. We are at the forefront of creating domain-specific large vision models (LVMs) and the application of large multi-model models (LMMs) to practical vision tasks. Our Visual Prompting technology takes the ideas of text prompting and adapts them to vision...

  • Software Engineer

    1 month ago


    Remote, Oregon, United States AI Fund Full time

    About LandingAI:LandingAI, led by globally recognized AI leader Dr. Andrew Ng, is building cutting-edge vision products. We are at the forefront of creating domain-specific large vision models (LVMs) and the application of large multi-model models (LMMs) to practical vision tasks. Our Visual Prompting technology takes the ideas of text prompting and adapts...


  • Remote, Oregon, United States Shield AI Full time

    Introduction to Shield AIFounded in 2015, Shield AI is a venture-backed defense technology company whose mission is to protect service members and civilians with intelligent systems. In pursuit of this mission, Shield AI is building the world's best AI pilot. Its AI pilot, Hivemind, has flown a fighter jet (F-16), a vertical takeoff and landing drone...


  • Remote, Oregon, United States Redflag AI Full time

    At Redflag, we develop software that is able to analyze every type of content used to communicate online (text, image, video, and audio) and has the capability to find any particular piece of content across the entire internet. In an ever expanding digital world, we strive to provide solutions that allow businesses and individuals to both protect their...


  • Remote, Oregon, United States Tech Firefly Full time

    DescriptionTech Firefly is teaming up with a deep learning hardware company to hire a Senior Software Engineer for their team. If you are an experienced full stack developer and have experience working for startups, please apply todayPosition: Full-TimeLocation: 100% RemoteResponsibilities:Develop user-friendly interfaces for our ML and AI cloud...


  • Remote, Oregon, United States Hugging Face Full time

    DescriptionHere at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over companies...


  • Remote, Oregon, United States Redflag AI Full time

    At Redflag, we develop software that is able to analyze every type of content used to communicate online (text, image, video, and audio) and has the capability to find any particular piece of content across the entire internet. In an ever expanding digital world, we strive to provide solutions that allow businesses and individuals to both protect their...


  • Remote, Oregon, United States Shield AI Full time

    Introduction to Shield AIFounded in 2015, Shield AI is a venture-backed defense technology company whose mission is to protect service members and civilians with intelligent systems. In pursuit of this mission, Shield AI is building the world's best AI pilot. Its AI pilot, Hivemind, has flown a fighter jet (F-16), a vertical takeoff and landing drone...

  • QA Engineer

    1 month ago


    Remote, Oregon, United States Gradient AI Full time

    (Senior) QA Engineer Gradient AI: Gradient AI is a leading provider of AI solutions for the Group Health and P&C insurance industries. Our solutions improve loss ratios and profitability by predicting underwriting and claim risks with greater accuracy, as well as reducing quote turnaround times and claim expenses through intelligent automation. Gradient...


  • Remote, Oregon, United States Edgecortix Full time

    IntroductionEdgeCortix is hiring for a staff field application engineer position to join our Tokyo/Kanagawa-based team and drive support of pre-sales and post-sales activities related to our artificial intelligence (AI) processor and AI acceleration software products. While located in Japan, you will be primarily involved in supporting customers locally,...


  • Remote, Oregon, United States Dotdash Meredith Full time

    Remote- In-office Expectations: This position is fully remote with no in-office requirements, (might require coming into an office 1 or 2x a year)Dotdash Meredith is looking for a Senior Software Engineer 1 to join our Search and Recommendations team. As part of the Search and Recommendations team, you'll be working on widely used components that help users...


  • Remote, Oregon, United States Manifold Full time

    Company OverviewManifold is an innovative AI-powered clinical research platform that simplifies the complex workflows of study and data management. Our mission is to empower researchers to conduct high-impact research efficiently, using fewer resources. We partner with research organizations and cancer centers nationwide, significantly reducing the time...

  • Backend Engineer

    2 months ago


    Remote, Oregon, United States AI Fund Full time

    (Work Paths to Work Passion) is an AI Career Companion leveraging cutting-edge technology and artificial intelligence to help professionals define, plan, and find their best careers, their Ikigai. For the first time, Gen AI allows us to spearhead a mission dedicated to fostering not just better, but more fulfilling and highly productive careers. This...

  • Software Engineer

    2 months ago


    Remote, Oregon, United States Inspiren Full time

    About Inspiren Inspiren was created to help operators forge thriving senior living communities.We use a simple, streamlined platform that protects resident privacy, to optimize community operations at every step. Our technology puts residents first, capturing insights on everything from revenue leakage to staff utilization, while providing an extra layer of...


  • Remote, Oregon, United States Pachama Full time

    Who we are.Pachama is a mission-driven company looking to restore nature to help address climate change. Pachama brings the latest technology in remote sensing and AI to the world of forest carbon in order to enable forest conservation and restoration to scale. Pachama's core technology harnesses satellite imaging with artificial intelligence to measure...

  • Software Engineer

    23 hours ago


    Remote, Oregon, United States AppFolio, Inc Full time

    About AppFolio, Inc.We're a pioneering company in the cloud and AI space, delivering magical experiences that make our customers' lives easier. Our mission is to revolutionize the real estate industry by innovating and collaborating with passionate individuals like you.Job SummaryWe're seeking a highly skilled Software Engineer to join our dynamic and...


  • Remote, Oregon, United States Timescale Full time

    We're looking for experienced engineers with a backend developer background who are now passionate/excited about AI products and workflows. You will help us build and maintain our AI-related Python libraries and integrations, help define and design our DX (developer experience) for AI products, write demos and content around these products, and automate...


  • Remote, Oregon, United States Timescale Full time

    We are a data platform company that is aiming to develop new SaaS tools to help AI developers develop AI-based applications. Timescale is looking for an experienced Backend Engineer with a track record of building great SaaS and cloud services to help us develop these tools.This is a new product and thus it is an exciting opportunity to play a central role...


  • Remote, Oregon, United States Technergetics Full time

    Position: Principal of AI Research and Development Beware of fraudulent job offers and postings Technergetics will never extend an offer of employment without a thorough interview process involving face to face interviews either in-person or a virtual Teams meeting from an official Technergetics email address (). If you receive any correspondence from an...