Senior Machine Learning Infrastructure Engineer

1 week ago


San Francisco, California, United States Cobalt AI, LLC Full time

About Us:

At Cobalt AI, we're advancing physical security through innovative AI-powered monitoring solutions. Our primary offering, Cobalt Monitoring Intelligence, builds on our established track record of protecting Fortune 1000 companies across more than 10 countries. We've effectively managed over 350 million alarm events and conducted 15 million hours of active patrols, resulting in a 90% noise reduction for our clients. We maintain a strong focus on both technological advancement and data protection, as demonstrated by our ongoing SOC2 compliance.

Job Description:

We're currently expanding our capabilities from hundreds of robots to hundreds of thousands of cameras, and we're looking for skilled professionals to join our growing team. We need individuals who can contribute to our early-stage architecture decisions and help drive our technological development. Our approach combines the analytical strengths of AI with human expertise to create more effective security solutions. This presents an opportunity to be part of significant advancements in how businesses and communities approach security.

Responsibilities:


  • Design and implement robust machine learning pipelines that process a large number of simultaneous video streams.
  • Improve the performance and reliability of our GPU workloads, on the edge and in the cloud.
  • Design and implement scalable infrastructure for deploying CUDA-enabled ML models to our expanding fleet of edge devices.
  • Discover, evaluate, and integrate cutting-edge computer vision models/algorithms to enable better decision making in time-sensitive contexts.
  • Critically assess and address networking and data storage security risks for the edge processors and their integration with our backend.
  • Lead a mix of senior and junior engineers by example, establishing and maintaining best practices in ML infrastructure development.
  • Lead design reviews and gather input from multiple teams to ensure high-quality outcomes.
  • Tackle challenging interdisciplinary problems within our deep tech stack.
  • Deploy changes that have an immediate impact on our product worldwide.


Requirements:


  • At least 5 years of experience in software engineering, with a focus on ML infrastructure design.
  • Experience leading teams or managing projects.
  • Demonstrated expertise in ML infrastructure development and best coding practices for Python, C++, Rust, or equivalent.
  • Experience maintaining pipeline stability and resilience using tools such as Datadog or equivalent.
  • Passionate about delivering high-quality products that exceed user expectations.
  • Eager to work with, mentor, and learn from peers through code reviews, design documents, and pair programming.
  • Enthusiastic about diving into different areas of technology and problem-solving.
  • Must be authorized to work in the United States.


Bonus Skills:


  • Experience with generative AI technologies.
  • Experience with event streams, such as Kinesis or equivalent.
  • Familiarity with the Nix package manager and its ecosystem.
  • Experience working in a fast-paced startup environment.
  • Hands-on experience with AWS and DevOps practices.
  • Expertise in networking and security protocols.
  • Experience building and scaling high-availability distributed systems.
  • Background as a full-stack engineer.


Salary Range in SVBA: $160k - $210k (actual compensation will be determined based on experience, location, and other factors permitted by law)

Cobalt AI is an equal employment opportunity employer and values diversity.
