On-device ML Engineer

2 months ago


New York, United States Hugging Face Full time
Job DescriptionJob Description

Here at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.

We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.

About the Role

As an On-device ML Engineer, you will explore cutting edge methods to run models on consumer platforms, with a special focus on Apple technologies. Your responsibilities will include optimizing, quantizing, and converting the best models for efficient execution on iPhones and Macs. Additionally, you will design, build, and contribute to open source software that demonstrates model usage and develop libraries to minimize friction for developers who may not be deeply familiar with ML. Beyond the technical challenges, your goal will be to disseminate these methods, facilitate their adoption, and create tools for the community.

Day-to-day tasks may include the following:

  • Model evaluation, considering quality, latency, memory, and storage needs. You understand the best model for a task may not be the latest SOTA, but the one with the best trade-off.
  • Strive to make SOTA models work efficiently on Apple platforms by converting them to native formats like Core ML or MLX, enabling execution on GPUs and the Neural Engine.
  • Dive into large codebases, such as Transformers, to optimize model architectures for Apple Silicon platforms, debug issues, and develop workarounds.
  • Write Swift code to implement or optimize ML tasks, including pre-and post-processing pipelines.
  • Produce high-quality technical documentation, including blog posts, tutorials, guides, social media threads, and concise demo apps.
  • Contribute to open source projects, like coremltools, to improve coverage of PyTorch operations.
  • Create tools that enable developers to convert, run, and share models easily, making it straightforward for researchers and practitioners to distribute models in device-friendly formats.
  • Occasionally, write or be ready to understand low-level code such as parallel GPU kernels.

About you

You’ll thrive in this position if you are:

  • Experienced Swift Developer: Have a strong background in Swift development with a practical, builder mindset and a good sense of software and application design.
  • Passionate About ML: Have a deep understanding of essential model architectures and a passion for machine learning.
  • Core ML Proficiency: Have experience using Core ML and understand its advantages and limitations.
  • Open Source Contributor: Are eager to publish and contribute to open-source libraries to help developers adopt ML.
  • Versatile Engineer: Can move across different levels of abstraction as needed, from UI to Metal kernels.
  • Readable Code: Write code that is easy to understand but are also prepared to make critical path ugly for optimization’s sake. (But just the critical path, please

  • On-device ML Engineer

    3 months ago


    New York, United States Hugging Face Full time

    Job DescriptionJob DescriptionHere at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on...


  • New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionWe are seeking a Director of Engineering with balanced expertise in Machine Learning (ML)/ML Operations (MLOps) and core software engineering to spearhead our engineering initiatives for an innovative web application product. This role demands a leader who not only has a profound technical grounding in both ML/MLOps and software...

  • ML Engineer

    5 days ago


    New York, United States Motion Recruitment Full time

    **Job Opportunity: Machine Learning Engineer** **Overview:** Join our dynamic team as a Machine Learning Engineer, where you'll drive innovation in algorithmic approaches for signals intelligence. This role requires an in-person presence and offers a chance to significantly impact our product evolution. **Responsibilities:** - Design and execute...

  • Lead Product Engineer

    2 months ago


    New York, United States Fusemachines Full time

    About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...

  • Lead Product Engineer

    3 months ago


    New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...

  • AI/ML, NLP Engineer

    2 weeks ago


    New York, United States Action Tech Full time

    This opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!AVP or VP level role available!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior...

  • AI/ML, NLP Engineer

    1 month ago


    New York, United States Action Tech Full time

    This opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!AVP or VP level role available!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior...

  • Senior ML Engineer

    2 months ago


    New York, New York, United States Hinge Full time

    Hinge is the dating app designed to be deletedIn today's digital world, finding genuine relationships is tougher than ever. At Hinge, we're on a mission to inspire intimate connection to create a less lonely world. We're obsessed with understanding our users' behaviors to help them find love, and our success is defined by one simple metric– setting up...

  • Senior ML Engineer

    2 months ago


    New York, United States Hinge Full time

    Hinge is the dating app designed to be deleted In today's digital world, finding genuine relationships is tougher than ever. At Hinge, we’re on a mission to inspire intimate connection to create a less lonely world. We’re obsessed with understanding our users’ behaviors to help them find love, and our success is defined by one simple metric–...

  • Lead Product Engineer

    2 months ago


    New York City, United States Fusemachines Full time

    About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...


  • New York, New York, United States BJAK Full time

    DescriptionAbout UsRealy is at the forefront of the virtual reality revolution, creating an extended reality social media application built on a cutting-edge spatial computing platform. Our mission is to enable users to share their recorded and live content in a fully immersive 3D experience, capturing memories in the best possible way.Realy is seeking a...

  • NLP Engineer

    5 days ago


    New York, New York, United States Action Tech Full time

    This role is a hybrid position requiring onsite presence for four days a week.Candidates must be US Citizens or Green Card holders and currently reside in the tri-state area.Available positions include AVP or VP level.Position OverviewThe AI/ML division is at the forefront of creating innovative solutions that provide a distinct competitive advantage for...

  • ML Engineer

    2 months ago


    New York, United States Sixfold Full time

    Job DescriptionJob DescriptionWho we are:At Sixfold, we’re giving businesses the benefits of AI-powered decision-making. Our AI handles the leg work while humans spend their time on things they do best. Our product is the world’s first artificial intelligence trained to solve the hardest problems in the insurance industry. Underwriters work with Sixfold...


  • New York, New York, United States Action Tech Full time

    This role is a hybrid position requiring a commitment to work onsite for four days a week.Applicants must be US Citizens or Green Card holders and reside in the tri-state area.We are seeking candidates for either an AVP or VP level position.Position OverviewThe AI/ML division is at the forefront of creating innovative solutions that provide a distinct...

  • Design Engineer

    2 weeks ago


    New York, United States Medical Device Startup Full time

    Design and Manufacturing Engineer The ideal candidate possesses both a high level of technical expertise and an innate passion to build. You will play a critical role in creating and refining designs and processes in order to improve the product design, manufacturability, quality, and productivity. You will manage and oversee the manufacturing of a medical...


  • New York, New York, United States Adaptive ML Full time

    About the TeamAdaptive ML is a cutting-edge technology company that specializes in building singular generative AI experiences. Our team is dedicated to democratizing the use of reinforcement learning and creating foundational technologies, tools, and products that enable models to learn directly from user interactions and self-critique and self-improve from...

  • Python Engineer

    4 weeks ago


    New York, United States Focus Capital Markets Full time

    In this role, you will directly work with and support some of the top data scientists and machine learning engineers to prototype and build end-to-end code and full-scale applications that leverage ML tools in their deployment. You will develop new tools and software to support the business in harnessing their data and analytics capabilities, including the...


  • New York, United States GLOBALFOUNDRIES Full time

    About GlobalFoundries GlobalFoundries (GF) is a leading full-service semiconductor foundry providing a unique combination of design, development, and fabrication services to some of the world's most inspired technology companies. With a global manufacturing footprint spanning three continents, GF makes possible the technologies and systems that transform...

  • AI/ML Engineer

    2 months ago


    New York, United States 2Bridge Partners Full time

    Job DescriptionJob DescriptionSeeking an AI Platform Engineer to join as a technical team leader. You will play a crucial role in building and evolving the AI Platform, with a focus on code generation, lifecycle benefits and the goal of a more productive engineering organization.ResponsibilitiesCreate AI tools that empower development, with a focus on...


  • New York, United States Hugging Face Full time

    Job DescriptionJob DescriptionHere at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on...