On-device ML Engineer

3 weeks ago


New York, United States Hugging Face Full time
Job DescriptionJob Description

Here at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.

We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.

About the Role

As an On-device ML Engineer, you will explore cutting edge methods to run models on consumer platforms, with a special focus on Apple technologies. Your responsibilities will include optimizing, quantizing, and converting the best models for efficient execution on iPhones and Macs. Additionally, you will design, build, and contribute to open source software that demonstrates model usage and develop libraries to minimize friction for developers who may not be deeply familiar with ML. Beyond the technical challenges, your goal will be to disseminate these methods, facilitate their adoption, and create tools for the community.

Day-to-day tasks may include the following:

  • Model evaluation, considering quality, latency, memory, and storage needs. You understand the best model for a task may not be the latest SOTA, but the one with the best trade-off.
  • Strive to make SOTA models work efficiently on Apple platforms by converting them to native formats like Core ML or MLX, enabling execution on GPUs and the Neural Engine.
  • Dive into large codebases, such as Transformers, to optimize model architectures for Apple Silicon platforms, debug issues, and develop workarounds.
  • Write Swift code to implement or optimize ML tasks, including pre-and post-processing pipelines.
  • Produce high-quality technical documentation, including blog posts, tutorials, guides, social media threads, and concise demo apps.
  • Contribute to open source projects, like coremltools, to improve coverage of PyTorch operations.
  • Create tools that enable developers to convert, run, and share models easily, making it straightforward for researchers and practitioners to distribute models in device-friendly formats.
  • Occasionally, write or be ready to understand low-level code such as parallel GPU kernels.

About you

You’ll thrive in this position if you are:

  • Experienced Swift Developer: Have a strong background in Swift development with a practical, builder mindset and a good sense of software and application design.
  • Passionate About ML: Have a deep understanding of essential model architectures and a passion for machine learning.
  • Core ML Proficiency: Have experience using Core ML and understand its advantages and limitations.
  • Open Source Contributor: Are eager to publish and contribute to open-source libraries to help developers adopt ML.
  • Versatile Engineer: Can move across different levels of abstraction as needed, from UI to Metal kernels.
  • Readable Code: Write code that is easy to understand but are also prepared to make critical path ugly for optimization’s sake. (But just the critical path, please


  • New York, United States Hugging Face Full time

    Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better. We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are...

  • ML Engineer

    3 weeks ago


    New York, United States Trigyn Technologies Full time

    Job Description: The Machine Learning Engineer works at the intersection of data engineering and machine learning to expand the capabilities of the client's ChatGPT-style generative AI solution. This role collaborates with other data engineers to build data pipelines and infrastructure to support the machine learning models. Furthermore, it requires...


  • New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionWe are seeking a Director of Engineering with balanced expertise in Machine Learning (ML)/ML Operations (MLOps) and core software engineering to spearhead our engineering initiatives for an innovative web application product. This role demands a leader who not only has a profound technical grounding in both ML/MLOps and software...


  • New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionWe are seeking a Director of Engineering with balanced expertise in Machine Learning (ML)/ML Operations (MLOps) and core software engineering to spearhead our engineering initiatives for an innovative web application product. This role demands a leader who not only has a profound technical grounding in both ML/MLOps and software...

  • ML Research Engineer

    1 month ago


    New York, United States Genesis Therapeutics Full time

    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing groundbreaking therapies for patients suffering from severe disorders. Genesis AI team is focused on developing foundation models for small molecule drug discovery...

  • ML Research Engineer

    3 weeks ago


    New York, United States Genesis Therapeutics Full time

    We’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing groundbreaking therapies for patients suffering from severe disorders. Genesis AI team is focused on developing foundation models for small molecule drug discovery...

  • AI/ML Engineer

    4 weeks ago


    New York, United States Wesper Full time

    Job DescriptionJob DescriptionTHE OPPORTUNITY Wesper is looking for a smart and creative engineer to lead our AI/ML efforts and product initiatives. This includes advanced ML modeling for large-scale healthcare data synthesis, deep physiological signal optimization pipelines, and generative AI architectures. The right candidate will have an opportunity to...

  • Lead Product Engineer

    1 month ago


    New York, United States Fusemachines Full time

    About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...

  • Lead Product Engineer

    2 months ago


    New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...

  • Lead Product Engineer

    1 month ago


    New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...

  • Lead Product Engineer

    1 month ago


    New York, United States Fusemachines Full time

    About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...

  • Lead Product Engineer

    3 weeks ago


    New York, United States Fusemachines Full time

    Job DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...

  • Data Engineer

    2 days ago


    New York, United States Benchmark IT LLC Full time

    Our direct client, a fast-growing FinTech firm in New York City, is looking for a Data Engineer. In this role, you will work with Sales, Marketing, and Product teams to define, calculate, and grow their key operating metrics (e.g. sales, conversions, retention). This individual will conduct exploratory data analysis, statistical analysis, and predictive...

  • AI/ML, NLP Engineer

    1 month ago


    New York, United States Action Tech Full time

    This opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior AI/ML - NLP Engineer on our...

  • AI/ML, NLP Engineer

    3 weeks ago


    New York, United States Action Tech Full time

    This opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior AI/ML - NLP Engineer on our...

  • Senior ML Engineer

    1 week ago


    New York, United States Virtusa Full time

    Senior ML Engineer - CREQ191248 Description Job Description ML Engineer Skills Programming Languages: Proficiency in Python, familiarity with R is a plus. Machine Learning: Strong understanding of machine learning algorithms, model training, and evaluation, experience with libraries such as TensorFlow, PyTorch, Scikit-Learn, etc. API Development: Experience...

  • Data Engineer

    1 week ago


    New York, United States Benchmark IT - Technology Talent Full time

    Our direct client, a fast-growing FinTech firm in New York City, is looking for a Data Engineer. In this role, you will work with Sales, Marketing, and Product teams to define, calculate, and grow their key operating metrics (e.g. sales, conversions, retention). This individual will conduct exploratory data analysis, statistical analysis, and predictive...

  • Data Engineer

    6 days ago


    New York, United States Benchmark IT - Technology Talent Full time

    Our direct client, a fast-growing FinTech firm in New York City, is looking for a Data Engineer. In this role, you will work with Sales, Marketing, and Product teams to define, calculate, and grow their key operating metrics (e.g. sales, conversions, retention). This individual will conduct exploratory data analysis, statistical analysis, and predictive...

  • ML Engineering Intern

    3 weeks ago


    New York, New York, United States tapwage Full time

    Elevating the quality of human life through every conversation InternshipAbout the Team:At , we are a trailblazing force in the world of artificial intelligence, committed to pushing the boundaries of technology. Our latest breakthrough - the Nebula LLM - represents the cutting edge of innovation, and we're looking for dedicated Machine Learning Engineering...

  • ML Engineering Intern

    1 month ago


    New York, New York, United States tapwage Full time

    Elevating the quality of human life through every conversation InternshipAbout the Team:At , we are a trailblazing force in the world of artificial intelligence, committed to pushing the boundaries of technology. Our latest breakthrough - the Nebula LLM - represents the cutting edge of innovation, and we're looking for dedicated Machine Learning Engineering...