On-device ML Engineer
2 weeks ago
Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.
We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.
About the Role
As an On-device ML Engineer, you will explore cutting edge methods to run models on consumer platforms, with a special focus on Apple technologies. Your responsibilities will include optimizing, quantizing, and converting the best models for efficient execution on iPhones and Macs. Additionally, you will design, build, and contribute to open source software that demonstrates model usage and develop libraries to minimize friction for developers who may not be deeply familiar with ML. Beyond the technical challenges, your goal will be to disseminate these methods, facilitate their adoption, and create tools for the community.
Day-to-day tasks may include the following:
- Model evaluation, considering quality, latency, memory, and storage needs. You understand the best model for a task may not be the latest SOTA, but the one with the best trade-off.
- Strive to make SOTA models work efficiently on Apple platforms by converting them to native formats like Core ML or MLX, enabling execution on GPUs and the Neural Engine.
- Dive into large codebases, such as Transformers, to optimize model architectures for Apple Silicon platforms, debug issues, and develop workarounds.
- Write Swift code to implement or optimize ML tasks, including pre-and post-processing pipelines.
- Produce high-quality technical documentation, including blog posts, tutorials, guides, social media threads, and concise demo apps.
- Contribute to open source projects, like coremltools, to improve coverage of PyTorch operations.
- Create tools that enable developers to convert, run, and share models easily, making it straightforward for researchers and practitioners to distribute models in device-friendly formats.
- Occasionally, write or be ready to understand low-level code such as parallel GPU kernels.
You'll thrive in this position if you are:
- Experienced Swift Developer: Have a strong background in Swift development with a practical, builder mindset and a good sense of software and application design.
- Passionate About ML: Have a deep understanding of essential model architectures and a passion for machine learning.
- Core ML Proficiency: Have experience using Core ML and understand its advantages and limitations.
- Open Source Contributor: Are eager to publish and contribute to open-source libraries to help developers adopt ML.
- Versatile Engineer: Can move across different levels of abstraction as needed, from UI to Metal kernels.
- Readable Code: Write code that is easy to understand but are also prepared to make critical path ugly for optimization's sake. (But just the critical path, please )
- Optimization Techniques: Understand various optimization techniques, from kv-caching in transformers to post-training quantization and training-time methods.
- System Understanding: Have a strong systems understanding and can identify performance bottlenecks.
- Framework Proficiency: Have experience with various frameworks such as llama.cpp, MLX, PyTorch, and CoreNet.
- Are a good debugger.
- Can write excellent technical documentation.
- Engage in discussion forums and communities about these topics.
If you're interested in joining us but don't tick every box above, we still encourage you to apply We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.
More about Hugging Face
We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported-regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.
We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.
We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.
We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed, and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.
We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.
-
On-device ML Engineer
3 weeks ago
New York, United States Hugging Face Full timeJob DescriptionJob DescriptionHere at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on...
-
On-device ML Engineer
1 month ago
New York, United States Hugging Face Full timeJob DescriptionJob DescriptionHere at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on...
-
On-device ML Engineer
4 weeks ago
New York, United States Hugging Face Full timeJob DescriptionJob DescriptionHere at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on...
-
ML Engineer
3 weeks ago
New York, United States Trigyn Technologies Full timeJob Description: The Machine Learning Engineer works at the intersection of data engineering and machine learning to expand the capabilities of the client's ChatGPT-style generative AI solution. This role collaborates with other data engineers to build data pipelines and infrastructure to support the machine learning models. Furthermore, it requires...
-
Director of Engineering
2 months ago
New York, United States Fusemachines Full timeJob DescriptionJob DescriptionWe are seeking a Director of Engineering with balanced expertise in Machine Learning (ML)/ML Operations (MLOps) and core software engineering to spearhead our engineering initiatives for an innovative web application product. This role demands a leader who not only has a profound technical grounding in both ML/MLOps and software...
-
Director of Engineering
3 weeks ago
New York, United States Fusemachines Full timeJob DescriptionJob DescriptionWe are seeking a Director of Engineering with balanced expertise in Machine Learning (ML)/ML Operations (MLOps) and core software engineering to spearhead our engineering initiatives for an innovative web application product. This role demands a leader who not only has a profound technical grounding in both ML/MLOps and software...
-
ML Research Engineer
1 month ago
New York, United States Genesis Therapeutics Full timeWe’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing groundbreaking therapies for patients suffering from severe disorders. Genesis AI team is focused on developing foundation models for small molecule drug discovery...
-
ML Research Engineer
3 weeks ago
New York, United States Genesis Therapeutics Full timeWe’re a tight-knit team of proven drug hunters, deep learning researchers, and software engineers united by a common mission — drive AI innovation in biochemistry, discovering and developing groundbreaking therapies for patients suffering from severe disorders. Genesis AI team is focused on developing foundation models for small molecule drug discovery...
-
AI/ML Engineer
4 weeks ago
New York, United States Wesper Full timeJob DescriptionJob DescriptionTHE OPPORTUNITY Wesper is looking for a smart and creative engineer to lead our AI/ML efforts and product initiatives. This includes advanced ML modeling for large-scale healthcare data synthesis, deep physiological signal optimization pipelines, and generative AI architectures. The right candidate will have an opportunity to...
-
Lead Product Engineer
2 months ago
New York, United States Fusemachines Full timeAbout Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...
-
Lead Product Engineer
2 days ago
New York, United States Fusemachines Full timeAbout Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...
-
Lead Product Engineer
2 months ago
New York, United States Fusemachines Full timeAbout Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400...
-
Lead Product Engineer
2 months ago
New York, United States Fusemachines Full timeJob DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...
-
Lead Product Engineer
2 months ago
New York, United States Fusemachines Full timeJob DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...
-
Lead Product Engineer
3 weeks ago
New York, United States Fusemachines Full timeJob DescriptionJob DescriptionAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican...
-
Data Engineer
5 days ago
New York, United States Benchmark IT LLC Full timeOur direct client, a fast-growing FinTech firm in New York City, is looking for a Data Engineer. In this role, you will work with Sales, Marketing, and Product teams to define, calculate, and grow their key operating metrics (e.g. sales, conversions, retention). This individual will conduct exploratory data analysis, statistical analysis, and predictive...
-
AI/ML, NLP Engineer
1 month ago
New York, United States Action Tech Full timeThis opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior AI/ML - NLP Engineer on our...
-
AI/ML, NLP Engineer
3 weeks ago
New York, United States Action Tech Full timeThis opportunity is a hybrid position that requires 4 days onsite in either NYC or Greenwich, CT.All candidates must be US Citizens or Green card holders and already be local to the tri-state area!Job Description The AI/ML team is developing cutting edge solutions to establish a unique competitive edge for the firm. As a senior AI/ML - NLP Engineer on our...
-
Senior ML Engineer
2 weeks ago
New York, United States Virtusa Full timeSenior ML Engineer - CREQ191248 Description Job Description ML Engineer Skills Programming Languages: Proficiency in Python, familiarity with R is a plus. Machine Learning: Strong understanding of machine learning algorithms, model training, and evaluation, experience with libraries such as TensorFlow, PyTorch, Scikit-Learn, etc. API Development: Experience...
-
Data Engineer
2 weeks ago
New York, United States Benchmark IT - Technology Talent Full timeOur direct client, a fast-growing FinTech firm in New York City, is looking for a Data Engineer. In this role, you will work with Sales, Marketing, and Product teams to define, calculate, and grow their key operating metrics (e.g. sales, conversions, retention). This individual will conduct exploratory data analysis, statistical analysis, and predictive...