ML Compiler Engineer

3 days ago


Santa Clara, United States Oho Group Ltd Full time

Looking for an ML Compiler Engineer


This exciting start-up focuses on harnessing the potential of AI for the betterment of humanity. In pursuit of this goal, they are crafting a comprehensive solution encompassing both software and hardware, engineered to deliver vastly superior performance and efficiency compared to traditional methods. The design emphasizes scalability, ensuring adaptability to evolving needs.


Currently, they are in the process of creating the SPU (Spatial Processing Unit), an economical AI processor meticulously crafted from foundational principles to offer unparalleled performance per watt, empowering the advancement of next-generation AI workloads.


Skills:

• Strong proficiency in organization and time management

• Expertise in Software Architecture

• Proficient in Parallel Programming

• Exceptional proficiency in C/C++

• Extensive experience with CUDA

• Advanced Problem-Solving Skills

• Demonstrated proficiency in PyTorch with successful deployment of models

• Experience in performance and memory optimization

• Aptitude for benchmarking and optimization techniques


Requirements:

• A bachelor's degree in a STEM field (preferably EE or CS). Candidates with a master's degree or equivalent experience will be given preference.

• A deep understanding of algorithms and numerical analysis

• Familiarity with low-level software drivers and kernel operations

• 4+ years of experience in software development for products and/or enterprise

• Some knowledge of compiler design and development

• Experience working on large and complex applications, with proficiency in code manipulation and optimization

• Excellent problem-solving skills, with the ability to troubleshoot and resolve critical technical issues independently

• Proficiency in code versioning tools such as Git, Mercurial, or SVN

• Familiarity with continuous integration practices

• Knowledge of frameworks like TensorFlow, PyTorch, or ONNX is not mandatory but would be beneficial.



  • Santa Clara, California, United States Qualcomm Full time

    Job Overview: We are seeking a skilled Machine Learning Compiler Engineer to join our team at Qualcomm. As an ML Compiler Engineer, you will work on developing state-of-the-art machine learning development tools and software libraries.


  • Santa Clara, California, United States Qualcomm Full time

    Company Overview: Qualcomm Technologies, Inc. is a leader in the development of AI and machine learning technologies. Salary Information: The estimated annual salary for this role is $175,000-$225,000 based on industry standards and location. Job Description: This is an exciting opportunity to work on a wide range of ML compilers, improve their optimization...


  • Santa Clara, United States d-Matrix Full time

    d-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The "holy grail" of AI compute has been to break through the memory wall to minimize data movements. We've achieved this with a first-of-its-kind DIMC engine. Having secured over $154M, $110M in our Series B offering, d-Matrix is...


  • Santa Clara, United States Acceler8 Talent Full time

    ML Performance Engineer. Are you ready to pioneer the future of AI, making it private, convenient, and profitable for all? At our company, we're on a mission to empower developers and enterprises worldwide by migrating inference to user devices and supercharging existing on-device inference. We believe this...


  • Santa Clara, California, United States Yoh Full time

    Job Overview: About Yoh: As a leading workforce solutions company, we provide flexible and skilled talent to help businesses succeed. Our mission is to deliver exceptional services that meet the unique needs of our clients and candidates. Salary Range: $175,000 - $240,000 per annum. Job Description: In this critical role as a Senior Compiler Engineer, you...

  • DevOps Engineer

    3 weeks ago


    Santa Clara, United States EVONA Full time

    DevOps Engineer - AI/ML Integration. Location: Santa Clara, CA. Role Overview: We are hiring a DevOps Engineer in Santa Clara to streamline development workflows, optimize cloud infrastructure, and deploy machine learning models into production environments. This role combines traditional DevOps practices with cutting-edge AI/ML technology, offering a unique...

  • Runtime Engineer

    3 days ago


    Santa Clara, United States Oho Group Ltd Full time

    Runtime Engineer. Our client are on a mission to make AI accessible and impactful without compromising the environment. By building a high-performance, portable compiler, they empower developers to train models in the cloud, deploy them at the edge, and everything in between, all optimized for efficiency and scalability. This position focuses on advancing...


  • Santa Clara, California, United States Amazon Full time

    About the Role: We are seeking an experienced Search Ranking Systems Engineer to join our team at Amazon. This is a unique opportunity to work on building search ranking systems that serve thousands of product types, billions of queries, and hundreds of millions of customers worldwide. Key Responsibilities: As a Search Ranking Systems Engineer, you will be...


  • Santa Clara, California, United States Proximity Works Full time

    Job Description: We are seeking a highly skilled AI/ML Solutions Architect to lead the design, development, and implementation of cutting-edge AI/ML solutions. As a key technical leader, you will be responsible for architecting scalable, low-latency distributed systems optimized for AI/ML workloads. The ideal candidate will have 8+ years of experience in...


  • Santa Clara, United States Disability Solutions Full time

    Johnson & Johnson MedTech Digital (JJMT) is recruiting for a Director of Applied ML/AI to establish and incubate the applied sciences team to create new innovative solutions for AI for improved surgical outcomes. JJMT Digital is passionate about crafting the future of digital surgery by developing software products and solutions for hospitals, clinicians,...


  • Santa Ana, California, United States First American Full time

    Description: We're looking for a talented Cloud-Based ML Engineer to join our team! As a Cloud-Based ML Engineer, you will be responsible for building and managing cloud infrastructure to deploy Machine Learning models in production, in conformance with the organization's security and compliance needs. Your experience in fine-tuning Large Language Models...


  • Santa Clara, California, United States Apple Full time

    About the Role: As a Staff Machine Learning Infrastructure Engineer on the ML Compute Team, you'll join a phenomenal team of hardworking engineers and be entrusted with a range of responsibilities. Your work will include collaborating with teams across Apple on machine learning workloads, driving the design and delivery of critical features, and...


  • Santa Clara, United States Proximity Works Full time

    We are looking for a Senior Solutions Architect to design, develop, and scale innovative AI/ML-driven solutions. You will be responsible for architecting highly scalable, low-latency distributed systems optimized for AI/ML workloads. As a key technical leader, you will solve complex challenges, influence next-generation AI/ML infrastructures, and guide...


  • Santa Clara, California, United States ServiceNow Full time

    What You'll Do: You'll work with our team to design and implement test automation frameworks for machine learning models. This includes: collaborating with ML engineers to ensure that functional and non-functional requirements are addressed; establishing and improving metrics collection and reporting; incorporating research of industry trends and applying best...


  • Santa Clara, California, United States Apple Full time

    Overview: Apple is a pioneer in the field of machine learning, and our Machine Learning Platform Technology & Infra team is at the forefront of this innovation. We've developed a platform that enables the next generation of intelligent experiences on Apple products and services. As a software engineer on our team, you'll have the opportunity to shape the...

