We have other current jobs related to this field that you can find below


  • Santa Clara, California, United States AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world.Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.Underpinning our...

  • Senior AI Engineer

    5 days ago


    Santa Clara, California, United States Blue River Technology Full time

    Job OverviewPosition: Senior AI EngineerLocation: Remote work available with occasional office presence required.Key ResponsibilitiesDesign and implement advanced deep learning classification systems tailored for the construction industry.Investigate and innovate new techniques to enhance detection accuracy and optimize inference speed.Collaborate with...


  • Santa Clara, California, United States Nvidia Full time

    Senior Machine Learning EngineerlocationsUS, CA, RemoteUS, TX, AustinUS, WA, RemoteUS, NY, RemoteUS, CA, Santa Claratime typeFull timejob requisition idJR1977220We're looking for a motivated Senior Machine Learning Engineer, focused on Vector Search, to join NVIDIA's RAPIDS Machine Learning team. RAPIDS is the open source suite of libraries that combine the...


  • Santa Clara, California, United States NVIDIA Full time

    Position Overview:We are on the lookout for a Lead Software Program Manager to spearhead our initiatives in the transformative realm of deep learning software. This is a dynamic environment filled with numerous program management challenges where we are applying engineering precision and operational excellence.Your role will involve collaborating with...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is on the lookout for a skilled Lead AI Systems Engineer to become a vital part of our Autonomous Vehicles division. In this position, you will leverage artificial intelligence to enhance Autonomous Vehicle perception, contributing to the development of our cutting-edge autonomous driving technology. We seek an innovative and inquisitive engineer who...


  • Santa Clara, California, United States Nvidia Corporation Full time

    Perception for autonomous vehicles (AV) is one of the most exciting and challenging areas to work on today. Machine learning plays a crucial role in this field, but to excel in machine learning for Perception AV, we need to master the fundamentals. Join the Perception ML Foundation team, where we combine expertise in machine learning, high-performance...

  • Scale-out Engineer

    3 weeks ago


    Santa Clara, California, United States Tenstorrent Inc. Full time

    Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high...


  • Santa Clara, California, United States Nvidia Full time

    Senior Scientific Machine Learning Software Engineer - PhysicslocationsUS, CA, Santa ClaraUS, Remotetime typeFull timejob requisition idJR1981550NVIDIA's deep learning and HPC platforms have made a huge impact in various fields and are broadly used across leading academic institutions, start-ups, and industry, including the world's largest Internet...


  • Santa Clara, California, United States Nutanix Full time

    Build on top of Open Source LLM (Large Language Models) to leverage a diverse dataset.Develop AI-based systems for Natural Language Processing (NLP)Develop tools and processes for automatically train, updating and evaluate LLM (Large Language Models)Strong foundation in Machine Learning (ML), Deep Learning, LLMs and NLPFamiliar with LLM (Large Language...


  • Santa Clara, California, United States Nvidia Corporation Full time

    NVIDIA is searching for a world-class engineer in graphics and AI to join our neural graphics product team. If you agree with us that the most exciting thing about the AI revolution is applying it to solve real problems, this team will be an excellent fit for you We are passionate about applications in generative AI, gaming, augmented reality, user-generated...


  • Santa Clara, California, United States NVIDIA Corporation Full time

    Position Overview:The role of a Solutions Architect for DGX Cloud involves engaging with clients to facilitate the integration of cutting-edge Artificial Intelligence (AI) technologies into their operations. This position is pivotal in the NVIDIA AI Enterprise (NVAIE) Segment Team, which is dedicated to ensuring the effective implementation of DGX Cloud and...


  • Santa Clara, California, United States AMD Full time

    JOIN AMD AND MAKE A DIFFERENCEAt AMD, we are dedicated to revolutionizing lives through our advanced technology, enhancing our industry, communities, and the global landscape. Our vision is to create exceptional products that propel next-generation computing experiences, serving as the foundation for data centers, artificial intelligence, personal computing,...


  • Santa Clara, California, United States NVIDIA Full time

    As a Lead Solutions Architect focusing on AI/ML Storage Systems, you will play a crucial role in our innovative team, contributing to the development, implementation, and management of cutting-edge storage solutions designed specifically for Artificial Intelligence and Machine Learning applications. This position encompasses a variety of areas, including...


  • Santa Clara, California, United States d-Matrix Full time

    Software Engineer, Senior - AI/ML Workloadsd-Matrix - Santa Clara, CALocationSanta Clara, CaTypeFull timeDepartmentR&D - SW Kernels & Workloadsd-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The "holy grail" of AI compute has been to break through the memory wall to minimize data...


  • Santa Clara, California, United States Amazon Web Services, Inc. Full time

    Position Overview:We are in search of an innovative and analytical thinker to become a part of our team as a Lead AI Research Scientist specializing in prototyping at Amazon Web Services, Inc. If you possess a strong enthusiasm for technology, along with a proven track record in developing machine learning models tailored for business solutions, and have the...


  • Santa Clara, California, United States Promote Project Full time

    About Promote Project: Promote Project is a leader in innovative technology solutions, dedicated to pushing the boundaries of what is possible in the realm of artificial intelligence and cloud computing. Our commitment to excellence is reflected in our talented workforce and our pursuit of groundbreaking advancements.Position Overview: We are seeking a...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA has evolved into a leader in accelerated computing platforms, encompassing GPU, DPU, and software solutions for AI/ML, deep learning, analytics, visual simulation, and professional graphics across diverse sectors. We are seeking a visionary leader to join one of the most dynamic and rapidly expanding teams focused on Cloud Service Providers (CSPs).Key...

  • Data Science Manager

    1 month ago


    Santa Clara, California, United States Consulting Full time

    Establish operational objectives and work plans for the Data Science/Computational Linguistics group that meet the strategic objectives of the AI team.6+ of software industry experience in Data Science or Natural Language ProcessingHands-on experience with scalable machine learning techniques applied to both structured and natural language dataSolid...


  • Santa Clara, California, United States d-Matrix Full time

    d-Matrix has fundamentally changed the physics of memory-compute integration with our digital in-memory compute (DIMC) engine. The "holy grail" of AI compute has been to break through the memory wall to minimize data movements. We've achieved this with a first-of-its-kind DIMC engine. Having secured over $154M, $110M in our Series B offering, d-Matrix is...


  • Santa Ana, California, United States Deep Rock Water Full time

    Position OverviewAt Deep Rock Water, we are dedicated to fostering healthier lifestyles, vibrant communities, and a sustainable environment. Our legacy spans over a century, and we recognize that investing in our employees is crucial. We empower our teams to embrace diverse perspectives, accelerate problem-solving, and develop innovative solutions that cater...

Manager, Deep Learning Inference

3 months ago


Santa Clara, California, United States NVIDIA Full time

At NVIDIA, we are building the world's leading AI computing platform. The mission of the TensorRT team is to deliver software solutions for achieving state-of-the-art performance and efficiency in Machine Learning inference with NVIDIA GPUs.

We are looking for a hands-on, highly technically experienced and motivated engineering manager to help lead critical work in the Deep Learning Inference Software team, and drive the development of ONNX and TensorRT software.

What you'll be doing:

In this role, you will help shape the strategy for inference deployment workflows and lead the ONNX development efforts at NVIDIA.

Help define and drive ONNX inference workflows and development objectivesCollaborate closely with industry partners in advancing ONNXCoordinate planning and execution of inference strategy in concert with various internal teams at NVIDIAGrow and develop a team of world-class engineers
What we need to see:

Masters (or equivalent experience) or PhD and at last 12 overall years of relevant industry experience in Computer Science, Artificial Intelligence, Applied Math, or related field5+ years of demonstrated experience in leading and mentoring multiple software engineering teamsStrong experience with C++11/C++14Working knowledge or experience with TensorRT, PyTorch, TensorFlow, JAX, ONNX Runtime or other ML frameworks.Excellent understanding of software development practices including architecting, development, testing, continuous integration, and documentationExcellent communication skills, strong analytical, and organization skills
Ways to Stand Out From the Crowd:

Significant contributions to Deep Learning optimizations for inference.Strong Python programming experienceFamiliarity with CUDA kernel programmingExperience working directly with AI hardware and software development teams.Exceptional project management skills, with a demonstrated ability to lead complex projects to completion.A charismatic leader who inspires innovation and drives the team towards achieving NVIDIA's vision.
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's an outstanding legacy of innovation that's fueled by phenomenal technology-and amazing people.

Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join our team and see how you can make a lasting impact on the world.

The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.