Software Engineer, Machine Learning, Infrastructure Runtime and API

4 weeks ago


Sunnyvale TX, United States Google Full time

Minimum qualifications:Bachelor's degree or equivalent practical experience.8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, Javascript).5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture.3 years of experience with machine learning algorithms and tools (e.g., TensorFlow), or applied ML (e.g., deep learning, natural language processing).3 years of experience with low-level programming. Preferred qualifications:Master’s degree or PhD in Engineering, Computer Science, or a related technical field.5 years of experience working in a complex, matrixed organization.Experienced with low-level programming. Understanding of advanced Machine Learning development, with the ability to work full-stack from API design to programming languages, runtimes, compilers, and advanced hardware. About the job Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.Machine Learning Infrastructure Runtime and API (MIRA)’s mission is to accelerate Machine Learning (ML) for Google and the world by building a modular and scalable ML infrastructure to enable users to develop and execute ML programs seamlessly across frameworks and heterogeneous hardware with performance, usability, and extensibility. We are building the ML infrastructure between the frameworks and hardware as a platform, an operating system for ML programs.Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.The US base salary range for this full-time position is $189,000-$284,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Responsibilities Maintain a cross-stack, co-design approach to support Google’s diverse ML ecosystem needs, minimizing use-case specific code paths and turn down investments in maintenance of such fragmented efforts in the current stack.Support large-model scale-out training through horizontal scaling.Migrate existing frameworks (e.g., TensorFlow, JAX, PyTorch) runtimes (e.g., TFExecutor, TFRT, PJRT) and product area custom workflows onto MIRA, minimizing any user disruption.Enable integration with open-source components (with frameworks, compilers) in OSS.Partner with other Core Machine Learning functional areas and Google DeepMind to continuously bring exceptional runtime capabilities to our product areas (e.g., Youtube, Search, etc.).



  • Dallas, TX, United States Goldman Sachs Full time

    MORE ABOUT THIS JOB: JD PRX Java Developer VP Role Production Runtime Experience (PRX) is a Technology Business Unit focused on running scalable production management services with a mandate of operational excellence and operational risk reduction that is achieved through large scale automation, best-in-class engineering and by leveraging data sciences and...


  • Sunnyvale, United States StormAI Full time

    Senior Software Engineer (C++) $15M Funding - VC backed AI Company USD$160k - $220k Base + Early stage equity Sunnyvale, USWant to join a company that where you'd be working alongside a team of world-class entrepreneurs experts in AI and Robotics?We are working with a company that helps positively impact warehouses, schools, hospitals, hotels, and many...


  • Sunnyvale, United States DoorDash Full time

    About the RoleWe’re looking for a passionate Applied Machine Learning expert to join our team. As a Staff Machine Learning Engineer, you’ll be conceptualizing, designing, implementing, and validating algorithmic improvements to the growth and personalization experiences at the heart of our fast-growing grocery and retail delivery business. You will use...


  • Sunnyvale, United States StormAI Full time

    Senior Software Engineer (C++) 💻$15M Funding - VC backed AI Company 🏥🚀USD$160k - $220k Base + Early stage equity💲Sunnyvale, USWant to join a company that where you'd be working alongside a team of world-class entrepreneurs experts in AI and Robotics?We are working with a company that helps positively impact warehouses, schools, hospitals, hotels,...

  • Software Engineering

    21 hours ago


    Dallas, TX, United States Goldman Sachs Full time

    JD PRX Java Developer VP Role Production Runtime Experience (PRX) is a Technology Business Unit focused on running scalable production management services with a mandate of operational excellence and operational risk reduction that is achieved through large scale automation, best-in-class engineering and by leveraging data sciences and machine learning....


  • Sunnyvale, United States FedML, Inc. Full time

    Job DescriptionJob DescriptionResponsibilities Participate in the development of MLOps/AIOps machine learning platform and open source communities Responsible for the foundational research and product development, and continuously improve the R&D efficiency Responsible for feature development, algorithm optimization of the platform, improving user...


  • Sunnyvale, United States META Full time

    We are the teams who create all of Meta's products used by billions of people around the world. Want to build new features and improve existing products like Messenger, Video, Groups, News Feed, Search and more? Want to solve unique, large scale, highly complex technical problems? Meta is seeking experienced full-stack Software Engineers to join our product...


  • Sunnyvale, CA, United States Demo160: Core Template TEMC Full time

    Overview: We are looking for a Machine Learning (ML) Engineer to help us create artificial intelligence products. Machine Learning Engineer responsibilities include creating machine learning models and retraining systems. To do this job successfully, you need exceptional skills in statistics and programming. If you also have knowledge of data science and...


  • Sunnyvale, CA, United States Google Full time

    Minimum qualifications:Bachelor's degree or equivalent practical experience.4 years of experience in product management, consulting, co-founder or related technical role.2 years of experience building and shipping technical products.Experience developing or launching products or technologies within Machine Learning.Preferred qualifications:Master's degree in...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, CA, United States Diverse Lynx Full time

    We are looking for a Machine Learning Engineer Sunnyvale, CA - Onsite from Day 1 Job Type: Fulltime Customer Systems Job Summary Conversational Engineering develops next-generation AI and NLP solutions. Our mission is to maintain a comprehensive and effective support, sales & payment experience for customers around the globe. Our conversational...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Nexa AI Full time

    NEXA AI invented functional tokens and Octopus models for AI agents. We provide more accurate AI agent solutions, 4x faster and 10x cheaper than OpenAI GPT-4o API, with a latency of ~0.3s. Our product Octoverse enables developers to build AI Companions that understand and complete tasks for your users in apps. Learn more at...


  • Sunnyvale, United States Nexa AI Full time

    NEXA AI invented functional tokens and Octopus models for AI agents. We provide more accurate AI agent solutions, 4x faster and 10x cheaper than OpenAI GPT-4o API, with a latency of ~0.3s. Our product Octoverse enables developers to build AI Companions that understand and complete tasks for your users in apps. Learn more at...


  • Sunnyvale, United States Nexa AI Full time

    NEXA AI invented functional tokens and Octopus models for AI agents. We provide more accurate AI agent solutions, 4x faster and 10x cheaper than OpenAI GPT-4o API, with a latency of ~0.3s. Our product Octoverse enables developers to build AI Companions that understand and complete tasks for your users in apps. Learn more at...


  • Irving, TX, United States Ascendion Inc. Full time

    About Ascendion Ascendion is a full-service digital engineering solutions company. We make and manage software platforms and products that power growth and deliver captivating experiences to consumers and employees. Our engineering, cloud, data, experience design, and talent solution capabilities accelerate transformation and impact for enterprise clients....