Machine Learning Engineer/Technical Lead

2 weeks ago


Sunnyvale, United States FedML, Inc. Full time

Responsibilities

  • Participate in the development of MLOps/AIOps machine learning platform and open source communities
  • Responsible for the foundational research and product development, and continuously improve the R&D efficiency
  • Responsible for feature development, algorithm optimization of the platform, improving user experience and usability through cutting-edge or mature technologies
  • Participate in or lead design reviews with peers and stakeholders to decide amongst available technologies
  • Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency)
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback

Minimum Qualifications

  • Bachelor’s degree or equivalent practical experience in computer science or related areas.
  • 2 years of experience with software development in one or more programming languages (Python, Java, JavaScript, C/C++), or 1 year of experience with an advanced degree
  • 2 years of experience with data structures or algorithms in either an academic or industry setting
  • Good communication and writing skills in English environment

Preferred Qualifications

  • Proficient in Python language, familiar with typical deep learning frameworks (TensorFlow/PyTorch) and models such as CNN, Transformer, GBDT, LR, etc.
  • Experience in developing MLOps features, including workflow orchestration, model training, model serving, monitoring/observability, versioning of data, code, model, data pipeline, logging, etc.
  • Familiar with communication backends (MPI, NCCL, RPC, MQTT), GPU CUDA, and other core modules of deep learning frameworks, those who have participated in the development of specific modules of famous deep learning frameworks are preferred
  • Experience with federated learning, distributed training on large-scale model is preferred
  • Combine the platform and the open source library to improve the training efficiency of deep learning end-to-end through task scheduling, elastic disaster recovery, performance optimization and other measures, involving K8S/KubeFlow, network optimization, and distributed training

About the Job

FedML, Inc. (https:/fedml.ai) empowers our clients to build & scale any machine learning or artificial intelligence models anywhere. That includes the latest foundation models as well as more traditional ML models.  Our products cover both training, serving with a low-code UI MLOPs & LLMOps platform. We also offer a Federated Machine Learning solution for cross-silo training for data privacy sensitive applications.

Our earliest products power federated machine learning missions for clients in several industries, where data privacy, low latency serving, and low cost of data storage are important to the client.  Our easy-to-use FedML MLOps solution enables data science and machine learning engineering to work seamlessly together to deploy & manage their model to production machines. Our federated learning and serving solutions support siloed edge devices, smartphones, and IoT.

Our next generation of solutions includes geo-distributed machine learning and serving that continues our tradition of delivering easy-to-use, simple, low-cost, and enterprise grade MLOPs solutions.  Our MLOps and evolving LLMOps platform will always empower experimentation, observability, evaluation, governance, and collaboration for our clients’ AI & ML training and serving needs, as well as other general computing needs.

FedML supports vertical solutions across a broad range of industries (healthcare, finance, insurance, automotive, advertising, smart cities, IoT etc,) and applications (computer vision, natural language processing, data mining, and time-series forecasting). Its core technology is backed by more than 3 years of cutting-edge research of its co-founders who are recognized leaders in the federated machine learning community.

FedML's researchers and software engineers and product teams are busy developing the next-generation FedML platform for machine learning and artificial intelligence and we're looking to grow our team with skilled professionals who bring fresh ideas from all areas, including machine learning and its applications, computer vision, natural language processing, large-scale system design, distributed/cloud computing/systems, MLOps, security/privacy, mobile/IoT systems, and networking.  We’re an early stage startup, hence you will work on projects which are critical to our customers' and our business needs.  If you love to learn, and love to convert ideas into real and scalable machine learning infrastructure products and applications, FedML may be a great place for you.

Location

Our HQ is in Sunnyvale California.  Preference is for someone local who can be at our office regularly. Hybrid is ok.

How to apply

If you are interested, pleaseapply via the link.



  • Sunnyvale, United States FedML, Inc. Full time

    Job DescriptionJob DescriptionResponsibilities Participate in the development of MLOps/AIOps machine learning platform and open source communities Responsible for the foundational research and product development, and continuously improve the R&D efficiency Responsible for feature development, algorithm optimization of the platform, improving user...


  • Sunnyvale, United States Illumio Full time

    Senior Machine Learning Engineer On-site work model of 5 days in office/week in Sunnyvale, CA As a Senior Machine Learning Engineer, you will have the opportunity to build and lead a new team focused on developing cutting-edge machine learning solutions for our industry-leading Zero Trust security products.  You will be responsible for driving the design,...


  • Sunnyvale, United States RIT Solutions, Inc. Full time

    Machine Learning Engineer/Data Scientist Sunnyvale, CA - Hybrid Job Description In this unique position, individual will be at the intersection on Client Engineering and Data Science, and perform tasks to support our advanced measurement products. Strong technical skills combined with knowledge of Client engineering, data science/statistics concepts and...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Alibaba Cloud Full time

    Sinian team focuses on heterogeneous compute and software-hardware cooperative technologies. We have worked on a unified heterogeneity-aware lowering and optimization platform, accelerating applications on various heterogeneous hardware. Our goal is to unleash the hardware computing power and deploy deep learning applications for improving portability,...


  • Sunnyvale, United States Red Oak Technologies Full time

    “NOTE: If selected for this position, you are required to perform ALL work within a commutable distance of your assigned Worksite Location. Three days a week onsite, Tues, Wed, Thurs, with Monday and Friday being remote”Junior level- Machine Learning Engineer for Sunnyvale, CA for a 12-month Contract.Responsibilities:Design and develop Machine Learning...


  • Sunnyvale, California, United States Illumio Full time

    On-site work model with 3 days in office/week in Sunnyvale, CA for the summer of 2024. Engineering Intern, Machine Learning (Core ML)About the Role:As a Machine Learning Engineer, you will be working on the full lifecycle of data science and graph learning algorithms to solve cybersecurity issues faced by businesses of all scales. Your contributions will...


  • Sunnyvale, United States Grid Dynamics Full time

    Description Position at Grid Dynamics Our customer is an American multinational technology company headquartered in San Ramon, California. Our customer is one of the world's largest technology companies based in Silicon Valley with operations all over the world. On this project, we are working with bleeding-edge big data technologies to develop a...


  • Sunnyvale, United States DoorDash Full time

    About the RoleWe’re looking for a passionate Applied Machine Learning expert to join our team. As a Staff Machine Learning Engineer, you’ll be conceptualizing, designing, implementing, and validating algorithmic improvements to the growth and personalization experiences at the heart of our fast-growing grocery and retail delivery business. You will use...


  • Sunnyvale, United States Red Oak Technologies Full time

    Red Oak Technologies is a leading provider of comprehensive resourcing solutions across a variety of industries and sectors including IT, Marketing, Finance, Business Operations, Manufacturing and Engineering. We specialize in quickly acquiring and efficiently matching top-tier professional talent with clients in immediate need of highly skilled contract,...


  • Sunnyvale, United States Red Oak Technologies Full time

    Red Oak Technologies is a leading provider of comprehensive resourcing solutions across a variety of industries and sectors including IT, Marketing, Finance, Business Operations, Manufacturing and Engineering. We specialize in quickly acquiring and efficiently matching top-tier professional talent with clients in immediate need of highly skilled contract,...


  • Sunnyvale, United States Red Oak Technologies Full time

    Red Oak Technologies is a leading provider of comprehensive resourcing solutions across a variety of industries and sectors including IT, Marketing, Finance, Business Operations, Manufacturing and Engineering. We specialize in quickly acquiring and efficiently matching top-tier professional talent with clients in immediate need of highly skilled contract,...


  • Sunnyvale, California, United States Demo160: Core Template TEMC Full time

    Overview: We are looking for a Machine Learning (ML) Engineer to help us create artificial intelligence products.   Machine Learning Engineer responsibilities include creating machine learning models and retraining systems. To do this job successfully, you need exceptional skills in statistics and programming. If you also have knowledge of data science...


  • Sunnyvale, California, United States Demo160: Core Template TEMC Full time

    Overview: We are looking for a Machine Learning (ML) Engineer to help us create artificial intelligence products.   Machine Learning Engineer responsibilities include creating machine learning models and retraining systems. To do this job successfully, you need exceptional skills in statistics and programming. If you also have knowledge of data science...


  • Sunnyvale, United States DoorDash Full time

    About the RoleAs a Machine Learning Engineer, you will have the opportunity to leverage our robust data and machine learning infrastructure to develop inference and ML models that impact millions of users across our three audiences and tackle our most challenging business problems. You will work with other engineers, analysts, and product managers to develop...


  • Sunnyvale, CA, United States Baidu Full time

    Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in the AI industry and the desire to solve them? Do you want to work with a world-class team to explore the fast-growing AI hardware opportunities and impact on the AI industry?We’re looking forward to you joining us to collaborate, contribute, and...


  • Sunnyvale, United States FedML, Inc. Full time

    Job DescriptionJob DescriptionResponsibilities Participate in the development of machine learning platform and open source communities Responsible for the foundational research and product development, and continuously improve the R&D efficiency Responsible for feature development, algorithm optimization of the platform, improving user experience and...


  • Sunnyvale, TX, United States Google Full time

    Minimum qualifications:Bachelor's degree or equivalent practical experience. 8 years of experience in software development, and with data structures/algorithms. 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture. 5 years of experience with machine learning algorithms and tools...


  • Sunnyvale, CA, United States Google Full time

    Minimum qualifications:Bachelor's degree or equivalent practical experience. 8 years of experience in software development, and with data structures/algorithms.5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture. 5 years of experience with machine learning algorithms and tools (e.g.,...