Software Engineer, Systems ML

4 months ago


Bellevue, United States META Full time

Summary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web.Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS, cuDNN, AITemplate, FlashAttention and development of runtimes such as LLM disaggregated runtime. HPC specialists spend time optimizing the program to reduce the accelerators idle time. They also develop tools to debug (cuda-gdb), profiler utilizing the accelerated computing hardware (such as PE’s/SFU etc in MTIA or Transformer engine in H100). They are experts in systems who are able to design, debug and accelerate AI workloads from single-node scale up to multi-node scale out distributed systems. They also are able to influence the next generation of Silicon architectures (such as Tensor Core in V100. Transformer Engine in H100) based on the evolving AI workload needs.We are hiring in multiple locations. Required Skills: Software Engineer, Systems ML - HPC Specialist Responsibilities: Apply relevant AI and machine learning techniques to build & optimize our intelligent systems that improve Metas products and experiences Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches Apply in depth knowledge of how the machine learning system interacts with the other systems around it Assist in goal setting related to project impact, AI system design, and ML excellence Minimum Qualifications: Minimum Qualifications: Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. 2+ years of experience in HPC and parallel computing. Proficiency in GPU programming using CUDA and familiarity with CUDA libraries (cuBLAS, cuDNN, etc.). Proven track record of leading successful HPC projects. Proven technical expertise in HPC architectures and technologies. Preferred Qualifications: Preferred Qualifications: PhD in Computer Science, Computer Engineering, or relevant technical field. Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python. Experience developing AI Compiler (TorchInductor in PyTorch 2.0). Public Compensation: $70.67/hour to $208,000/year + bonus + equity + benefits Industry: Internet Equal Opportunity: Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.



  • Bellevue, United States META Full time

    Summary: The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries of ML technologies while fostering a vibrant, global community of developers and researchers. Our team combines cutting-edge ML engineering with community-driven initiatives to...


  • Bellevue, United States META Full time

    Summary: The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries of ML technologies while fostering a vibrant, global community of developers and researchers. Our team combines cutting-edge ML engineering with community-driven initiatives to...


  • Bellevue, United States META Full time

    Summary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. We are hiring in multiple...


  • Bellevue, United States META Full time

    Summary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined...


  • Bellevue, United States Wal-Mart Associates, Inc. Full time

    Position: Senior Software Engineer Job Location: 10500 NE 8th Street, Bellevue, WA 98004 Duties: Create and maintain Python Software Development Kits (SDKs) for internal use. Ensure SDKs are well-documented for easy integration and usage by AI Engineers and cross-functional teams. Regularly update and improve SDKs to align with evolving project...

  • Software Engineer II

    4 weeks ago


    Bellevue, Washington, United States Belva Full time

    Job TitleBelva is seeking a talented Software Engineer II to join our team of passionate product builders.We are a trailblazing A.I. Telecommunications company, and we're looking for an individual who can take code ownership and help lead the charge in AI / ML solutions that make an impact in the lives of millions.As a Software Engineer II, you will work...

  • Software Engineer II

    2 months ago


    Bellevue, United States Belva.ai Full time

    Job DescriptionJob DescriptionAt Belva, we are seeking a talented and experienced Software Engineer II to join our team. We’re a trailblazing A.I. Telecommunications company, searching for an individual who can take code ownership and help lead the charge in AI / ML solutions that make an impact in the lives of millions.Role and Responsibilities:We are...


  • Bellevue, Washington, United States Amazon Full time

    Job SummaryAmazon's AGI Information organization is seeking a highly skilled and experienced Software Development Engineer to drive the development of industry-leading Knowledge Graph systems. As a key member of the AGI Information Web & Knowledge Services team, you will play a critical role in advancing AI/ML technologies that enable customers to leverage...


  • Bellevue, United States Meta Inc Full time

    Summary: In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the software stack around NCCL (NVIDIA Collective Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and...


  • Bellevue, Washington, United States Oliver Wyman Group Full time

    Job SummaryOliver Wyman Vector is seeking a skilled Software Systems Engineer to join our team. As a Software Systems Engineer, you will be responsible for defining and validating highly reliable system functionality, planning and executing complex systems integration, and performing risk management and trade study analyses.Key Responsibilities Define and...


  • Bellevue, Washington, United States META Full time

    About the Role:Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics.The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web.We are hiring in multiple...

  • IT Software Engineer

    1 month ago


    Bellevue, United States Sunrise Systems Full time

    Job Title: IT Software Engineer (.Net) Reference ID: - Location: Bellevue, WA Duration: Months Job Type: Contract (Candidates must be able to work on W without VISA sponsorship) This position is % onsite Looking for a minimum of years’ experience. Top must have skills: The following are interrelated and critical to the existing...


  • Bellevue, Washington, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Senior Software Development Engineer to join our AGI Finetuning organization. As a key member of our team, you will design, build, and maintain systems for evaluating our best-in-class models. You will work closely with our Applied Scientists to develop tools that support our modeling and evaluation team.Key...

  • Software Engineer

    3 weeks ago


    Bellevue, United States META Full time

    Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...


  • Bellevue, United States META Full time

    Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...


  • Bellevue, Washington, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Software Development Engineer to join our team at Amazon. As a key member of our team, you will be responsible for driving innovation and ML engineering to deliver a "best in the world" experience for our customers.Key ResponsibilitiesAs a seasoned software development engineer, you will be responsible for owning...


  • Bellevue, United States META Full time

    Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...


  • Bellevue, Washington, United States META Full time

    Summary:META is seeking a talented AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. This position will involve applying relevant AI infrastructure and hardware acceleration techniques to build and optimize intelligent ML systems that improve META's...

  • Software Engineer

    1 month ago


    Bellevue, Washington, United States Bayone Full time

    Job Title: Software Engineer - Distributed SystemsBayone is seeking a highly skilled Software Engineer - Distributed Systems to join our team. As a key member of our engineering team, you will be responsible for designing and developing scalable and fault-tolerant systems using your expertise in distributed systems, network system design, and large scale...


  • Bellevue, United States META Full time

    Summary: GenAI Media Editing is hiring an Engineering Manager who is passionate about incubating, developing and landing state-of-the-art ML models to power media experiences across the different surfaces of Meta's Family of Apps. Our team has shipped exciting features such as Imagine, Imagine Edit, Stickers and Imagine Flash and we continue to build...