Software Engineer, Systems ML

5 days ago


Bellevue, United States META Full time

Summary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. The MTIA Software team has been developing a comprehensive AI compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware directly impact the performance and deployment velocity of both AI training and inference platforms at Meta.

You will be working on one of the core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine learning workloads on the current & next generation of MTIA AI hardware platforms. You will work closely with AI researchers to analyze deep learning models and lower them efficiently on MTIA hardware. You will also partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimization, and performance tuning techniques. You will gain valuable experience in developing machine learning compiler frameworks and will help drive next-generation hardware-software codesign for AI domain-specific problems.

Required Skills: Software Engineer, Systems ML - Frameworks / Compilers / Kernels

Responsibilities:
• Develop the software stack with one of the following core focus areas: AI frameworks, compiler stack, high-performance kernel development and acceleration onto the next generation of hardware architectures.
• Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state-of-the-art inference and training AI hardware accelerators and optimize their performance.
• Analyze deep learning networks; develop & implement compiler optimization algorithms.
• Collaborate with AI research scientists to accelerate the next generation of deep learning models such as recommendation systems, generative AI, computer vision, NLP, etc.
• Tune and optimize the performance of deep learning framework & software components.

Minimum Qualifications:
• Proven C/C++ programming skills.
• Experience in AI framework development or accelerating deep learning models on hardware architectures.
• Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.

Preferred Qualifications:
• A Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field and 7+ years of experience in AI framework development or accelerating deep learning models on hardware architectures, OR a Master's degree in Computer Science, Computer Engineering, or relevant technical field and 4+ years of such experience, OR a PhD in Computer Science, Computer Engineering, or relevant technical field and 3+ years of such experience.
• Knowledge of GPU, CPU, or AI hardware accelerator architectures.
• Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, TensorRT.
• OR AI high performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming, or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm, etc.
• OR AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, and hardware-specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus.
• OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis, and associated tooling development.

Public Compensation: annual salary (range unavailable) + bonus + equity + benefits

Industry: Internet

Equal Opportunity: Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.



  • Bellevue, United States META Full time

    Summary: The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries of ML technologies while fostering a vibrant, global community of developers and researchers. Our team combines cutting-edge ML engineering with community-driven initiatives to...


  • Bellevue, United States META Full time

    Summary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. Some aspects of this role as...


  • Bellevue, United States META Full time

    Summary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. We are hiring in multiple...


  • Bellevue, United States Wal-Mart Associates, Inc. Full time

    Position: Senior Software Engineer Job Location: 10500 NE 8th Street, Bellevue, WA 98004 Duties: Create and maintain Python Software Development Kits (SDKs) for internal use. Ensure SDKs are well-documented for easy integration and usage by AI Engineers and cross-functional teams. Regularly update and improve SDKs to align with evolving project...

  • Software Engineer II

    3 weeks ago


    Bellevue, Washington, United States Belva Full time

    Belva is seeking a talented Software Engineer II to join our team of passionate product builders. We are a trailblazing A.I. Telecommunications company, and we're looking for an individual who can take code ownership and help lead the charge in AI / ML solutions that make an impact in the lives of millions. As a Software Engineer II, you will work...

  • Software Engineer II

    2 months ago


    Bellevue, United States Belva.ai Full time

    Job Description: At Belva, we are seeking a talented and experienced Software Engineer II to join our team. We’re a trailblazing A.I. Telecommunications company, searching for an individual who can take code ownership and help lead the charge in AI / ML solutions that make an impact in the lives of millions. Role and Responsibilities: We are...


  • Bellevue, Washington, United States Wal-Mart Associates, Inc. Full time

    Job Summary: We are seeking a highly skilled Senior Software Engineer to join our team at Wal-Mart Associates, Inc. The ideal candidate will have a strong background in software development, with a focus on cloud computing and machine learning. Key Responsibilities: Design and develop scalable software solutions using cloud computing platforms such as GCP...


  • Bellevue, Washington, United States Amazon Full time

    Job Summary: Amazon's AGI Information organization is seeking a highly skilled and experienced Software Development Engineer to drive the development of industry-leading Knowledge Graph systems. As a key member of the AGI Information Web & Knowledge Services team, you will play a critical role in advancing AI/ML technologies that enable customers to leverage...


  • Bellevue, United States Meta Inc Full time

    Summary: In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the software stack around NCCL (NVIDIA Collective Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and...


  • Bellevue, Washington, United States Oliver Wyman Group Full time

    Job Summary: Oliver Wyman Vector is seeking a skilled Software Systems Engineer to join our team. As a Software Systems Engineer, you will be responsible for defining and validating highly reliable system functionality, planning and executing complex systems integration, and performing risk management and trade study analyses. Key Responsibilities: Define and...


  • Bellevue, Washington, United States Amazon Full time

    About the Role: We are seeking a highly skilled Software Development Engineer to lead the development of our next-generation AI platform. As a key member of our team, you will be responsible for designing, developing, and deploying large-scale distributed machine learning systems. You will collaborate closely with our team of ML scientists and customers to...


  • Bellevue, Washington, United States META Full time

    About the Role: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. We are hiring in multiple...

  • IT Software Engineer

    1 month ago


    Bellevue, United States Sunrise Systems Full time

    Job Title: IT Software Engineer (.Net) Reference ID: - Location: Bellevue, WA Duration: Months Job Type: Contract (Candidates must be able to work on W without VISA sponsorship) This position is % onsite Looking for a minimum of years’ experience. Top must have skills: The following are interrelated and critical to the existing...


  • Bellevue, Washington, United States Amazon Full time

    About the Role: We are seeking a highly skilled Senior Software Development Engineer to join our AGI Finetuning organization. As a key member of our team, you will design, build, and maintain systems for evaluating our best-in-class models. You will work closely with our Applied Scientists to develop tools that support our modeling and evaluation team. Key...


  • Bellevue, Washington, United States Amazon Full time

    About the Role: We are seeking a talented and inventive individual to join the Applied AI team at Amazon and contribute to the design, architecture, and implementation of cutting-edge AI technologies. This role offers a unique opportunity to make a significant impact on the customer experience and work on LLM-based solutions. As a Software Development Engineer...

  • Software Engineer

    2 weeks ago


    Bellevue, United States META Full time

    Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...


  • Bellevue, Washington, United States META Full time

    About the Role: Meta Platforms, Inc. is seeking a highly skilled Software Engineer, Systems to join our team. As a key member of our systems engineering team, you will be responsible for designing, developing, and testing operating systems-level software, compilers, and network distribution software for large distributed components that run applications. Key...


  • Bellevue, Washington, United States Amazon Full time

    About the Role: We are seeking a highly skilled Software Development Engineer to join our team at Amazon. As a key member of our team, you will be responsible for driving innovation and ML engineering to deliver a "best in the world" experience for our customers. Key Responsibilities: As a seasoned software development engineer, you will be responsible for owning...