Software Engineer, Systems ML
1 week ago
Software Engineer, Systems ML - Frameworks / Compilers / Kernels
Apply to this job
Location pin icon
Bellevue, WA •Menlo Park, CA •New York, NY •Remote, US + 3 more
- Hide
Apply to this job
In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms at Meta. You will be working on one of the core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine learning workloads on the current & next generation of MTIA AI hardware platforms. You will work closely with AI researchers to analyze deep learning models and lower them efficiently on MTIA hardware. You will also partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimization, and performance tuning techniques. You will gain valuable experience in developing machine learning compiler frameworks and will help in driving next generation hardware software codesign for AI domain specific problems.
Software Engineer, Systems ML - Frameworks / Compilers / Kernels Responsibilities
- Development of SW stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware architectures.
- Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance.
- Analyze deep learning networks, develop & implement compiler optimization algorithms.
- Collaborating with AI research scientists to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc.
- Performance tuning and optimizations of deep learning framework & software components.
- Proven C/C++ programming skills
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- Experience in AI framework development or accelerating deep learning models on hardware architectures.
- A Bachelor's degree in Computer Science, Computer Engineering, relevant technical field and 12+ years of experience in AI framework development or accelerating deep learning models on hardware architectures OR a Master's degree in Computer Science, Computer Engineering, relevant technical field and 8+ years of experience in AI framework development or accelerating deep learning models on hardware architectures OR a PhD in Computer Science Computer Engineering, or relevant technical field and 7+ years of experience in AI framework development or accelerating deep learning models on hardware architectures.
- Knowledge of GPU, CPU, or AI hardware accelerator architectures.
- Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, TensorRT
- OR AI high performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm etc.
- OR AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, hardware specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus.
- OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development.
For those who live in or expect to work from California if hired for this position, please click here for additional information.
Start preparing
Learn about how to prepare for your interview with our interview guide, tips, and interactive experiences.
Visit interview prep
Locations
Use Ctrl and scroll to zoom the map
Zoom in
Zoom out
Re-centre
Data Center
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.
$85.10/hour to $251,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity and Affirmative Action
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here .
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com .
-
Software Engineer, Systems ML
1 month ago
Bellevue, United States META Full timeSummary: The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries of ML technologies while fostering a vibrant, global community of developers and researchers. Our team combines cutting-edge ML engineering with community-driven initiatives to...
-
Software Engineer, Systems ML
1 week ago
Bellevue, United States META Full timeSoftware Engineer, Systems ML - PyTorch Performance and Engagement Apply to this job Location pin icon Bellevue, WA •Menlo Park, CA •New York, NY + 2 more - Hide Apply to this job The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries...
-
Software Engineer, Systems ML
1 week ago
Bellevue, United States META Full timeSoftware Engineer, Systems ML - PyTorch Performance and Engagement Apply to this job Location pin icon Bellevue, WA •Menlo Park, CA •New York, NY + 2 more - Hide Apply to this job The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries...
-
Software Engineer, Systems ML
1 week ago
Bellevue, United States META Full timeSoftware Engineer, Systems ML - PyTorch Performance and Engagement Apply to this job Location pin icon Bellevue, WA •Menlo Park, CA •New York, NY + 2 more - Hide Apply to this job The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries...
-
Software Engineer, Systems ML
5 days ago
Bellevue, United States META Full timeSoftware Engineer, Systems ML - PyTorch Performance and Engagement Apply to this job Location pin icon Bellevue, WA •Menlo Park, CA •New York, NY + 2 more - Hide Apply to this job The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries...
-
Software Engineer, Systems ML
19 hours ago
Bellevue, United States Facebook Full timeSummary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. Some aspects of this role as...
-
Software Engineer, Systems ML
4 months ago
Bellevue, United States META Full timeSummary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web.Some aspects of this role as...
-
Software Engineer, Systems ML
4 weeks ago
Bellevue, United States META Full timeSummary: The PyTorch Vanguard team is at the forefront of machine learning innovation, community engagement, and open-source development. We are dedicated to pushing the boundaries of ML technologies while fostering a vibrant, global community of developers and researchers. Our team combines cutting-edge ML engineering with community-driven initiatives to...
-
Software Engineer, Systems ML
1 week ago
Bellevue, United States META Full timeSoftware Engineer, Systems ML - HPC Specialist Apply to this job Location pin icon Bellevue, WA •Menlo Park, CA •Remote, US + 2 more - Hide Apply to this job Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position...
-
Software Engineer, Systems ML
4 weeks ago
Bellevue, United States META Full timeSummary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. We are hiring in multiple...
-
Software Engineer, Systems ML
3 weeks ago
Bellevue, United States META Full timeSummary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined...
-
Software Engineer II
2 months ago
Bellevue, United States Belva.ai Full timeJob DescriptionJob DescriptionAt Belva, we are seeking a talented and experienced Software Engineer II to join our team. We’re a trailblazing A.I. Telecommunications company, searching for an individual who can take code ownership and help lead the charge in AI / ML solutions that make an impact in the lives of millions.Role and Responsibilities:We are...
-
Senior AI/ML Solutions Architect
7 days ago
Bellevue, Washington, United States Belva Full timeWe are seeking a highly experienced and skilled AI/ML Engineer to join our backend python team at Belva.Job OverviewAs an AI/ML Engineer, you will work closely with our ML and Data Engineers to turn Machine Learning models and data pipelines into robust software applications.Key ResponsibilitiesDesign and develop core AI/ML solutions that drive business...
-
Software Engineer, SystemML
1 month ago
Bellevue, United States Meta Inc Full timeSummary: In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the software stack around NCCL (NVIDIA Collective Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and...
-
Software Systems Engineer
5 days ago
Bellevue, Nebraska, United States Northrop Grumman Full timeAbout the Role:Northrop Grumman is seeking an experienced Software Systems Engineer to join our team in Bellevue, NE.Key Responsibilities:Participate in the entire software development lifecycle with a focus on software engineering.Collaborate with software designers and engineers in the planning, design, development, and utilization of software...
-
IT Software Engineer
2 months ago
Bellevue, United States Sunrise Systems Full timeJob Title: IT Software Engineer (.Net) Reference ID: - Location: Bellevue, WA Duration: Months Job Type: Contract (Candidates must be able to work on W without VISA sponsorship) This position is % onsite Looking for a minimum of years’ experience. Top must have skills: The following are interrelated and critical to the existing...
-
Bellevue, Washington, United States Amazon Full timeJob Summary:We are seeking a highly skilled Data Scientist Leader to drive the development of large-scale AI/ML solutions. The successful candidate will have a strong background in machine learning, deep learning, and software engineering.The ideal candidate will have a PhD in Computer Science, Engineering, or a related field, with 5+ years of experience in...
-
Software Engineer, Systems
1 month ago
Bellevue, United States META Full timeSummary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...
-
Software Engineer, Systems
3 weeks ago
Bellevue, United States META Full timeSummary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D...
-
Bellevue, Washington, United States META Full timeSummary:META is seeking a talented AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. This position will involve applying relevant AI infrastructure and hardware acceleration techniques to build and optimize intelligent ML systems that improve META's...