Software Engineer, Systems ML
3 months ago
Summary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the larger, industry-leading PyTorch AI framework organization. The MTIA Software team has been developing a comprehensive AI compiler strategy that delivers a highly flexible platform to train and serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware directly impact the performance and deployment velocity of both AI training and inference platforms at Meta.
You will work on one of the core areas, such as PyTorch framework components, AI compiler and runtime, high-performance kernels, and tooling to accelerate machine learning workloads on the current and next generation of MTIA AI hardware platforms. You will work closely with AI researchers to analyze deep learning models and lower them efficiently on MTIA hardware. You will also partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimizations, and performance tuning techniques. You will gain valuable experience in developing machine learning compiler frameworks and will help drive next-generation hardware-software co-design for AI domain-specific problems.
Required Skills: Software Engineer, Systems ML - Frameworks / Compilers / Kernels
Responsibilities:
Develop the SW stack with one of the following core focus areas: AI frameworks, compiler stack, or high-performance kernel development and acceleration on the next generation of hardware architectures.
Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state-of-the-art inference and training AI hardware accelerators and optimize their performance.
Analyze deep learning networks, and develop and implement compiler optimization algorithms.
Collaborate with AI research scientists to accelerate the next generation of deep learning models, such as recommendation systems, generative AI, computer vision, and NLP.
Tune and optimize the performance of deep learning framework and software components.
Minimum Qualifications:
Proven C/C++ programming skills.
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
Experience in AI framework development or accelerating deep learning models on hardware architectures.
Preferred Qualifications:
A Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field and 12+ years of experience in AI framework development or accelerating deep learning models on hardware architectures, OR a Master's degree in Computer Science, Computer Engineering, or relevant technical field and 8+ years of such experience, OR a PhD in Computer Science, Computer Engineering, or relevant technical field and 7+ years of such experience.
Knowledge of GPU, CPU, or AI hardware accelerator architectures.
Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, TensorRT.
OR AI high performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming, or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm, etc.
OR AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, and hardware-specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, or Halide is a plus.
OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis, and associated tooling development.
Public Compensation: $85.10/hour to $251,000/year + bonus + equity + benefits
Industry: Internet
Equal Opportunity: Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment. Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.
-
Software Engineer, Systems ML
2 weeks ago
Menlo Park, California, United States META Full time Job Summary: The PyTorch Compiler team at Meta is dedicated to making PyTorch run faster and more resource-efficient without sacrificing its flexibility and ease of use. We are seeking a highly skilled Software Engineer, Systems ML to join our team and contribute to the development of the PT2 compiler. The ideal candidate will have a strong background in...
-
Software Engineer, Systems ML
7 days ago
Menlo Park, United States META Full time Summary: The PyTorch Compiler team is dedicated to making PyTorch run faster and more resource-efficient without sacrificing its flexibility and ease of use. The team is the driving force behind PT2, a step function change in PyTorch’s history that brought compiler technologies to the core of PyTorch. PT2 technologies have gained industry-wide recognition...
-
Senior Software/ML(Machine Learning) Engineer
3 weeks ago
Menlo Park, United States Pyramid Consulting, Inc Full time Immediate need for a talented Senior Software/ML(Machine Learning) Engineer. This is a 12+ months contract (Possible Extension) opportunity with long-term potential and is located in Menlo Park, CA (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID: 24-43529 Pay Range: $75 - $80/hour. Employee benefits include,...
-
Senior Software/ML(Machine Learning) Engineer
1 month ago
Menlo Park, United States Pyramid Consulting, Inc Full time Immediate need for a talented Senior Software/ML(Machine Learning) Engineer. This is a 12+ months contract (Possible Extension) opportunity with long-term potential and is located in Menlo Park, CA (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID: 24-43529 Pay Range: $75 - $80/hour. Employee benefits include,...
-
Principal Software Engineer
3 weeks ago
Menlo, Georgia, United States Quicken Full time Job Title: Principal Software Engineer - AI/ML Solutions. Quicken is a leading provider of personal finance management software, committed to helping individuals achieve financial stability. We're seeking an experienced Principal Software Engineer to lead the development of AI-driven capabilities within our products. Responsibilities: Architect and develop...
-
Senior Software/ML(Machine Learning) Engineer
3 weeks ago
Menlo, United States Pyramid Consulting, Inc Full time Immediate need for a talented Senior Software/ML(Machine Learning) Engineer. This is a 12+ months contract (Possible Extension) opportunity with long-term potential and is located in Menlo Park, CA (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID: 24-43529 Pay Range: $75 - $80/hour. Employee benefits include,...
-
Senior Software/ML(Machine Learning) Engineer
4 weeks ago
Menlo, United States Pyramid Consulting, Inc Full time Immediate need for a talented Senior Software/ML(Machine Learning) Engineer. This is a 12+ months contract (Possible Extension) opportunity with long-term potential and is located in Menlo Park, CA (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID: 24-43529 Pay Range: $75 - $80/hour. Employee benefits include,...
-
Software Engineer
3 months ago
Menlo Park, United States Diffuse Bio Full time The role: Design, build, and iterate on research infrastructure in close collaboration with research engineers. Build tools to automate and maintain computing clusters and data parsing pipelines. Design and build software and APIs that enable internal and external access to our AI systems. Ideal background: Adaptability and openness to work on multiple...
-
Senior Software Engineer
3 weeks ago
Menlo, Georgia, United States Pyramid Consulting, Inc Full time Job Title: Senior Software/ML Engineer. Job Summary: We are seeking a highly skilled Senior Software/ML Engineer to join our team at Pyramid Consulting, Inc. in Menlo Park, CA. As a key member of our Silicon team, you will be responsible for developing optimized software in an embedded environment for vector machines, building an optimization flow or compiler...
-
Software Engineering Manager, AI Networking
3 months ago
Menlo Park, United States META Full time Summary: In this role, you will be a member of the Network AI Software team and part of the larger DC networking organization. The team develops and owns the software stack around collective communication libraries at Meta. At a high level, the team aims to enable Meta-wide ML products and innovations to leverage our large-scale training and inference...
-
Manager, Software Engineering, MTIA Software
4 days ago
Menlo Park, United States Meta Inc Full time Summary: The MTIA (Meta Training & Inference Accelerator) Software team is part of the AI Infra PyTorch org. The team’s mission is to explore, develop and help productize high-performance software and hardware technologies for AI at datacenter scale. The team co-optimizes both SW (e.g., algorithms and numerics) and HW (e.g., platform and network) to come up...
-
Research Scientist, Systems ML and HPC
3 weeks ago
Menlo Park, California, United States META Full time Job Description: Meta is seeking a highly skilled Research Scientist to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics and a strong background in Systems ML and HPC. Key Responsibilities: Apply High-Performance Computing (HPC) algorithms and techniques to optimize large-scale AI...
-
Software Engineer
3 weeks ago
Menlo Park, California, United States Meta Full time Meta AI Software Engineer. We are seeking a highly skilled AI Software Engineer to join our Research & Development teams at Meta. As a key member of our team, you will be responsible for developing and applying AI and machine learning techniques to build intelligent language systems that improve our products and experiences. Responsibilities: Apply relevant AI...
-
Technical Program Manager, ML
7 days ago
Menlo Park, United States META Full time Summary: The Meta Technical Program Management (TPM) community is pioneering technologies to bring people (and businesses) closer together at a global scale. TPMs work at the cross-section between technical execution and business strategy and are expected to partner closely with Engineering and Product teams. Being a TPM at Meta means driving impact by...
-
Software Engineering Manager, AI Networking
3 weeks ago
Menlo Park, California, United States META Full time Job Summary: In this role, you will be a key member of the Network AI Software team, part of the larger DC networking organization at Meta. The team is responsible for developing and owning the software stack around collective communication libraries. The team's primary goal is to enable Meta-wide ML products and innovations to leverage our large-scale...
-
Software Engineer IV
3 weeks ago
Menlo Park, California, United States BCforward Full time Job Title: Software Engineer IV. BCforward is seeking a highly motivated Software Engineer IV for a Remote opportunity. The ideal candidate will have industry experience working on a range of recommendation, classification, and optimization problems. You will bring the ability to own the whole ML life cycle, define projects and drive excellence across...
-
Software Engineering Manager, AI Compiler
3 weeks ago
Menlo Park, California, United States META Full time Job Summary: The Meta AI Compiler Software team is seeking a Software Engineering Manager to lead the development and optimization of compiler toolchains for Meta's production DL/ML workloads on the MTIA AI accelerator hardware. The ideal candidate will have experience with compiler architecture, development, and management, as well as a strong understanding...
-
Software Engineer IV
2 weeks ago
Menlo Park, California, United States BCforward Full time About the Role: We are seeking a highly motivated Software Engineer IV to join our team at BCforward. As a key member of our engineering team, you will be responsible for designing, developing, and deploying large-scale software applications. Key Responsibilities: Adapt standard machine learning methods leveraging modern parallel environments (e.g. distributed...
-
Software Engineering Manager, AI Compiler
3 months ago
Menlo Park, United States META Full time Summary: The MTIA (Meta Training & Inference Accelerator) Software team has been developing a comprehensive AI Compiler strategy and optimizing compiler toolchains. This enables training and inference of Meta’s production DL/ML workloads on the specialized MTIA AI accelerator hardware in a highly performant and flexible way. We are looking for a Software...
-
Menlo Park, United States Meta Inc Full time Summary: In this role, you will be a member of the Backbone Software team at Meta. As part of this team, you will develop and own mission-critical software systems that control Petabit/s data traveling through, and millions of operations performed in, Meta's mission-critical Backbone network - one of the largest in the world. As a Software Engineer within...