Software Engineer, Systems ML
22 hours ago
Summary:
In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms at Meta.You will be working on one of the core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine learning workloads on the current & next generation of MTIA AI hardware platforms. You will work closely with AI researchers to analyze deep learning models and lower them efficiently on MTIA hardware. You will also partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimization, and performance tuning techniques. You will gain valuable experience in developing machine learning compiler frameworks and will help in driving next generation hardware software codesign for AI domain specific problems.
Required Skills:
Software Engineer, Systems ML - Frameworks / Compilers / Kernels Responsibilities:
-
Development of SW stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware architectures
-
Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance
-
Analyze deep learning networks, develop & implement compiler optimization algorithms
-
Collaborating with AI research scientists to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc
-
Performance tuning and optimizations of deep learning framework & software components
Minimum Qualifications:
Minimum Qualifications:
-
Proven C/C++ programming skills
-
Experience in AI framework development or accelerating deep learning models on hardware architectures
-
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
Preferred Qualifications:
Preferred Qualifications:
-
OR AI Compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, hardware specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, Halide is a plus.
-
OR AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, compute utilization analysis and associated tooling development.
-
OR AI high performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm etc.
-
A Bachelor's degree in Computer Science, Computer Engineering, relevant technical field and 7+ years of experience in AI framework development or accelerating deep learning models on hardware architectures OR a Master's degree in Computer Science, Computer Engineering, relevant technical field and 4+ years of experience in AI framework development or accelerating deep learning models on hardware architectures OR a PhD in Computer Science Computer Engineering, or relevant technical field and 3+ years of experience in AI framework development or accelerating deep learning models on hardware architectures.
-
Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, TensorRT
-
Knowledge of GPU, CPU, or AI hardware accelerator architectures.
Public Compensation:
$70.67/hour to $208,000/year + bonus + equity + benefits
Industry: Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.
-
Senior Staff Machine Learning Engineer
1 week ago
Jefferson City, MO, United States Coinbase Full timeReady to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system. To achieve our mission, we’re seeking a very...
-
Software Engineer Lead
7 days ago
Jefferson City, MO, United States Ensono Full timeSoftware Engineer LeadRemote - United StatesJR012408 At Ensono, our Purpose is to be a relentless ally, disrupting the status quo and unleashing our clients to Do Great Things! We enable our clients to achieve key business outcomes that reshape how our world runs. As an expert technology adviser and managed service provider with cross-platform...
-
Software Engineer Lead
1 week ago
Jefferson City, MO, United States Ensono Full timeSoftware Engineer LeadRemote - United StatesJR012408 At Ensono, our Purpose is to be a relentless ally, disrupting the status quo and unleashing our clients to Do Great Things! We enable our clients to achieve key business outcomes that reshape how our world runs. As an expert technology adviser and managed service provider with cross-platform...
-
Software Engineer Lead
19 hours ago
Jefferson City, MO, United States Ensono Full timeSoftware Engineer LeadRemote - United StatesJR012408 At Ensono, our Purpose is to be a relentless ally, disrupting the status quo and unleashing our clients to Do Great Things! We enable our clients to achieve key business outcomes that reshape how our world runs. As an expert technology adviser and managed service provider with cross-platform...
-
Senior Software Engineer
5 days ago
Jefferson City, MO, United States Oracle Full timeJob Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising...
-
Jefferson City, MO, United States Coinbase Full timeReady to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system. To achieve our mission, we’re seeking a very...
-
Senior Software Delivery Engineer
5 days ago
Jefferson City, MO, United States CVS Health Full timeAt CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care. As the nation's leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues - caring for...
-
Principal Sales Engineer
1 week ago
Jefferson City, MO, United States Rocket Software Full timeIt's fun to work in a company where people truly BELIEVE in what they're doing! Job Description Summary: We're looking for a Principal Sales Engineer who is not only passionate about technology but thrives on engaging with customers to solve complex challenges. This role centers on migrating mainframe workloads to cloud environments, where your technical...
-
Senior Principal Software Engineer
2 weeks ago
Jefferson City, MO, United States Oracle Full timeJob Description Are you ready to embark on a journey that will revolutionize healthcare and improve patient outcomes on a global scale? The Oracle Health division is leading the charge in healthcare innovation, and we are seeking an exceptional Senior Principal Software Engineer to join our Healthcare Agents Engineering team , where you will be at the heart...
-
Software Developer 5
7 days ago
Jefferson City, MO, United States Oracle Full timeJob Description Oracle Cloud Infrastructure (OCI) is Oracle's next-generation cloud platform, engineered to handle the most demanding enterprise workloads. Within OCI, the AI Platform organization is building a comprehensive cloud service to support the full lifecycle of AI and machine learning - from GPU infrastructure and training pipelines to model...