AIML - Machine Learning Engineer, Foundation Model Services
1 week ago
Do you feel you think differently, you are eager to break status quo, are bold and ambitious, aren't afraid to take risks and are passionate to build the best of class technology. If yes, what better place to be at and do this than Apple? At Apple, "we think different, we push the boundaries of computing and intelligence. We build products that bring smile to people's face". Foundation Model Infrastructure team, within Machine Learning Platform Technologies organization is the back-bone of Apple Intelligence. It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring Intelligence to billions of users across the world. You will have an opportunity to make a difference in life of people. You will have a chance to work on optimizing billions of parameter languge and vision and speech models using state of the art technologies and make it run at scale of Apple.
Description
Work along side Foundation Model Research team to optimize inference for cutting edge model architectures. Work closely with product teams to build Production grade solutions to launch models serving millions of customers in real time. Build tools to understand bottlenecks in Inference for different hardwares and use cases. Mentor and guide engineers in the organization.
Minimum Qualifications
- 5+ years of experience leading and driving complex, ambiguous projects.
- Have experience with high throughput services particularly at supercomputing scale.
- Proficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker etc.
- Familiar with GPU programming concepts using CUDA.
- Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.
Preferred Qualifications
- Proficient in building and maintaining systems written in modern languages (eg: Golang, python)
- Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
- Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc.
- Experience writing custom CUDA kernels using CUDA or OpenAI Triton.
Pay & Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .
Submit Resume
-
Santa Clara, California, United States Apple Full time $147,400 - $272,100 per yearWe are seeking a highly experienced Machine Learning Engineer to build, deploy, and optimize Large Language Model (LLM)-based applications, with a strong emphasis on MLOps/LLMOps (LLM operations) and scalable production systems. At Apple, we believe in creating technology that enriches lives and empowers creativity. You'll play a pivotal role in developing...
-
AIML Resident
1 week ago
Santa Clara, California, United States Apple Full time $120,000 - $180,000 per yearImagine what you could do here At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Combining groundbreaking machine learning research with next-generation hardware, our teams take user experiences to the next level.DescriptionApple's AIML Residency is a year-long program inviting experts in various...
-
AIML - Sr Machine Learning Mgr, Health AIML
1 week ago
Santa Clara, California, United States Apple Full timeDo you get excited by driving product impact via measurement and evaluation, for products and services used by hundreds of millions of people globally?The vision for the AIML Health organization is to improve Apple products by using data as the voice of our customers. Within this organization the mission of the Search Analytics team is to inform product...
-
AIML Resident
3 days ago
Santa Clara, California, United States Apple Full timeImagine what you could do here At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Combining groundbreaking machine learning research with next-generation hardware, our teams take user experiences to the next level. Description Apple's AIML Residency is a year-long program inviting experts in...
-
Machine Learning Engineer
5 days ago
Santa Clara, California, United States Autonomous Healthcare Full time $120,000 - $180,000 per yearAbout Autonomous HealthcareAt Autonomous Healthcare, we are at the forefront of medical innovation, developing the next generation of devices that will revolutionize patient care. Our mission is to commercialize breakthrough medical technologies by leveraging cutting-edge AI and autonomous systems. We believe that the best solutions are built together, and...
-
Santa Clara, California, United States Apple Full timeDo you want to make Siri and Apple products smarter for our users? The Answers, Knowledge & Information team is redefining how hundreds of millions of people use their devices to get information. We are an Applied ML team pushing the limits of apple intelligence, assistant response ranking, and search technologies, while also responsible for a production...
-
Santa Clara, California, United States Apple Full time $150,000 - $250,000 per yearDo you want to make Siri and Apple products smarter for our users? The Siri and Information Intelligence team is redefining how hundreds of millions of people use their devices to get information. We are an Applied ML team pushing the limits on real-time augmented information retrieval and generation, information safety and search technologies, while also...
-
Senior Machine Learning Engineer, Perception
1 week ago
Santa Clara, California, United States Plus Full time $120,000 - $180,000 per yearWe are seeking a highly skilled Machine Learning Engineer with deep expertise in developing Bird's Eye View (BEV) fusion models using multimodal sensor inputs, particularly LiDAR. You will play a central role in designing scalable perception algorithms that integrate data from camera, LiDAR, and radar sensors to support autonomous driving and 3D scene...
-
Santa Clara, California, United States Apple Full time $181,100 - $318,400 per yearThe Answers, Knowledge & Information team is revolutionizing the way hundreds of millions of people access information on their devices, all while keeping user privacy at the forefront. As an Applied ML team, we're pushing the boundaries of Apple Intelligence, result ranking, and innovative search technologies, all while running a low latency production...
-
Principal Machine Learning Engineer
7 days ago
Santa Clara, California, United States ServiceNow Full time $200,000 - $400,000 per yearCompany DescriptionIt all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based...