Inference Platform Engineer
3 weeks ago
We are XPeng Motors, a leading smart electric vehicle company committed to in-house R&D and intelligent manufacturing. Our mission is to create a better mobility experience for our customers by transforming smart electric vehicles with technology and data.
Job Overview
We're seeking an exceptional Inference Platform Engineer to join our team and make a significant impact on the transportation revolution through advancements in autonomous driving. This full-time position offers a dynamic, supportive, and engaging work environment where creativity thrives.
Responsibilities
- Design, implement, and operate components of our novel model inference platform, including quota management, job scheduling, and queuing systems.
- Identify performance bottlenecks and optimization opportunities to ensure the reliability of the distributed inference infrastructure.
- Work closely with Machine Learning Engineers to evolve the inference platform to support their use cases.
- Monitor system health, diagnose and troubleshoot issues, and perform routine maintenance tasks.
Qualifications
- Advanced degree (MS or PhD) in Computer Science or a related field.
- 5+ years of industry or research experience in ML infrastructure or model inference.
- Expertise in programming languages like Python/Java/C++ and experience with distributed computing frameworks.
- Experience with high-throughput, fault-tolerant system design.
- Proficient in Docker and Kubernetes.
Compensation and Benefits
- A competitive compensation package, including a salary range of $180,000-$300,000, bonus, equity, and benefits.
- Perks include snacks, lunches, and organized fun activities.
We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status, or marital status.
-
Staff Data Scientist
22 hours ago
Santa Clara, California, United States XPENG Motors Full time
**About XPENG Motors' Mission:** Our mission is to transform the future of mobility by leveraging advanced technologies like AI, Internet, and autonomous driving to create seamless and safe EV experiences for our customers. We strive to push the boundaries of innovation, working closely with top industry talent and incorporating cutting-edge technologies into...
-
Inference Engine Developer
4 days ago
Santa Clara, California, United States Predibase Full time
Predibase is looking for a talented inference engine developer to work on our ML Inference team. As an engineer on this team, you will be responsible for developing and integrating new LLM inference techniques into our next-generation serving systems. This involves collaborating closely with customers to understand their performance requirements and working...
-
AI Engineer for Large-Scale Inference
2 days ago
Santa Clara, California, United States XPENG Motors Full time
**About XPENG Motors:** XPENG Motors is a leading smart electric vehicle company that designs, develops, manufactures, and markets innovative EVs seamlessly integrated with advanced Internet, AI, and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. Our goal...
-
AI Inference Architect
3 weeks ago
Santa Clara, California, United States d-Matrix Full time
Unlock Efficient AI Inference with d-Matrix: d-Matrix has revolutionized memory-compute integration with our pioneering digital in-memory compute (DIMC) engine, breaking the 'memory wall' to minimize data movements. This breakthrough enables us to accelerate Large Language Models at scale. We've secured significant funding, with $154M raised in our Series B...
-
AI Software Engineer
3 days ago
Santa Clara, California, United States Apple Full time
About the Role: We are looking for a skilled Software Engineer to join our Foundation Model Batch Inference team. As a key member of this group, you will design and build innovative large scale batch inference solutions that power billions of foundation model inference queries across Apple products. Key Responsibilities: Build scalable and efficient systems...
-
AI Inference Algorithm Designer
5 days ago
Santa Clara, California, United States d-Matrix Full time
d-Matrix has revolutionized memory-compute integration with its digital in-memory compute (DIMC) engine. Breaking through the memory wall to minimize data movements has been a long-standing challenge in AI computing. d-Matrix has achieved this breakthrough with its first-of-its-kind DIMC engine. The company is poised to advance Large Language Models to scale...
-
AI Engineering Manager: Distributed Systems
5 days ago
Santa Clara, California, United States XPENG Motors Full time
About the Role: This is an exciting opportunity to join our team as a Senior Infrastructure Architect, where you will play a critical role in designing, implementing, and operating components of our novel model inference platform. Your expertise in programming languages like Python/Java/C++ and experience with distributed computing frameworks will be...
-
Senior Software Engineer
4 weeks ago
Santa Clara, California, United States XPENG Motors Full time
Xpeng Motors is a leading Chinese smart electric vehicle company that integrates advanced Internet, AI, and autonomous driving technologies into its vehicles. The company's commitment to in-house R&D and intelligent manufacturing enables it to create a better mobility experience for its customers. Job Overview: The successful candidate will play a key role in...
-
Optimizing AI Performance Engineer
4 days ago
Santa Clara, California, United States Acceler8 Talent Full time
**Acceler8 Talent**: We're pushing the boundaries of on-device AI by optimizing foundation models for efficiency and scalability. We're building a high-impact team to drive innovation in our open-source inference frameworks. As an Inference Performance Engineer, you'll work on challenging projects to improve performance and quality. Identify and resolve...
-
Senior Machine Learning Infrastructure Developer
19 hours ago
Santa Clara, California, United States XPENG Motors Full time
**Job Description:** Xpeng Motors is seeking a Staff AI Infrastructure Engineer to join our team and contribute to the development of our novel model inference platform. As a member of our team, you will be responsible for designing, implementing, and operating key components of the platform, including quota management, job scheduling, and queuing systems. You...
-
Platform Engineering Director
3 weeks ago
Santa Clara, California, United States Palo Alto Networks Full time
About the Role: We're seeking an exceptional individual to fill the role of Platform Engineering Director. As a key member of our infrastructure team, you will be responsible for leading and managing a team responsible for designing, building, and maintaining our infrastructure platform. The ideal candidate will have a strong background in cloud...
-
AI Platform Software Engineer Santa Clara
6 days ago
Santa Clara, California, United States Tbwa ChiatDay Inc Full time
Celestial AI's Photonic Fabric Revolutionizes Data Center Infrastructure: As Generative AI continues to advance, the performance drivers for data center infrastructure are shifting from systems-on-chip (SoCs) to systems of chips. In the era of Accelerated Computing, data center bottlenecks are no longer limited to compute performance, but rather the system's...
-
Platform Engineering Lead
3 weeks ago
Santa Clara, California, United States ServiceNow Full time
About This Role: We are seeking a highly skilled Staff Software Engineer Core Platform to join our team. As a key member of our engineering organization, you will play a critical role in designing, developing, and maintaining high-quality software solutions. Job Responsibilities: As a Staff Software Engineer Core Platform, your primary responsibilities will...
-
AI Systems Engineer
5 days ago
Santa Clara, California, United States Collabera Full time
We are seeking a highly motivated AI Systems Engineer to join our dynamic team. In this role, you will bridge the gap between development and operations in AI-focused projects, ensuring the seamless deployment, scalability, and reliability of AI and machine learning applications. The ideal candidate will have a solid understanding of computer algorithms, AI...
-
Cloud Platform Engineering Lead
3 weeks ago
Santa Clara, California, United States Palo Alto Networks Full time
We're seeking an experienced Cloud Platform Engineering Lead to lead and manage a team responsible for designing, building, and maintaining our cloud infrastructure platform. Job Description: This role will involve driving the design, implementation, and maintenance of cloud-based infrastructure platforms (e.g., AWS, Azure, GCP) and on-prem solutions to support...
-
Platform Performance Engineer
2 days ago
Santa Clara, California, United States Apple Full time
We are seeking a highly motivated system analysis engineer with excellent analytical skills to join our Platform Architecture group. In this role, you will work on building technologies that connect our hardware and software into one unified system. You will collaborate with engineers across Apple to deep dive into hardware and software technologies, uncover...
-
Senior AI Infrastructure Engineer
3 days ago
Santa Clara, California, United States NIO Full time
About NIO: NIO is a leading innovator in the premium smart electric vehicle market. Founded in 2014, our mission is to create a community that shares joy and grows together with users. We design, develop, manufacture, and sell premium smart electric vehicles, driving innovations in next-generation technologies like autonomous driving, digital solutions,...
-
Artificial Intelligence Platform Engineer
3 weeks ago
Santa Clara, California, United States Cloud Analytics Technologies, LLC Full time
About the Role: We are seeking an experienced Artificial Intelligence Platform Engineer to join our team at Cloud Analytics Technologies, LLC. As a key member of our engineering team, you will be responsible for designing, developing, and deploying cutting-edge AI platforms and solutions. Responsibilities: Your primary responsibilities will include: Designing and...
-
Staff AI Infrastructure Specialist
3 weeks ago
Santa Clara, California, United States XPENG Motors Full time
Unlocking the Future of Mobility: Xpeng Motors is one of China's leading smart electric vehicle companies, dedicated to designing, developing, manufacturing, and marketing smart EVs that seamlessly integrate advanced Internet, AI, and autonomous driving technologies. About the Role: We're looking for an experienced AI Infrastructure Developer to join our team and...
-
Software Engineer, Platform Architect
3 weeks ago
Santa Clara, California, United States Apple Full time
**About the Role** We are seeking a talented Senior Engineer to join our Tools & Technology team at Apple. As a key member of this team, you will be responsible for developing and maintaining our proprietary cross-platform rendering engine. This is an exceptional opportunity to work on cutting-edge technology and contribute to shaping the direction of our...