Research Engineer Inference Specialist
1 week ago
Unlock AI Innovation as a Member of Technical Staff, Research Engineer (Inference)
Acceler8 Talent is seeking a highly skilled Research Engineer (Inference) to join our team of AI innovators. As a key member of our technical staff, you will play a pivotal role in optimizing and deploying state-of-the-art models for real-world applications.
About the Company
Our AI studio is renowned for its groundbreaking work in developing and deploying highly effective language models. With a strong foundation in model alignment and fine-tuning, we are now focused on scaling our technology for enterprise use cases. Our team is well-funded and equipped with cutting-edge resources, offering a unique environment for those passionate about pushing AI boundaries.
About the Role
As a Research Engineer (Inference), you will be responsible for optimizing AI models for enterprise deployment, ensuring they perform efficiently under varying conditions. Your work will focus on reducing latency, improving throughput, and preserving model accuracy during inference. Engineers in this role should have a deep understanding of the trade-offs in model inference, including balancing hardware constraints with real-time processing demands.
What We Offer
- Competitive compensation aligned with your experience and contributions.
- Unlimited paid time off and flexible parental leave.
- Comprehensive medical, dental, and vision coverage.
- Visa sponsorship for qualified hires.
- Professional growth opportunities through coaching, conferences, and training.
Key Responsibilities
- Optimize and deploy large language models (LLMs) for inference across cloud and on-prem environments.
- Utilize frameworks like ONNX, TensorRT, and TVM to accelerate model performance.
- Troubleshoot complex issues related to model scaling and performance.
- Collaborate with cross-functional teams to refine and deploy inference pipelines using PyTorch, Docker, and Kubernetes.
- Balance competing demands, such as model accuracy and inference speed, in enterprise settings.
If you have experience with LLM inference, model optimization tools, and infrastructure management, this role aligns perfectly with your skills.
-
Research Engineer
3 days ago
Palo Alto, California, United States Acceler8 Talent Full time
Unlock AI Innovation as a Research Engineer (Inference)
Embark on a challenging role that pushes the boundaries of AI innovation. As a Research Engineer (Inference), you'll be at the forefront of optimizing and deploying large language models for real-world applications. This position is ideal for engineers who thrive in a high-tech environment, solving...
-
Research Engineer
4 weeks ago
Palo Alto, California, United States Acceler8 Talent Full time
About the Role
We are seeking a highly skilled Research Engineer to join our team as a Member of Technical Staff, focusing on optimizing and deploying large language models for real-world applications. As a key member of our team, you will be responsible for optimizing AI models for enterprise deployment, ensuring they perform efficiently under varying...
-
AI Research Engineer
3 weeks ago
Palo Alto, California, United States Acceler8 Talent Full time
About the Role
We are seeking a highly skilled AI Research Engineer to join our team as a Member of Technical Staff, Research Engineer (Inference). As a key member of our team, you will be responsible for optimizing and deploying large language models (LLMs) for inference across cloud and on-prem environments.
Key Responsibilities:
Optimize LLMs for inference...
-
Senior Research Engineer
3 weeks ago
Palo Alto, California, United States Luma AI Full time
Job Title: Senior Research Engineer
We are seeking a highly skilled Senior Research Engineer to join our team at Luma AI. As a key member of our research team, you will be responsible for designing, developing, and deploying cutting-edge AI solutions using PyTorch and other deep learning frameworks.
Responsibilities:
Design and implement efficient algorithms...
-
Palo Alto, California, United States Tesla Full time
About the Role
As a Software Engineer within our Autonomy teams at Tesla, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus.
Key Responsibilities
- Write, debug, and maintain robust software for Autopilot and Humanoid robot AI inference (Export/Compiler/Runtime) stack
- Work...
-
Senior Research Engineer
4 weeks ago
Palo Alto, California, United States Luma AI Full time
Job Description
We are seeking a highly skilled Senior Research Engineer to join our team at Luma AI. As a key member of our research team, you will be responsible for designing and implementing cutting-edge AI models and systems.
Key Responsibilities
- Develop and implement efficient models and systems for data processing, training, and deployment.
- Collaborate...
-
Senior AI Research Engineer
3 weeks ago
Palo Alto, California, United States Tykhe Inc Full time
Join Our Team as a Lead Research Scientist/Engineer
We are seeking a highly skilled and experienced Lead Research Scientist/Engineer to join our team at Tykhe Inc in Palo Alto, CA. Our company specializes in building cutting-edge GenAI infrastructure, focusing on Voice/Audio/Speech, Vision, and Multi-modal platforms. If you have expertise in designing,...
-
AI Research Engineer
3 weeks ago
Palo Alto, California, United States Acceler8 Talent Full time
Join Acceler8 Talent as a Founding Machine Learning Research Engineer
We are seeking enthusiastic individuals to join our pioneering team as Founding ML Research Engineers. If you're passionate about advancing AI systems and tackling complex challenges in machine learning, we want to hear from you. This role offers both junior and senior opportunities,...
-
Senior AI Research Engineer
4 weeks ago
Palo Alto, California, United States Tykhe Inc Full time
Unlock the Future of GenAI Infrastructure
At Tykhe Inc, we're pushing the boundaries of GenAI infrastructure, focusing on Voice/Audio/Speech, Vision, and Multi-modal platforms. If you're an expert in designing, developing, training, and fine-tuning state-of-the-art models using cutting-edge technologies and frameworks, we want to hear from you.
Key...
-
Research Scientist
3 weeks ago
Palo Alto, California, United States Acceler8 Talent Full time
Founding Machine Learning Research Engineer
We are seeking talented individuals to join our pioneering team as Founding ML Research Engineers. This role offers both junior and senior opportunities, allowing individuals at different stages of their career to contribute to groundbreaking projects.
Responsibilities:
Develop and evaluate extensive systems...
-
Senior AI Research Scientist
2 weeks ago
Palo Alto, California, United States Tykhe Inc Full time
Join Our Team as a Lead Research Scientist/Engineer
We are seeking a highly skilled and experienced Lead Research Scientist/Engineer to join our team at Tykhe Inc in Palo Alto, CA. Our company specializes in building cutting-edge GenAI infrastructure, focusing on Voice/Audio/Speech, Vision, and Multi-modal platforms. If you have expertise in designing,...
-
AI Research Scientist
3 days ago
Palo Alto, California, United States Acceler8 Talent Full time
Accelerate Your Career as a Founding Machine Learning Research Engineer
We're seeking talented individuals to join our pioneering team as Founding ML Research Engineers. This role offers opportunities for both junior and senior professionals to contribute to groundbreaking projects in AI systems and machine learning.
Key Responsibilities:
Design and develop...
-
AI Research Scientist
4 weeks ago
Palo Alto, California, United States Acceler8 Talent Full time
Unlock the Future of AI with Acceler8 Talent
We are seeking a talented Research Engineer to join our pioneering team and contribute to groundbreaking projects in machine learning. As a key member of our team, you will have the opportunity to work on complex challenges and develop innovative solutions.
Key Responsibilities:
Design and implement extensive systems...
-
EMF/RF Power Engineering Specialist III
4 days ago
Palo Alto, California, United States Electric Power Research Institute (EPRI) Full time
Job Title: EMF/RF Power Engineering Specialist III
The Electric Power Research Institute (EPRI) is seeking a highly skilled EMF/RF Power Engineering Specialist III to join our team. As a key member of our organization, you will play a critical role in shaping the future of energy by conducting research in complex, high-impact technical or scientific...
-
AI Systems Engineer
2 weeks ago
Palo Alto, California, United States xAI Full time
About xAI
xAI is a cutting-edge technology company dedicated to developing AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is comprised of highly motivated and experienced engineers who thrive on curiosity and are driven to deliver exceptional results. We operate with a flat organizational...
-
Palo Alto, California, United States Tesla Full time
Job Title: AI Research Engineer for Model Scaling and Self-Driving
At Tesla, you will have access to unparalleled resources that set us apart from other companies in the AI industry. You will have access to the largest self-driving dataset in the world, providing a unique environment to investigate scaling laws for sequential decision-making problems. Tesla...
-
Data Research Engineer
4 weeks ago
Palo Alto, California, United States Pennsylvania State University Full time
Job Description
We are seeking a highly motivated and experienced Data Research Engineer to join the Algorithms, Prototyping and Integration (API) Department of the Applied Research Laboratory (ARL) at Penn State University.
Job Summary
The successful candidate will assist in providing our customers with state-of-the-art visualization and decision support...
-
Research and Development Engineer
3 weeks ago
Palo Alto, California, United States The Pennsylvania State University Full time
Job Description
We are seeking an undergraduate student researcher to join our Fluids Machinery Department at the Applied Research Laboratory (ARL) at Penn State University.
Responsibilities
- Assist with writing and debugging computer code to analyze turbomachinery and hydrodynamic designs
- Assist in the development of optimization methods for constrained...
-
Systems Research and Development Engineer
2 weeks ago
Palo Alto, California, United States Penn State University Talent Acquisition Full time
Job Description and Position Requirements
We are seeking a highly skilled Systems Research and Development Engineer to join our team at the Applied Research Laboratory (ARL) at Penn State University. The successful candidate will be responsible for providing hands-on leadership of project teams, technical management and leadership of research and development...
-
Data Research Engineer
4 weeks ago
Palo Alto, California, United States Pennsylvania State University Full time
Job Description
We are seeking a highly motivated and experienced Data Research Engineer to join our team at the Applied Research Laboratory (ARL) at Penn State University.
Key Responsibilities
- Assemble large, complex sets of data that meet research requirements
- Build required infrastructure for optimal extraction, transformation, and loading of data from...