Member of Technical Staff - Inference

2 months ago

Palo Alto, United States Acceler8 Talent Full time

Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CA

Join a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical Staff, Research Engineer (Inference), you'll play a pivotal role in optimizing and deploying state-of-the-art models for real-world applications.

About the Company

This AI studio, recognized for its groundbreaking work in developing and deploying highly effective language models, is now focused on scaling its technology for enterprise use cases. With a strong foundation in model alignment and fine-tuning, the team is well-funded and equipped with cutting-edge resources, offering a unique environment for those passionate about pushing AI boundaries. Their culture is centered on collaboration, technical excellence, and a pragmatic approach to AI advancements.

About the Role

As a Member of Technical Staff, Research Engineer (Inference), you’ll be involved in optimizing AI models for enterprise deployment, ensuring they perform efficiently under varying conditions. Your work will focus on reducing latency, improving throughput, and maintaining model performance during inference. Engineers in this role should have a deep understanding of the trade-offs in model inference, including balancing hardware constraints with real-time processing demands.

What We Can Offer You:

Competitive compensation aligned with your experience and contributions.
Unlimited paid time off and flexible parental leave.
Comprehensive medical, dental, and vision coverage.
Visa sponsorship for qualified hires.
Professional growth opportunities through coaching, conferences, and training.

Key Responsibilities:

Optimize and deploy large language models (LLMs) for inference across cloud and on-prem environments.
Utilize frameworks like ONNX, TensorRT, and TVM to accelerate model performance.
Troubleshoot complex issues related to model scaling and performance.
Collaborate with cross-functional teams to refine and deploy inference pipelines using PyTorch, Docker, and Kubernetes.
Balance competing demands, such as model accuracy and inference speed, in enterprise settings.

If you have experience with LLM inference, model optimization tools, and infrastructure management, this role aligns perfectly with your skills.

Member of Technical Staff

3 weeks ago

Palo Alto, United States Acceler8 Talent Full time

Shape the Future of Conversational AI. About Us: We are a public benefit corporation dedicated to harnessing advanced large language models to create an AI platform tailored for enterprise needs, with a particular focus on conversational AI. Our team is composed of friendly, innovative, and collaborative individuals committed to developing impactful AI...
AI Engineer

2 weeks ago

Palo Alto, United States xAI Full time

Job Description. About xAI: xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. Engineers are...
Machine Learning Engineer

2 weeks ago

Palo Alto, United States Acceler8 Talent Full time

Elevate AI Performance: Join Us as a Research Engineer in Model Inference! What We're Building: As we embark on a new phase of growth, our focus is on collaborating with commercial partners to adapt and fine-tune our state-of-the-art AI models for their unique business needs. With a strong track record in developing and deploying cutting-edge models in...
Patent Agent or Technical Advisor

1 month ago

Palo Alto, California, United States Vanguard-IP Full time

Job Summary: We are seeking a highly skilled Patent Agent or Technical Advisor to join our team at Vanguard-IP. As a key member of our team, you will be responsible for preparing draft patent applications, drafting responses to communications from the USPTO, and assisting in diligence matters. This role requires excellent academic credentials, strong...
AI Research Engineer

4 weeks ago

Palo Alto, California, United States xAI Full time

About xAI: xAI is a cutting-edge artificial intelligence company dedicated to creating innovative AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is a tight-knit group of highly motivated and talented individuals who share a passion for engineering excellence. We encourage our engineers to work...
Data/ML Architect

3 weeks ago

Palo Alto, United States Purgo Full time

The Data Architect role offers the successful candidate the opportunity to pioneer the adoption of generative AI in design, development, and migration of data applications. This is a hands-on technical role involving deep collaboration with both Purgo AI’s product/engineering team and its customers/partners. The role drives the maturation and adoption of...
Patent Agent or Technical Advisor Chemistry

4 weeks ago

Palo Alto, United States Vanguard-IP Full time

REQUIREMENTS • Prior patent prosecution experience • Bachelor's degree in chemistry, organic chemistry, medicinal chemistry, biochemistry, or chemical engineering preferred with relevant research experience; or Master's or PhD in a related chemistry field. • Qualified to sit for the patent bar; preferably already licensed to practice before USPTO. •...
Staff Machine Learning Compiler Engineer

3 weeks ago

Palo Alto, United States Rivian Full time

About Rivian: Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate...
Applied AI ML Director, Principal Machine Learning Platform Engineer

1 week ago

Palo Alto, United States JP Morgan Chase Full time

Job Description: We are seeking a highly skilled and innovative AI ML Director, Principal Machine Learning Platforms to join our team within the Corporate AI ML Technology Group. The ideal candidate will have extensive experience in traditional AI, ML infrastructure, ML Platform tools, GenAI, and Machine Learning Platforms. As an Executive Director, Applied...
Software Development Engineer, Search, Innovation Pioneer

1 month ago

Palo Alto, California, United States Amazon Full time

We're seeking a pioneering software development engineer to join our team at Amazon, where we're working to improve shopping using conversational capabilities of large language models. As a member of our dynamic team, you'll work with talented scientists, engineers, and technical program managers to innovate on behalf of our customers. Key...
