Member of Technical Staff- Inference

2 months ago


Palo Alto, United States Acceler8 Talent Full time

Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CA


Join a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical Staff, Research Engineer (Inference), you'll play a pivotal role in optimizing and deploying state-of-the-art models for real-world applications.


About the Company

This AI studio, recognized for its groundbreaking work in developing and deploying highly effective language models, is now focused on scaling its technology for enterprise use cases. With a strong foundation in model alignment and fine-tuning, the team is well-funded and equipped with cutting-edge resources, offering a unique environment for those passionate about pushing AI boundaries. Their culture is centered on collaboration, technical excellence, and a pragmatic approach to AI advancements.


About the Role

As a Member of Technical Staff, Research Engineer (Inference), you’ll be involved in optimizing AI models for enterprise deployment, ensuring they perform efficiently under varying conditions. Your work will focus on reducing latency, improving throughput, and maintaining model performance during inference. Engineers in this role should have a deep understanding of the trade-offs in model inference, including balancing hardware constraints with real-time processing demands.


What We Can Offer You:

  • Competitive compensation aligned with your experience and contributions.
  • Unlimited paid time off and flexible parental leave.
  • Comprehensive medical, dental, and vision coverage.
  • Visa sponsorship for qualified hires.
  • Professional growth opportunities through coaching, conferences, and training.


Key Responsibilities:

  • Optimize and deploy large language models (LLMs) for inference across cloud and on-prem environments.
  • Utilize frameworks like ONNX, TensorRT, and TVM to accelerate model performance.
  • Troubleshoot complex issues related to model scaling and performance.
  • Collaborate with cross-functional teams to refine and deploy inference pipelines using PyTorch, Docker, and Kubernetes.
  • Balance competing demands, such as model accuracy and inference speed, in enterprise settings.


If you have experience with LLM inference, model optimization tools, and infrastructure management, this role aligns perfectly with your skills.



  • Palo Alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CAJoin a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical...


  • Palo Alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CAJoin a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical...


  • palo alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CAJoin a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical...


  • palo alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Research Engineer (Inference) - Palo Alto, CAJoin a team at the forefront of AI innovation, where your expertise in model inference can make a tangible impact. This role is ideal for engineers who thrive in a focused, high-tech environment, solving complex challenges related to large-scale AI deployments. As a Member of Technical...


  • Palo Alto, California, United States Acceler8 Talent Full time

    Unlock AI Innovation as a Member of Technical Staff, Research Engineer (Inference)Acceler8 Talent is seeking a highly skilled Research Engineer (Inference) to join our team of AI innovators. As a key member of our technical staff, you will play a pivotal role in optimizing and deploying state-of-the-art models for real-world applications.About the CompanyOur...

  • Research Engineer

    2 weeks ago


    Palo Alto, California, United States Acceler8 Talent Full time

    Unlock AI Innovation as a Research Engineer (Inference)Embark on a challenging role that pushes the boundaries of AI innovation. As a Research Engineer (Inference), you'll be at the forefront of optimizing and deploying large language models for real-world applications. This position is ideal for engineers who thrive in a high-tech environment, solving...


  • Palo Alto, California, United States Inflection AI Full time

    About Inflection AIInflection AI is a public benefit corporation leveraging our world-class large language model to build the first AI platform focused on the needs of the enterprise.We are an organization passionate about what we are building, enjoy working together, and strive to hire people with diverse backgrounds and experience.About the RoleThe...


  • Palo Alto, California, United States Tesla Full time

    About the RoleWe are seeking a highly skilled Compiler Engineer to join our AI Inference team at Tesla. As a key member of our team, you will be responsible for designing and developing the compiler for our AI inference stack, which runs neural networks in millions of Tesla vehicles and Optimus.You will collaborate closely with our AI Engineers and Hardware...


  • Palo Alto, California, United States Snapchat Full time

    Job DescriptionSnap Inc. is a technology company that believes the camera presents the greatest opportunity to improve the way people live and communicate. We're looking for a Staff Software Engineer to join the ML Feature Generation Team at Snap Inc.The team is responsible for building the declarative ML Feature Generation platform at Snap. The platform...


  • Palo Alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Pretraining Software EngineerIntroduction: We are seeking a Member of Technical Staff, Pretraining Software Engineer, who is eager to contribute to the development of our AI models through effective data pretraining techniques. This role offers the opportunity to work on the collection and preparation of data essential for training...


  • Palo Alto, United States Acceler8 Talent Full time

    Member of Technical Staff, Pretraining Software EngineerIntroduction: We are seeking a Member of Technical Staff, Pretraining Software Engineer, who is eager to contribute to the development of our AI models through effective data pretraining techniques. This role offers the opportunity to work on the collection and preparation of data essential for training...


  • Palo Alto, United States Acceler8 Talent Full time

    Member of Technical Staff, ML Infrastructure EngineerIntroduction: We are seeking a Member of Technical Staff, ML Infrastructure Engineer, who is passionate about building and optimizing the infrastructure that supports our machine learning models. This role offers the opportunity to work on cutting-edge technology and ensure the efficient deployment and...

  • AI Research Engineer

    4 weeks ago


    Palo Alto, California, United States Acceler8 Talent Full time

    About the RoleWe are seeking a highly skilled AI Research Engineer to join our team as a Member of Technical Staff, Research Engineer (Inference). As a key member of our team, you will be responsible for optimizing and deploying large language models (LLMs) for inference across cloud and on-prem environments.Key Responsibilities:Optimize LLMs for inference...


  • Palo Alto, California, United States Snapchat Full time

    About the RoleWe are seeking a highly skilled Staff Software Engineer to join our ML Feature Generation Team at Snap Inc. The successful candidate will drive technical direction for the team to accelerate ML iteration speed and improve system performance and efficiency.Key ResponsibilitiesDrive technical direction for the team to accelerate ML iteration...


  • Palo Alto, California, United States Vanguard-IP Full time

    Job SummaryWe are seeking a highly skilled Patent Agent or Technical Advisor to join our team at Vanguard-IP. As a key member of our team, you will be responsible for preparing draft patent applications, drafting responses to communications from the USPTO, and assisting in diligence matters. This role requires excellent academic credentials, strong...


  • Palo Alto, United States Endor Labs Full time

    This role is based out of Palo Alto California - Hybrid. If you are interested in helping to build a large-scale SaaS service at an early-stage company and the list below matches your background, we would love to talk to you! About Us At Endor Labs, we're not just making waves; we're setting the new standard in application security! In our first year, we've...

  • AI Systems Engineer

    4 weeks ago


    Palo Alto, California, United States xAI Full time

    About xAIxAI is a cutting-edge technology company dedicated to developing AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.Our team is comprised of highly motivated and experienced engineers who thrive on curiosity and are driven to deliver exceptional results. We operate with a flat organizational...

  • Kitchen Staff Member

    1 month ago


    Palo Alto, California, United States Habit Burger Grill - University Place Full time

    Job OverviewHabit Burger Grill - Ballard is seeking a skilled Cook / Kitchen Crew Member to join our team. As a key member of our kitchen staff, you will be responsible for preparing and cooking food to order, ensuring high-quality dishes are delivered to our guests.ResponsibilitiesPrepare and cook food to order, maintaining a clean and organized kitchen...

  • Kitchen Staff Member

    2 weeks ago


    Palo Alto, California, United States Habit Burger Grill - University Place Full time

    Job OverviewHabit Burger Grill - Tukwila is seeking a skilled Cook / Kitchen Crew Member to join our team. As a key member of our kitchen staff, you will be responsible for preparing and cooking food to order, ensuring high-quality dishes are delivered to our guests.ResponsibilitiesPrepare and cook food to order, maintaining a clean and organized kitchen...


  • Palo Alto, California, United States Amazon Full time

    We're working to improve shopping on Amazon using the conversational capabilities of large language models, and are searching for pioneers who are passionate about technology, innovation, and customer experience, and are ready to make a lasting impact on the industry.You'll be working with talented scientists, engineers, and technical program managers (TPM)...