Research Engineer

2 days ago


Palo Alto, California, United States Inflection AI Full time
About the Role

We are seeking a highly skilled Research Engineer to join our Inference team at Inflection AI. As a key member of our team, you will be responsible for optimizing model inference processes, reducing latency, and improving throughput without compromising model performance.

Key Responsibilities:

  • Deploy and optimize Large Language Models (LLMs) for inference in cloud and on-prem environments
  • Use tools and frameworks for model optimization and acceleration, such as ONNX, TensorRT, or TVM
  • Troubleshoot and solve complex problems related to model performance and scaling
  • Have a deep understanding of the trade-offs involved in model inference, including hardware constraints and real-time processing requirements
  • Be proficient with PyTorch and familiar with infrastructure management tools like Docker and Kubernetes for deploying inference pipelines

What We Offer:

  • Competitive salary range of $175,000 - $325,000 depending on experience
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Generous medical, dental, and vision plans for US employees
  • Visa sponsorship for new hires
  • Avenues for personal growth such as coaching, conference attendance, or specific trainings

Our Culture:

We value excellence and ownership, teamwork and generosity, constructive disagreement, and feedback. We believe in scale as the engine of progress in AI and are building one of the largest supercomputers in the world to develop and deploy the new generation of AIs.

Why Inflection AI?

We are a vertically integrated AI studio, building one of the most advanced large language models in the world. We wear multiple hats and don't distinguish between engineering and research. We continuously explore and exploit, creating new and perfecting existing techniques and solutions. User feedback is our North Star.



  • Palo Alto, California, United States RI Research Instruments GmbH Full time

    RI Research Instruments GmbH is dedicated to pioneering multimodal AI technologies that enhance human creativity and capabilities. We recognize that true intelligence requires a multimodal approach. Our focus is on advancing beyond traditional language models to develop systems that can perceive, comprehend, and interact with the world around us. We are in...


  • Palo Alto, California, United States RI Research Instruments GmbH Full time

    RI Research Instruments GmbH is dedicated to advancing multimodal artificial intelligence to enhance human creativity and capabilities. We recognize that multimodality is essential for true intelligence. Our goal is to transcend traditional language models by integrating vision into our systems. We are focused on developing and scaling multimodal foundation...


  • Palo Alto, California, United States Electric Power Research Institute Full time

    Job SummaryThis is a challenging role that requires a highly skilled and experienced professional to conduct research in complex technical or scientific fields. The successful candidate will be responsible for leading technical activities, providing guidance to junior staff, and communicating complex strategies and results to subject matter experts and...


  • Palo Alto, California, United States PsiQuantum Full time

    Position: Photonics Research Engineer at PsiQuantumAt PsiQuantum, we are at the forefront of quantum computing innovation, committed to transforming industries globally. We are currently in search of a skilled Photonics Research Engineer to become an integral part of our innovative team.Role Overview:Develop, execute, and refine advanced experimental...


  • Palo Alto, California, United States The Pennsylvania State University Full time

    Job DescriptionWe are seeking an undergraduate research assistant to join our Fluids Machinery Department at the Applied Research Laboratory (ARL) at Penn State University.ResponsibilitiesAssist with writing and debugging computer code to analyze turbomachinery and hydrodynamic designsAssist in the development of optimization methods for constrained...

  • Software Engineer

    4 hours ago


    Palo Alto, California, United States Penn State University Talent Acquisition Full time

    Job SummaryWe are seeking a highly skilled Software Engineer to join our team at Penn State University's Applied Research Laboratory. As a member of our Communications and Signal Processing Division, you will be responsible for designing and developing cutting-edge software solutions to support various research processes and applications.Key...


  • Palo Alto, California, United States Penn State University Talent Acquisition Full time

    Job SummaryWe are seeking a highly motivated and experienced Systems Research and Development Engineer to join our team at Penn State University. As a key member of our Communications and Signal Processing Division, you will be responsible for researching, designing, developing, integrating, and testing advanced communications systems.Key...


  • Palo Alto, California, United States Penn State University Talent Acquisition Full time

    Job SummaryWe are seeking a highly motivated and experienced Systems Research and Development Engineer to join our team at Penn State University. As a key member of our Communications and Signal Processing Division, you will be responsible for researching, designing, developing, integrating, and testing advanced communications systems.Key...

  • Research Engineer

    3 days ago


    Palo Alto, California, United States Penn State University Talent Acquisition Full time

    Job DescriptionWe are seeking a highly skilled Research Engineer to join our team at the Applied Research Laboratory (ARL) at Penn State University. The successful candidate will be responsible for conducting research in the area of vibrations and acoustics, with a focus on structural acoustics, dynamic characterization of structures and materials, fatigue...


  • Palo Alto, California, United States Penn State University Talent Acquisition Full time

    Job DescriptionPenn State University Talent Acquisition is seeking a highly skilled Senior RF Research Engineer to join our team at the Applied Research Laboratory. As a key member of our team, you will be responsible for managing complex RF Propagation projects, contributing to customers' strategic objectives, mentoring staff, and developing relationships...


  • Palo Alto, California, United States The Pennsylvania State University Full time

    Job SummaryWe are seeking a highly motivated and experienced Systems Research and Development Engineer to join our team at The Pennsylvania State University. As a key member of our Communications and Signal Processing Division, you will be responsible for researching, designing, developing, integrating, and testing advanced communications systems.Key...


  • Palo Alto, California, United States PsiQuantum Full time

    About PsiQuantum:At PsiQuantum, we are on a transformative journey to revolutionize the computing landscape through quantum technology. Our mission is to develop the world's first practical quantum computer, which will significantly enhance computational capabilities across various industries.Position Overview:We are seeking a talented Photonics Research...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is in search of outstanding software engineers to advance the development of AI's foundation models. You will collaborate with a select group of elite deep learning professionals to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will facilitate the...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is on the lookout for outstanding software engineers to contribute to the development of AI's foundation models. You will collaborate with a select group of elite deep learning specialists to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will facilitate...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Foundation Models Research EngineerTesla is on the lookout for outstanding software engineers to contribute to the development of AI's foundation models. You will collaborate with a select group of elite deep learning professionals to create cutting-edge neural networks and expand the horizons of AI research and innovation. Your contributions will facilitate...


  • Palo Alto, California, United States Tykhe Inc Full time

    Unlock the Future of GenAI InfrastructureAt Tykhe Inc, we're pushing the boundaries of GenAI infrastructure, focusing on Voice/Audio/Speech, Vision, and Multi-modal platforms. If you're an expert in designing, developing, training, and fine-tuning state-of-the-art models using cutting-edge technologies and frameworks, we want to hear from you.Key...


  • Palo Alto, California, United States Tesla, Inc. Full time

    Research Engineer, Foundation Models, Self-DrivingTesla is on the lookout for outstanding software engineers to develop the foundation models for Tesla AI. You will collaborate with a select group of elite deep learning specialists to create cutting-edge neural networks and explore the frontiers of AI research and innovation. Your contributions will...


  • Palo Alto, California, United States Penn State University Full time

    Job SummaryWe are seeking a highly motivated Postdoctoral Scholar to join our team at the Applied Research Laboratory, Penn State University. The successful candidate will assist with various activities related to material synthesis, processing, characterization, and process modeling for the fabrication and development of advanced materials, coatings, and...


  • Palo Alto, California, United States Pennsylvania State University Full time

    Job DescriptionWe are seeking a highly motivated and experienced Materials Science Research Engineer to join our team at the Applied Research Laboratory (ARL) at Penn State University. The successful candidate will be responsible for designing, developing, and conducting experimental coating deposition trials for a wide range of materials and coatings.Key...


  • Palo Alto, California, United States Pennsylvania State University Full time

    Job DescriptionWe are seeking a highly motivated and experienced Materials Science Research Engineer to join our team at the Applied Research Laboratory (ARL) at Penn State University.Job SummaryThe successful candidate will design, develop, and conduct experimental coating deposition trials for a wide range of materials and coatings. They will also develop...