Artificial Intelligence Software Engineer

2 days ago


San Francisco, California, United States Perplexity AI Full time

We are seeking a highly skilled AI Inference Engineer to join our team at Perplexity AI.

As a member of our team, you will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

The ideal candidate will have experience with ML systems and deep learning frameworks, as well as familiarity with common LLM architectures and inference optimization techniques.

The role involves developing APIs for AI inference, benchmarking and addressing bottlenecks throughout our inference stack, improving the reliability and observability of our systems, and exploring novel research and implementing LLM inference optimizations.

The successful candidate will have a strong background in machine learning and software engineering, and will be able to work effectively in a fast-paced environment.

Additionally, the candidate should have experience with deploying reliable, distributed, real-time model serving at scale, and should be able to understand GPU architectures or have experience with GPU kernel programming using CUDA.

Benchmarking and addressing bottlenecks throughout our inference stack

**Responsibilities**

  1. Develop APIs for AI inference that will be used by both internal and external customers
  2. Benchmark and address bottlenecks throughout our inference stack
  3. Improve the reliability and observability of our systems and respond to system outages
  4. Explore novel research and implement LLM inference optimizations

**Qualifications**

  • Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
  • Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization, etc.)
  • Experience with deploying reliable, distributed, real-time model serving at scale
  • (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA

At Perplexity AI, we offer a comprehensive compensation package that includes a competitive salary, equity, and benefits. The cash compensation range for this role is $190,000 - $240,000. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.

We also offer a comprehensive benefits package, including comprehensive health, dental, and vision insurance for you and your dependents, as well as a 401(k) plan.

If you are a motivated and experienced AI Inference Engineer looking to join a dynamic and growing company, please submit your application.

We are an equal opportunities employer and welcome applications from all qualified candidates.



  • San Jose, California, United States aiXplain Full time

    OverviewaiXplain empowers innovation through state-of-the-art artificial intelligence and machine learning, aiming to achieve a vision of industry leadership in engineering and science.Salary: $120,000 - $180,000 per year, depending on experience and qualifications. This is a competitive salary range for the role of Software Development Engineer - Artificial...


  • San Francisco, California, United States Autodesk Full time

    About the Role:We are seeking a seasoned Artificial Intelligence Engineering Director to join our team at Autodesk. As a key member of our Research Engineering organization, you will lead a team of research engineers in developing new ML-powered product features that empower our customers to create innovative designs and products.About You:You have a strong...


  • San Jose, California, United States AI Technologies LLC Full time

    Company Overview:AI Technologies LLC is a leading technology firm that specializes in the development of innovative artificial intelligence solutions. Our team of experts works tirelessly to create cutting-edge products that revolutionize various industries.Job Title: Artificial Intelligence Software DeveloperLocation: San Jose, CADuration: 8...


  • South San Francisco, California, United States Talent Software Services Full time

    **Job Summary:**Talent Software Services is seeking a highly skilled Artificial Intelligence and Computational Chemistry Researcher for a contract position in South San Francisco, CA.The opportunity will be one year with a strong chance for a long-term extension. Our client is looking for a talented researcher to drive research and engineering efforts to...


  • San Francisco, California, United States TBWA\Chiat\Day Full time

    We are seeking an experienced Data-Driven Artificial Intelligence Expert to join our team at Perplexity. As a pioneer in the field of conversational AI, we are committed to delivering cutting-edge solutions that push the boundaries of innovation.Salary: $220,000 - $300,000 per year.About the RoleThis is an exciting opportunity to work with our talented...


  • San Francisco, California, United States Aera Technology Full time

    Aera Technology is a pioneer in Decision Intelligence, revolutionizing the way enterprises operate sustainably, intelligently, and efficiently.Role OverviewWe're seeking an experienced Lead Artificial Intelligence Architect to design and implement machine learning systems that power our Aera platform. As a key member of our team, you will be responsible for...


  • San Francisco, California, United States CentML Full time

    We're looking for a talented Artificial Intelligence Systems Engineer to help us develop the CentML platform. Our mission is to make AI accessible to everyone, and we need experts like you to make it happen.Key Responsibilities:Design and develop the CentML platform, leveraging your expertise in AI and cloud-based systems.Built solutions for scheduling...


  • San Francisco, California, United States Magic AI Full time

    Job DescriptionMagic AI's mission is to create safe artificial intelligence that accelerates humanity's progress on the world's most pressing problems.We believe the most promising path to safe AI lies in automating research and code generation to improve models and solve alignment more reliably than humans alone. Our approach combines frontier-scale...


  • San Jose, California, United States ByteDance Full time

    About UsByteDance, founded in 2012, is a leading technology company dedicated to inspiring creativity and enriching life. With a diverse range of products, including TikTok, Helo, and Resso, we empower users worldwide to create, consume, and interact with content in innovative ways.Salary RangeThe annual compensation for this position in the selected...


  • San Francisco, California, United States Robotics Technologies LLC Full time

    About the RoleWe are seeking a seasoned Director of Artificial Intelligence to lead our AI/ML efforts and drive business growth.The ideal candidate will possess deep technical expertise, exceptional leadership skills, and a strong ability to collaborate with cross-functional teams.The salary for this position is estimated to be around $180,000 - $220,000 per...


  • San Mateo, California, United States Verkada Full time

    At Verkada, we are dedicated to delivering innovative cloud-based physical security solutions that empower organizations to protect their people and assets.About UsWe are a leader in the field of cloud-based B2B physical security, offering six product lines - video security cameras, access control, environmental sensors, alarms, workplace, and intercoms -...


  • San Francisco, California, United States Nava Software Solutions LLC Full time

    Job OverviewNava Software Solutions LLC is seeking an experienced Artificial Intelligence Enterprise Architect to lead the design and implementation of AI-powered solutions.Key Responsibilities:Architectural Design:Collaborate with stakeholders to understand business requirements and translate them into architectural blueprints.Design scalable, secure, and...


  • San Francisco, California, United States Capital One Full time

    Senior AI Engineer Job DescriptionCapital One is a leading financial institution dedicated to revolutionizing banking through innovative use of Artificial Intelligence (AI) and Machine Learning (ML). We are seeking an experienced Senior AI Engineer to join our team.In this role, you will be responsible for designing, developing, testing, deploying, and...


  • San Francisco, California, United States Programmers Full time

    Programmers Seeks AI/ML Tech LeadProgrammers is looking for a seasoned Artificial Intelligence and Machine Learning (AI/ML) Technology Leader to spearhead the development of an advanced AI/ML platform.The ideal candidate will have extensive experience in MLOps, ML engineering, and data science with a proven track record of leading agile teams. Additionally,...


  • San Francisco, California, United States ZipRecruiter Full time

    Unlock a high-impact role shaping and delivering cutting-edge AI/ML technology for an industry-leading social marketing platform. Scrollmark is on a mission to redefine social commerce by leveraging AI to grow communities, increase engagement, and drive revenue through social messaging.About the OpportunityYou will develop and maintain product-facing...


  • San Francisco, California, United States OpenAI Full time

    About OpenAIWe are at the forefront of artificial intelligence, driving innovation and shaping the future with cutting-edge research. Our mission is to ensure that AI's benefits reach everyone. We are looking for visionary researchers to join our Applied Group, where you'll transform groundbreaking research into real-world applications that can change...


  • San Francisco, California, United States Amazon Full time

    About AmazonAmazon is a multinational technology company that focuses on e-commerce, cloud computing, digital streaming, and artificial intelligence. Our mission is to be Earth's most customer-centric company where people can find and discover anything they might want to buy online.


  • San Francisco, California, United States Unreal Gigs Full time

    About Unreal GigsWe're a cutting-edge company at the forefront of robotics innovation, dedicated to harnessing the potential of artificial intelligence to drive intelligent automation.


  • San Jose, California, United States IBM Full time

    **IBM Research Scientist Internship Opportunity**About the RoleThis internship is an excellent chance to contribute to cutting-edge research in Artificial Intelligence and Machine Learning at IBM. As a researcher intern, you will work closely with our team to advance the development of safe, trustworthy, and reliable Foundation Models.About IBM ResearchAt...


  • San Francisco, California, United States Apple Full time

    Imagine the PossibilitiesAt Apple, we're pushing the boundaries of what's possible with artificial intelligence and machine learning. As a member of our Information Intelligence team, you'll have the opportunity to work on groundbreaking technology that's redefining search and intelligence for hundreds of millions of people around the world.Key...