AI Software Architect for High-Performance AI Inference Chip

5 days ago


San Francisco, California, United States ZipRecruiter Full time
About the Job

We are seeking a highly experienced Software Architect to lead our software efforts and advance the software stack that includes ML frameworks, compilers, libraries, and runtime.

Job Responsibilities
  • Advance Compiler and Runtime Technology: Develop high-performance acceleration of AI workloads across various neural network architectures.
  • Design New Software and Hardware Solutions: Research and design new software and hardware AI solutions involving simulators, optimizing compilers, code generators, and runtime execution frameworks for deep learning accelerators.
  • Evaluate Parallelization Strategies: Evaluate trade-offs of different parallelization strategies such as performance, power, energy, and memory consumption.
  • Enhance AI Software Tools: Keep up with industry and academic developments to enhance our products and support the latest DNNs emerging from research and industry.
Requirements
  • Experience: 10+ years of experience developing software for highly parallel architectures.
  • Skills: Strong problem-solving skills, understanding of Deep Learning fundamentals, development skills in C/C++, Python, and excellent soft skills.
  • Education: Computer Science, Engineering, or related degree; preferably MS or PhD.
Benefits
  • 20 vacation days
  • Strong health and extended health benefits
  • Unlimited sick days
  • Stock options

Please note that this is a senior-level position requiring extensive experience and expertise in AI software architecture. The estimated annual salary for this role is $160,000-$200,000, depending on location and qualifications.



  • San Francisco, California, United States Untether AI Full time

    Software Architect for AI InferenceWe are seeking an exceptional Software Architect to join our team at Untether AI, where you will play a key role in designing and developing software that interacts with our innovative chip. As part of our top-notch team, you will collaborate closely with hardware engineers and fellow software engineers to create software...


  • San Francisco, California, United States ZipRecruiter Full time

    About the Company:">ZipRecruiter's client is a pioneering company that designs and manufactures cutting-edge pure digital AI inference chips. They are seeking a skilled Software Architect to lead their software efforts, drive innovation, and advance the software stack that includes ML frameworks, compilers, libraries, and...


  • San Francisco, California, United States Spice AI Full time

    About Spice AI">At Spice AI, we're creating technology to help developers build intelligent applications and agents that learn and adapt. Our mission is to make it easier for developers to combine code, data, and AI to build truly intelligent, decision-making systems.">We created the Spice.ai OSS, a portable AI database written in Rust with a unified SQL...


  • San Francisco, California, United States Together AI Full time

    About the Role">We are seeking a highly skilled DevOps Engineer to join our team at Together AI. As an MLOps engineer, you will develop systems and APIs that enable our customers to perform inference and fine-tune LLMs.">Key Responsibilities">Implement runtime systems that perform inference at scale using AI/ML models from simple models up to the largest...


  • San Francisco, California, United States Magic AI Full time

    About MagicAt Magic AI, we are building safe Artificial General Intelligence (AGI) to accelerate humanity's progress on the world's most pressing problems. Our mission is to automate research and code generation to improve models and solve alignment more reliably than humans alone.We believe our approach, combining frontier-scale pre-training,...


  • San Francisco, California, United States Perplexity AI Full time

    Job DescriptionWe are seeking an AI Inference Engineer to join our growing team. As a key member of our engineering team, you will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.Benchmark and address bottlenecks throughout our inference stackImprove the reliability and observability of our systems...


  • San Francisco, California, United States Magic AI Full time

    Company OverviewMagic AI is a pioneering company dedicated to developing safe Artificial General Intelligence (AGI) that accelerates humanity's progress on the world's most pressing issues. Our mission is to make a meaningful impact by automating research and code generation, enabling us to improve models and solve alignment more reliably than humans...


  • San Francisco, California, United States rippling- ATS Full time

    Bronco AI is a pioneering applied AI lab dedicated to helping chipmakers push the boundaries of Moore's Law. Our mission is to build cutting-edge AI silicon engineers that can automate complex chip design and verification processes from initial spec to final tape-out.About the RoleAs a founding ML research engineer at Bronco AI, you will leverage your domain...


  • San Francisco, California, United States Scale AI Full time

    About ScaleScale AI is a pioneering company that's revolutionizing the way organizations build and deploy AI. With a strong mission to accelerate AI development, we provide innovative data solutions that fuel the most exciting advancements in AI.Our team is committed to making AI more accessible, powering the world's most advanced LLMs, generative models,...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking an experienced Systems Research Engineer to join our team at Together AI. As a key member of our research-driven artificial intelligence company, you will play a crucial role in researching and building the next generation AI platform.Company OverviewTogether AI is committed to creating open and transparent AI systems that drive...


  • San Francisco, California, United States Magic AI Full time

    About MagicMagic is a cutting-edge technology company committed to developing safe Artificial General Intelligence (AGI) that accelerates humanity's progress on the world's most pressing challenges. Our mission revolves around automating research and code generation to improve models and solve alignment more reliably than humans alone.We believe our approach...

  • AI Research Engineer

    4 weeks ago


    San Francisco, California, United States rippling- ATS Full time

    About the RoleWe are seeking a talented AI research engineer to complement our team's rich AI background and lend their domain expertise in building intelligent reasoning systems for chip verification.This is an excellent opportunity to work in a high-ownership, high-velocity environment and contribute to the development of AI silicon engineers that can...


  • San Francisco, California, United States Abridge AI Inc. Full time

    Unlock the Potential of Healthcare with AbridgeAbridge AI Inc. is revolutionizing the healthcare industry with cutting-edge AI technology, empowering clinicians to focus on patient care while streamlining clinical documentation processes.About the RoleWe are seeking an experienced Transformative AI Systems Architect to join our team and play a pivotal role...


  • San Francisco, California, United States Spice AI Full time

    About UsSpice AI is a technology company creating innovative solutions to help developers build intelligent applications and agents that learn and adapt. Founded in 2021 by Microsoft and GitHub alumni Luke Kim and Phillip LeBlanc, we're backed by top industry leaders and venture capital firms.We're passionate about empowering developers with the tools and...


  • San Francisco, California, United States ZipRecruiter Full time

    Unlock Your Potential as a Senior AI Infrastructure Software ArchitectOverview:At ZipRecruiter, we're pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. We're redefining AI cloud infrastructure with a mission to align the future of computing with the...


  • San Francisco, California, United States Untether AI Full time

    At Untether AI, we're pushing the boundaries of artificial intelligence with our revolutionary new architecture that achieves unparalleled performance and efficiency in neural net inference. Our groundbreaking technology has already garnered significant interest from smart clients looking to be at the forefront of innovation.We're seeking a seasoned...


  • San Francisco, California, United States Liquid AI Full time

    Job DescriptionWe are looking for a talented Senior Optimization Engineer to join our team and help us develop highly optimized ML inference stacks for various hardware platforms. The successful candidate will have extensive experience in coding, with expertise in Python, PyTorch, CUDA, and C++. They should be able to work independently, taking ownership of...


  • San Francisco, California, United States Perplexity AI Full time

    We are seeking an experienced Data Inference Specialist to join our team at Perplexity AI.OverviewAt Perplexity AI, we've achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with mobile apps installed over 1...


  • San Francisco, California, United States Liquid AI Full time

    Harness Machine Learning Potential: As a key member of our team, you'll play a vital role in shaping the future of machine learning at Liquid AI. With a competitive salary range of $150,000 - $170,000 per annum, depending on experience and qualifications, you'll have the opportunity to grow professionally and make a meaningful impact. Job Description: Our...


  • San Francisco, California, United States Together AI Full time

    Company Overview:At Together AI, we believe open and transparent AI systems will drive innovation and create the best outcomes for society. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama.Job Description:We are seeking an experienced MLOps engineer to develop systems and APIs that enable our customers...