GPU Programming Systems Research Engineer

1 week ago


San Francisco, California, United States Together AI Full time
About the Role

We are seeking a highly skilled Systems Research Engineer to join our team at Together AI. As a key member of our research team, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications.

Key Responsibilities
  • Optimize and fine-tune GPU code to achieve better performance and scalability
  • Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems
  • Stay up-to-date with the latest advancements in GPU programming techniques and technologies
Requirements
  • Strong background in GPU programming and parallel computing, such as CUDA and/or Triton
  • Knowledge of ML/AI applications and models
  • Knowledge of performance profiling and optimization tools for GPU programming
  • Excellent problem-solving and analytical skills
  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experiences
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models.

We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama.

We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

What We Offer

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work.

The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

We are an Equal Opportunity Employer and are proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.



  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer with expertise in GPU programming to join our team at Together AI. As a key member of our research team, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications.Key ResponsibilitiesDesign and develop high-performance...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer to join our team at Together AI. As a key member of our research team, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications.Key ResponsibilitiesOptimize and fine-tune GPU code to achieve better performance and...


  • San Francisco, California, United States Together Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer with expertise in GPU programming to join our team at Together AI. As a key member of our research team, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications.Key ResponsibilitiesOptimize and fine-tune GPU code to achieve...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer with expertise in GPU programming to join our team at Together AI. As a key member of our research team, you will play a critical role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications.Key ResponsibilitiesOptimize and fine-tune GPU code to achieve...

  • Senior GPU Engineer

    1 week ago


    San Francisco, California, United States Succinct Full time

    About SuccinctSuccinct is a pioneering company in the field of zero-knowledge proofs, dedicated to making this complex technology accessible to developers. Our mission is to empower developers to build scalable, interoperable, and private blockchain solutions.The RoleWe are seeking a Senior GPU Engineer to join our team and contribute to the development of...


  • San Francisco, California, United States mistral Full time

    Position OverviewMistral AI is seeking a skilled professional to enhance the performance of large language models through efficient GPU utilization. This role focuses on optimizing training and serving processes using advanced GPU technology.Key ResponsibilitiesDeveloping low-level programming solutions to maximize the performance of cutting-edge GPUs...

  • Senior GPU Engineer

    3 weeks ago


    San Francisco, California, United States Succinct Full time

    About the RoleWe are seeking a highly skilled Senior GPU Engineer to join our team at Succinct. As a key member of our engineering team, you will play a critical role in designing, developing, and optimizing software solutions to enable GPU acceleration of our zkVM, SP1, and to contribute to the development of our hardware-accelerated prover network.Key...


  • San Jose, California, United States Cadence Design Systems Full time

    Job SummaryCadence Design Systems is seeking a highly skilled Senior Systems Engineer - InfiniBand GPU to join our team. As a key member of our team, you will be responsible for designing, implementing, and maintaining high-performance computing systems using InfiniBand technology.Key ResponsibilitiesDesign and implement high-performance computing systems...


  • San Francisco, California, United States mistral Full time

    Mistral AI is seeking a specialist in the domain of optimizing and training expansive language models with high efficiency on GPU technology.Key Responsibilities:-Developing low-level programming to fully leverage the capabilities of advanced GPUs (H100) and maximize their performance.-Reevaluating various components of the generative model architecture to...

  • Senior GPU Engineer

    4 weeks ago


    San Francisco, California, United States Succinct Full time

    About the RoleWe are seeking a highly skilled Senior GPU Engineer to join our team at Succinct, a leading innovator in zero-knowledge proofs and zkVM technology. As a key member of our engineering team, you will play a critical role in designing, developing, and optimizing software solutions to accelerate our zkVM, SP1, and contribute to the development of...


  • San Jose, California, United States Cadence Design Systems Full time

    Job SummaryCadence Design Systems is seeking a highly skilled Senior Systems Engineer to join our team. As a key member of our data center operations team, you will be responsible for ensuring the smooth operation of our InfiniBand GPU infrastructure.Key ResponsibilitiesAssist with all projects and repairs throughout the data centerParticipate in an on-call...

  • Research Scientist

    2 weeks ago


    San Francisco, California, United States RI Research Instruments GmbH Full time

    About the RoleWe are seeking a highly skilled Research Engineer to join our team at RI Research Instruments GmbH. As a Research Engineer, you will play a key role in designing and developing large-scale machine learning systems from the ground up.Key ResponsibilitiesDesign and implement large-scale machine learning systems, ensuring they are safe, steerable,...

  • GPU Modeling Engineer

    2 weeks ago


    San Jose, California, United States SAMSUNG Full time

    Job SummarySamsung is seeking a skilled GPU Modeling Engineer to join our team at the Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL). As a GPU Modeling Engineer, you will be responsible for designing and architecting C++ models of GPU blocks, implementing and testing these models, and identifying optimizations to...

  • GPU Modeling Engineer

    20 hours ago


    San Jose, California, United States SAMSUNG Full time

    Job SummarySamsung is seeking a highly skilled GPU Modeling Engineer to join our team at the Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL). As a GPU Modeling Engineer, you will be responsible for designing and architecting C++ models of GPU blocks, implementing and testing these models, and identifying architecture,...


  • San Jose, California, United States Cadence Design Systems Full time

    About the RoleCadence Design Systems is seeking a highly skilled Senior Systems Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our data center infrastructure, with a focus on InfiniBand GPU solutions.Key ResponsibilitiesAssist with all projects and repairs throughout...


  • San Francisco, California, United States Succinct Full time

    Overview of Our VisionZero-knowledge proofs (ZKPs) represent a pivotal technology for enhancing blockchain scalability, interoperability, and privacy. However, their complexity often poses a barrier for many developers. At Succinct, our goal is to simplify the implementation of zero-knowledge proofs, making them accessible to a broader audience of...

  • GPU Modeling Engineer

    3 weeks ago


    San Jose, California, United States SAMSUNG Full time

    Job SummarySamsung, a world leader in advanced semiconductor technology, is seeking a highly skilled GPU Modeling Engineer to join our team at the Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL). As a GPU Modeling Engineer, you will play a critical role in the development of our next-generation mobile GPU, Xclipse,...

  • GPU Modeling Engineer

    4 weeks ago


    San Jose, California, United States SAMSUNG Full time

    Job SummarySamsung, a world leader in advanced semiconductor technology, is seeking a highly skilled GPU Modeling Engineer to join our team at the Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL). As a GPU Modeling Engineer, you will play a critical role in the development of our cutting-edge GPU technology.Key...


  • San Francisco, California, United States mistral Full time

    About the Role:Mistral AI is seeking a highly skilled expert in GPU programming to join our team. As a key member of our organization, you will be responsible for developing and optimizing large language models for high-speed training and serving on GPUs.Key Responsibilities:GPU Optimization: Design and implement low-level code to maximize the performance of...


  • San Francisco, California, United States OpenAI Full time

    About the TeamThe Applied Engineering team at OpenAI is a collaborative group that works across research, engineering, product, and design to bring cutting-edge AI technology to consumers and businesses. Our team is responsible for running the infrastructure that supports the models backing ChatGPT and the API, including inference kubernetes clusters, GPU...