Systems Research Engineer, GPU Programming

1 week ago


San Francisco, California, United States Together AI Full time
Role

At our company, as a Systems Research Engineer focusing on GPU Programming, your role will be essential in the development and optimization of GPU-accelerated kernels and algorithms for ML/AI applications. You will collaborate closely with the modeling and algorithm team to co-design GPU kernels and model architecture, aiming to improve the performance and efficiency of our AI systems. Collaborating with hardware and software teams, your contribution will be key in the co-design of efficient GPU architectures and programming models, utilizing your expertise in GPU programming and parallel computing. Keeping up-to-date with the latest advancements in GPU programming techniques is crucial to ensure that our AI infrastructure remains innovative.

Requirements
  • Expertise in GPU programming and parallel computing, including CUDA and/or Triton
  • Understanding of ML/AI applications and models
  • Familiarity with performance profiling and optimization tools for GPU programming
  • Strong problem-solving and analytical skills
  • Degree in Computer Science, Electrical Engineering, or equivalent practical experience
Responsibilities
  • Optimizing and refining GPU code for improved performance and scalability
  • Working with cross-functional teams to integrate GPU-accelerated solutions into existing software systems
  • Staying informed about the latest GPU programming techniques and technologies
About the Company

Our company, Together AI, is dedicated to research-oriented artificial intelligence. We believe in transparent AI systems driving innovation for the benefit of society. Our mission is to reduce the cost of modern AI systems significantly through collaborative efforts in software, hardware, algorithms, and models. Having contributed to prominent open-source research, models, and datasets, we are pioneers in technological advancements with projects like FlashAttention, Hyena, FlexGen, and RedPajama. Join us and be part of our passionate team of researchers shaping the future of AI infrastructure.

Compensation

We provide competitive compensation, startup equity, health insurance, and other benefits, along with remote work flexibility. The salary range for this full-time position in the US is between $160,000 and $230,000, in addition to equity and benefits. Salary levels are determined based on location, experience, skills, and job role.

Equal Opportunity

Together AI upholds Equal Employment Opportunity principles, offering fair employment opportunities regardless of race, color, ancestry, religion, gender, nationality, sexual orientation, age, marital status, disability, gender identity, veteran status, and more.

Please refer to our privacy policy.

  • San Jose, California, United States Samsung Electronics Perú Full time

    Please visit Samsung membership to see Privacy Policy, which defaults according to your location. You can change Country/Language at the bottom of the page.If you are a resident of the European Union or the European Economic Area, please click here . If you are a resident of the U.S., please click here . If you are a resident of the Philippines, please click...

  • Research Engineer

    1 week ago


    San Francisco, California, United States Understanding Recruitment Full time

    Research EngineerAre you an innovative thinker passionate about advancing AI technology? We're looking for a Research Engineer to join our dynamic team in pioneering the future of artificial intelligence. Our company stands at the forefront of AI development, working on cutting-edge projects that reimagine how technology interacts and assists in everyday...


  • San Jose, California, United States Samsung Electronics Perú Full time

    Please visit Samsung membership to see Privacy Policy, which defaults according to your location. You can change Country/Language at the bottom of the page.If you are a resident of the European Union or the European Economic Area, please click here . If you are a resident of the U.S., please click here . If you are a resident of the Philippines, please click...

  • Systems Engineer

    1 month ago


    San Francisco, California, United States Imbue (formerly Generally Intelligent) Full time

    SummaryAs a systems engineer, you'll work on pioneering machine learning infrastructure that enables running large numbers of experiments in parallel across local and cloud GPUs, extremely fast training, and guarantees that we can trust experiment results. This allows us to do actual science to understand, from first principles, how to build human-like...

  • Systems Engineer

    4 weeks ago


    San Francisco, California, United States Imbue (formerly Generally Intelligent) Full time

    SummaryAs a systems engineer, you'll work on pioneering machine learning infrastructure that enables running large numbers of experiments in parallel across local and cloud GPUs, extremely fast training, and guarantees that we can trust experiment results. This allows us to do actual science to understand, from first principles, how to build human-like...


  • San Francisco, California, United States Imbue Full time

    Summary: As a systems engineer, you'll work on pioneering machine learning infrastructure that enables running large numbers of experiments in parallel across local and cloud GPUs, extremely fast training, and guarantees that Imbue can trust experiment results. This allows them to do actual science to understand, from first principles, how to build...


  • San Jose, California, United States Samsung Electronics America Full time

    Position SummarySamsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is applied...


  • San Diego, California, United States CEREBRAS SYSTEMS INC. Full time

    Cerebras' systems are designed with a singular focus on machine learning. Our processor is the Wafer Scale Engine (WSE), a single chip with performance equivalent to a cluster of GPUs, giving the user cluster-scale capability with the simplicity of programming a single device. Because of this programming simplicity, large model training can be scaled out...


  • San Francisco, California, United States Wispr AI Full time

    Wispr is building a more natural way to interact with technology with neural interfaces. We have an elite team of engineers, product designers, and research scientists building magic.About Wispr: We've raised $25M from top-tier VCs like NEA and 8VC. Our angels and advisors include Chester Chipperfield (product lead for the first Apple Watch), Ben Jones (COO,...


  • San Francisco, California, United States Wispr AI Full time

    Wispr is building a more natural way to interact with technology with neural interfaces. We have an elite team of engineers, product designers, and research scientists building magic.About Wispr: We've raised $25M from top-tier VCs like NEA and 8VC. Our angels and advisors include Chester Chipperfield (product lead for the first Apple Watch), Ben Jones (COO,...


  • San Francisco, California, United States META Full time

    Summary:In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined...


  • San Francisco, California, United States Genai Works Full time

    About the TeamThe Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL·E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate inference infrastructure at scale. There's a lot more on the immediate horizon.We seek to learn from deployment and distribute the benefits of AI, while ensuring...


  • San Francisco, California, United States Shadeform Full time

    Shadeform is a cloud GPU marketplace that provides a unified platform and API to access cloud GPUs across 15+ cloud providers. We're hiring a founding engineer to further accelerate our growth and move faster towards our goal of making GPU infrastructure widely accessible.We closed a seed round in September '23 and are well capitalized. We run very lean and...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview: Welcome to the forefront of artificial intelligence innovation Our company is dedicated to pushing the boundaries of AI technology to solve complex problems and drive transformative change across industries. We're committed to developing cutting-edge AI algorithms that push the limits of what's possible. Join us and be part of a dynamic...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview:Welcome to the forefront of artificial intelligence innovation Our company is dedicated to pushing the boundaries of AI technology to solve complex problems and drive transformative change across industries. We're committed to developing cutting-edge AI algorithms that push the limits of what's possible. Join us and lead our team in shaping...

  • Research Engineer

    1 month ago


    San Francisco, California, United States Autodesk Full time

    Job Requisition ID # 23WD74103 Position OverviewWe are looking for a Research Engineer, Mechatronics & Software Research to develop and implement research prototypes for the Autodesk Research organization.Our dedicated team, Autodesk Research, seeks to explore the "new possible" what's possible in the future which means we're looking as far as 10 years out,...

  • Research Engineer

    4 weeks ago


    San Francisco, California, United States Autodesk Full time

    Job Requisition ID # 23WD74103 Position OverviewWe are looking for a Research Engineer, Mechatronics & Software Research to develop and implement research prototypes for the Autodesk Research organization.Our dedicated team, Autodesk Research, seeks to explore the "new possible" what's possible in the future which means we're looking as far as 10 years out,...

  • Senior Engineer

    1 week ago


    San Diego, California, United States LanceSoft Full time

    Job Title: Graphics Software EngineerWe are currently seeking a Graphics Software Engineer to join our team. If you have a passion for GPU technology, this role might be perfect for you!Top 5 Required Skills:Experience with Open GLESExperience with VulkanExperience with DirectXExperience with GPGPUExperience with C/C++Required Education:Bachelor's degree in...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview:Welcome to the forefront of artificial intelligence innovation Our company is dedicated to pushing the boundaries of AI technology to solve complex problems and drive transformative change across industries. We're committed to developing cutting-edge AI algorithms that push the limits of what's possible. Join us and be part of a dynamic team...


  • San Francisco, California, United States Spellbrush Full time

    The Role:Spellbrush, the world's leading generative AI studio behind niji・journey, is looking for an AI Infrastructure Engineer to join us in building out end-to-end ML infrastructure to run our models on all platforms.What you'll do:Design, implement and run our next-generation inference architecture for running all our models powering all platforms and...