Systems Research Engineer, Machine Learning Systems

2 days ago


San Francisco CA, United States Together AI Full time

RoleAs a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale distributed training systems and a low-latency/high-throughput inference engine that serves a diverse, rapidly growing user base. Your research skills will be vital in staying up-to-date with the latest advancements in machine learning systems, ensuring that our AI infrastructure remains at the forefront of innovation.RequirementsStrong background in machine learning systems, such as distributed learning and efficient inference for large language models and diffusion modelsKnowledge of ML/AI applications and models, especially foundation models such as large language models and diffusion models, how they are constructed and how they are usedKnowledge of system performance profiling and optimization tools for ML systemsExcellent problem-solving and analytical skillsBachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experienceResponsibilitiesOptimize and fine-tune existing training and inference platform to achieve better performance and scalabilityCollaborate with cross-functional teams to integrate cutting edge research ideas into existing software systemsDevelop your own ideas of optimizing the training and inference platforms and push the frontier of machine learning systems researchStay up-to-date with the latest advancements in machine learning systems techniques and apply many of them to the Together platformAbout Together AITogether AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.CompensationWe offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.Equal OpportunityTogether AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.Please see our privacy policy at 



  • San Francisco, United States Tbwa ChiatDay Inc Full time

    Systems Research Engineer, Machine Learning SystemsRoleAs a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale distributed training systems and a...


  • San Francisco, United States Tbwa ChiatDay Inc Full time

    Systems Research Engineer, Machine Learning SystemsRoleAs a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale distributed training systems and a...


  • San Francisco, United States Together AI Full time

    RoleAs a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale distributed training systems and a low-latency/high-throughput inference engine that...


  • San Francisco, California, United States Tbwa ChiatDay Inc Full time

    At Together AI, we are seeking a highly skilled Machine Learning Systems Researcher to join our team. As a researcher in this role, you will play a critical part in designing and building the next generation AI platform.The ideal candidate will have a strong background in machine learning systems, including distributed learning and efficient inference for...


  • San Francisco, California, United States Tbwa ChiatDay Inc Full time

    Design and Build Next-Generation AI InfrastructureAt Together AI, we're on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We invite you to join our passionate team of researchers in building the next generation AI infrastructure.About the RoleAs a Systems Research Engineer...


  • San Francisco, United States Abridge Full time

    Job DescriptionJob DescriptionAbridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.Our enterprise-grade technology transforms...


  • Cupertino, CA, United States Apple Full time

    SummaryPosted: Role Number:200579163Our team is building next generation systems and tools supporting the research and development of machine learning models, and their integration into the products Apple customers love. We're a fast moving, highly skilled but small team designing and building a collection of tools and systems used by Apple’s MLEs and data...


  • San Francisco, California, United States Together AI Full time

    About the RoleTogether AI is seeking a Distributed ML Systems Engineer to design and build large-scale machine learning systems that power our accelerated AI initiatives. This involves developing fault-tolerant distributed systems that handle high-load and high-performance requirements.Key Responsibilities Include:Designing and building scalable machine...


  • San Francisco, United States Sentry Full time

    About the roleAs a Senior Machine Learning Systems Engineer on Sentry’s AI/ML team, you’ll be directly responsible for building the core infrastructure used to develop, evaluate, deploy, iterate on models and pipelines at scale. This role is crucial; you will be at the forefront of integrating machine learning into our core products, from error...


  • San Francisco, United States Your Personal AI Full time

    We are looking for a highly skilled and innovative Machine Learning Engineer to join our dynamic team at Your Personal AI. In this role, you will be responsible for designing, developing, and deploying state-of-the-art machine learning models that drive the core of our AI-driven solutions. You will collaborate closely with cross-functional teams to identify...


  • San Jose, United States Adobe Systems Full time

    Our CompanyChanging the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...


  • San Jose, United States Cadence Design Systems Full time

    At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology. Cadence Design Systems is a world leader in providing computational software for all aspects of intelligent system design. You will be part of a cross-disciplinary R&D team working on the emerging boundary of scientific computing and machine...


  • San Francisco, CA, United States CyberCoders Full time

    San Francisco, CAHybrid Full-time $160,000.00 - $230,000.00Posted 05/29/2024If you are a Machine Learning Engineer with experience fine-tuning LLM's, please read on!We are an artificial intelligence firm focused on research. To push the boundaries of AI, we provide cutting-edge open-source research, models, and datasets. Our decentralized cloud services...


  • San Francisco, United States Evolve Group Full time

    Machine Learning EngineerA well-funded company is looking for a Machine Learning Engineer to join their team.What You’ll Do:Develop and improve AI models and systems (design, train, test, and deploy them).Lead and mentor a team of engineers.Work with product teams to add AI features to apps.Collaborate with researchers on new AI innovations.Build secure...


  • San Francisco, United States Evolve Group Full time

    Machine Learning EngineerA well-funded company is looking for a Machine Learning Engineer to join their team.What You’ll Do:Develop and improve AI models and systems (design, train, test, and deploy them).Lead and mentor a team of engineers.Work with product teams to add AI features to apps.Collaborate with researchers on new AI innovations.Build secure...


  • San Diego, California, United States Tata Consultancy Services Full time

    **About the Role**We are looking for a skilled Machine Learning Systems Engineer to join our team. As a key member of our AI group, you will be responsible for designing, developing, and deploying large-scale machine learning models that drive business value.**Key Responsibilities**Develop high-performance, low-latency code using Rust/C++ to support machine...


  • San Francisco, United States Abridge Full time

    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.Our enterprise-grade technology transforms patient-clinician conversations into...


  • San Jose, CA, United States CISCO Systems Full time

    Application window is expected to close on January 24, 2025.Note: This job posting may close earlier if the position is filled or if a sufficient number of applications are received.Who We AreThe Cisco Security AI team delivers AI products and platform for all Cisco Secure products and portfolios so businesses around the world can defend against threats and...


  • Sunnyvale, CA, United States Baidu Full time

    Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in AI industry and the desire to solve them? Do you want to work with a world-class team to explore the fast-growing AI hardware opportunities and impact on AI industry?We’re looking forward to you joining us to collaborate, contribute, and...


  • San Francisco, California, United States Naptha AI Full time

    About the RoleWe are seeking an exceptional AI researcher to join our team at Naptha AI, focusing on advancing the state of the art in multi-agent systems and agent interoperability. As a key member of our research team, you will be responsible for researching and developing novel approaches to agent collaboration, coordination, and scalability.Key...