Senior AI Infrastructure Engineer

2 weeks ago


San Jose, California, United States Adobe Full time

About Adobe

At Adobe, we are dedicated to transforming the world through innovative digital experiences. Our mission is to empower everyone—from budding creators to established brands—with the tools they need to craft and deliver outstanding digital content. We are passionate about enabling individuals to produce stunning visuals, videos, and applications, while reshaping how businesses engage with their audiences across various platforms.

We are committed to attracting top talent and fostering an exceptional workplace culture where respect and equal opportunity are paramount. We believe that groundbreaking ideas can emerge from any level of the organization, and we are excited to see what you can contribute.

Position Overview

Firefly represents our latest suite of generative AI models designed to enhance Adobe products, providing a revolutionary approach to content creation. This initiative builds upon four decades of technological advancements at Adobe.

Central to Firefly are our commercially viable AI models, meticulously trained on a vast collection of images owned or licensed by Adobe. We are seeking a strategic and influential professional to help advance these models, presenting a unique opportunity to impact millions of creative professionals and redefine their workflows.

Key Responsibilities

  1. Architect, develop, and maintain robust AI/ML infrastructure that supports the training and deployment of large-scale AI models, using Kubernetes and Python on AWS.
  2. Enhance and optimize GPU-based distributed training frameworks to boost performance and scalability, improving resiliency, elasticity, and data handling while supporting training optimizations such as FP8 precision, FSDP, and model parallelism.
  3. Produce high-quality, maintainable, and testable code adhering to industry best practices.
  4. Work collaboratively with ML Researchers and Machine Learning Engineers to expedite the training of state-of-the-art ML models.
  5. Stay abreast of the latest advancements in academia and the open-source community to facilitate the swift adoption of cutting-edge technologies that enhance ML platform performance.
  6. Contribute to the development of superior models by refining orchestration and scheduling, increasing job scalability, and accelerating experimentation through AutoML and similar methodologies.
  7. Partner with data scientists and ML researchers to optimize the model training pipeline and ensure efficient resource utilization.
  8. Lead innovations in infrastructure practices to support pioneering machine learning research and development.

Qualifications

  1. PhD or Master's degree in computer science or a related discipline, with over 5 years of relevant industry experience.
  2. Demonstrated expertise in Python and the development of systems, frameworks, and SDKs.
  3. Strong background in infrastructure, with a comprehensive understanding of model serving, training, orchestration, and GPU resource management.
  4. Experience with machine learning and distributed PyTorch.
  5. Exceptional critical thinking, analytical, and quantitative problem-solving skills.
  6. Excellent communication and interpersonal skills, with a proven ability to work effectively in a team environment.

Preferred Qualifications

  1. Familiarity with Kubeflow, MLflow, Ray, SageMaker, or similar platforms.
  2. Experience with NVIDIA HPC.
  3. Knowledge of PyTorch distributed, MPI, Megatron, Horovod, and other AI training frameworks.

Compensation and Benefits

Our compensation reflects the labor market across various U.S. regions, with pay differing based on defined markets. The U.S. salary range for this position is $170,900 to $325,200 annually, with variations based on work location and job-related knowledge, skills, and experience.

Adobe is proud to be an Equal Employment Opportunity and affirmative action employer, ensuring that we do not discriminate based on gender, race, ethnicity, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other protected characteristic. We are committed to creating an inclusive environment for all employees.

Adobe aims to make our resources accessible to all users. If you require accommodations to navigate our website or complete the application process, please reach out for assistance.


