Senior AI Infrastructure Engineer

2 weeks ago


San Jose, California, United States Adobe Full time

About Adobe

At Adobe, we are dedicated to transforming the world through innovative digital experiences. Our mission is to empower individuals—from budding creators to established brands—with the tools they need to craft exceptional digital content. We are passionate about enabling creativity and enhancing the way businesses engage with their audiences across various platforms.

We strive to recruit top talent and foster an inclusive workplace where every employee is valued and has equal opportunities. We believe that great ideas can emerge from any level of the organization, and we are excited to see what you can contribute.

Position Overview

We are hiring for a strategic, high-impact role within our Firefly initiative, a new suite of generative AI models designed to enhance Adobe products. This initiative is a natural progression of Adobe's technological advancements over the past four decades.

Firefly's foundation lies in our commercially viable AI models, meticulously trained on vast collections of images owned or licensed by Adobe. This role offers a unique opportunity to influence millions of creative professionals, assisting them in revolutionizing their workflows.

Key Responsibilities

  1. Architect, develop, and maintain robust AI/ML infrastructure for training and deploying large AI models, using Kubernetes and Python in the AWS cloud environment.
  2. Enhance and optimize GPU-based distributed training frameworks to boost performance and scalability. Focus on improving resilience, elasticity, and data loading, and on providing comprehensive support for GPU optimization techniques such as FP8, FSDP, and model parallelism.
  3. Produce high-quality, maintainable, and testable code that adheres to industry standards.
  4. Work collaboratively with ML Researchers and Machine Learning Engineers to expedite the training of state-of-the-art ML models.
  5. Stay informed about the latest advancements in academia and the open-source community to swiftly integrate innovative technologies that enhance ML platform performance.
  6. Contribute to model improvement by refining orchestration and scheduling, scaling the number of jobs, and facilitating rapid experimentation through AutoML and similar methodologies.
  7. Partner with data scientists and ML researchers to optimize the model training pipeline and ensure efficient resource allocation.
  8. Lead initiatives to innovate infrastructure practices that support groundbreaking machine learning research and development.

Qualifications

  1. PhD or Master's degree in computer science or a related discipline, accompanied by 5+ years of relevant industry experience.
  2. Demonstrated expertise in Python and the development of systems, frameworks, and SDKs.
  3. Solid understanding of infrastructure, model serving, training, orchestration, and GPU resource management.
  4. Experience with machine learning and distributed PyTorch.
  5. Strong analytical, critical thinking, and quantitative problem-solving skills.
  6. Exceptional communication and interpersonal skills, with a collaborative mindset.

Preferred Qualifications

  1. Familiarity with Kubeflow, MLflow, Ray, SageMaker, or similar tools.
  2. Experience with NVIDIA HPC.
  3. Knowledge of PyTorch distributed, MPI, Megatron, Horovod, and other AI training frameworks.

Compensation

Our compensation structure reflects the labor market across various U.S. regions, with salary variations based on location and job-related qualifications. The salary range for this position is competitive and will be discussed during the hiring process.

Adobe is committed to equal employment opportunities and affirmative action. We do not discriminate based on gender, race, ethnicity, age, disability, religion, sexual orientation, gender identity, veteran status, or any other protected characteristic. We welcome applications from qualified individuals with arrest or conviction records in accordance with applicable laws.

Adobe aims to ensure accessibility for all users. If you require accommodations to navigate our website or complete the application process, please reach out for assistance.


