Engineering Manager, GradientAI Infrastructure

1 week ago


Seattle, Washington, United States DigitalOcean Full time

Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you'll find your place here. We value winning together—while learning, having fun, and making a profound difference for the dreamers and builders in the world.

We want people who are passionate about building features that you and your peers will love. DigitalOcean's GradientAI Infrastructure Team is welcoming a new technical engineering manager to support our engineers, grow our culture, and lead a team developing our AI/ML infrastructure products. Upon selection, you will be responsible for guiding the development of a 6-8 person engineering team, facilitating communications, providing clarity of vision and priority, and empowering the team to create innovative solutions for our partners and customers. This team will be building a new product that will bring our famed DigitalOcean Simplicity to the world of Large Language Model (LLM) hosting, serving, and optimization. If you are someone who shares our passions for technology solutions, healthy services, and being loving service providers, team members, and leaders, we want to meet you.

What You'll Be Doing:
  • Growing and leading a highly-collaborative engineering team
  • Developing and shepherding complex AI and cloud engineering projects through the entire product development lifecycle (PDLC) - ideation, product definition, experimentation, prototyping, development, testing, release, and operations
  • Helping the team achieve higher standards of performance and product quality
  • Introducing and improving processes for team performance and quality-of-life
  • Collaborating with product owners and cross-functional teams to design idiomatic, feature-rich, and operationally sustainable software solutions
  • Overseeing the design and implementation of scalable, automated systems for DNS provisioning, monitoring, and failover
  • Facilitating transparent, constructive communication and a fair, but growth-oriented, distribution of responsibilities between team members
  • Providing coaching and counseling via mentoring, one-on-one meetings, etc.
What You'll Add to DigitalOcean:
  • 7+ years of experience in software engineering, which should include 4+ years of distributed systems development, 2+ years building AI/ML technologies (ideally related to LLM hosting and inference), and 2+ years in a people management or team lead role.
  • A passion for leading, coaching, and mentoring software engineers
  • Enduring interest in distributed systems design, AI/ML, and implementation at scale in the cloud.
  • Deep expertise in cloud computing platforms and modern AI/ML technologies
  • Experience with modern LLMs, ideally related to hosting, serving, and optimizing such models
  • Experience researching, evaluating, and building with open source technologies
  • Proficiency in programming languages commonly used in cloud development, such as Python and Go
  • Experience with infrastructure as code (IaC) tools like Terraform or Ansible
  • Experience with various GPU platforms from AMD and NVIDIA and associated toolsets for tuning, configuring, and accelerating workloads on them would be ideal, but not required
  • Knowledge of networking concepts (e.g., TCP/IP, VPCs, subnets, routing) and storage systems
  • A strong sense of ownership and a drive to figure out and resolve any issues preventing you and your team from delivering value to your customers
  • An appreciation for process and developing cross-disciplinary collaboration between engineering, operations, support, and product groups
  • Strong project management skills
  • Familiarity with end-to-end quality best practices and their implementation
  • Enthusiasm for staffing, interviewing, growing, and retaining teams
  • Experience coordinating with partner teams across time zones and geographies
Compensation Range:
  • $176,000 - $220,000

  • This is a remote role


Why You'll Like Working for DigitalOcean
  • We innovate with purpose. You'll be a part of a cutting-edge technology company on an upward trajectory, one that is proud to simplify cloud and AI so builders can spend more time creating software that changes the world. As a member of the team, you will be a Shark who thinks big, bold, and scrappy, like an owner with a bias for action and a powerful sense of responsibility for customers, products, employees, and decisions.
  • We prioritize career development. At DO, you'll do the best work of your career. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that will always challenge you to think big. Our organizational development team will provide you with resources to ensure you keep growing. We provide employees with reimbursement for relevant conferences, training, and education. All employees have access to LinkedIn Learning's 10,000+ courses to support their continued growth and development.
  • We care about your well-being. Regardless of your location, we will provide you with a competitive array of benefits to support you, from our Employee Assistance Program to Local Employee Meetups to a flexible time-off policy, to name a few. While the philosophy around our benefits is the same worldwide, specific benefits may vary based on local regulations and preferences.
  • We reward our employees. The salary range for this position is based on market data, relevant years of experience, and skills. You may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees, including equity grants upon hire and the option to participate in our Employee Stock Purchase Program.
  • DigitalOcean is an equal-opportunity employer. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

