Staff Engineer, Cloud Infrastructure

2 weeks ago


San Francisco, United States Amplitude Full time

Amplitude is a leading digital analytics platform that helps companies unlock the power of their products. More than 3,500 customers, including Atlassian, Jersey Mike's, NBCUniversal, Shopify, and Under Armour, rely on Amplitude to gain self-service visibility into the entire customer journey. Amplitude guides companies every step of the way as they capture data they can trust, uncover clear insights about customer behavior, and take faster action. When teams understand how people are using their products, they can deliver better product experiences that drive growth.


As an organization, we approach challenges with humility, take ownership of our contributions, and embrace a growth mindset that pushes us to constantly improve ourselves, each other, and the value we bring to customers and partners.


Amplitude's Commitment to Diversity, Equity & Inclusion (DEI): Amplitude believes that diversity enables the creation of better products, improves the ability to solve complex problems, and drives more powerful solutions. We strive to create an environment of inclusion - one focused on psychological safety, empathy, and human connection - that will allow employees of all backgrounds to thrive.

Staff Engineer, Cloud Infrastructure


About the Role:


We are looking for a highly experienced and collaborative Staff Cloud Infrastructure Engineer to join our team. You will be responsible for the design, automation, and optimization of our Kubernetes-based platform, ensuring that it is scalable, easy to use, and reliable. This is a critical role for our company, and we are seeking someone who not only has deep technical expertise in cloud infrastructure and Kubernetes but also values mentoring, collaboration, and open communication. Your work will directly impact our developer productivity by building systems and abstractions that simplify the deployment of new workloads, making it easy for developers to focus on building features, not infrastructure.


In this role, you will help drive a cultural shift in how our Platform team operates, working to create positive relationships across the company, building trust, and making our platform easier to use. We value someone who listens, learns, and communicates effectively while still ensuring high technical standards and reliability.


Key Responsibilities:


  • Lead the design, implementation, and management of our Kubernetes-based platform, focusing on scalability, developer experience, and system reliability.
  • Architect and maintain automation around Kubernetes, ensuring that the platform is easy for developers to use and requires minimal toil to deploy or modify workloads in a self-service model.
  • Collaborate with cross-functional teams (developers, leaders, and other infrastructure teams) to gather requirements, build consensus, and deliver impactful solutions.
  • Integrate observability into the platform, using tools like Datadog, Prometheus, Grafana, New Relic, and Splunk to monitor system health and performance.
  • Drive infrastructure-as-code initiatives using tools like Kubernetes Operators, Helm, Kustomize, and Terraform, promoting automation, repeatability, and reliability.
  • Ensure that the platform integrates seamlessly with CI/CD pipelines (using Argo CD / Workflows / Rollouts, GitHub Actions, Jenkins, or similar) and continuously improve developer workflows.
  • Contribute to the operational excellence of the platform, including on-call responsibilities and incident management, while building self-healing capabilities where possible.
  • Act as a mentor to other engineers on the team, promoting growth and knowledge sharing, ensuring that the team thrives even in the absence of specific individuals.
  • Foster a culture of collaboration, empathy, and trust within the team and across departments, helping to bridge gaps between engineering and other business functions.
  • Take a hands-on approach to problem-solving, sometimes submitting PRs to resolve issues in codebases or providing detailed solutions when teams need assistance.

What We're Looking For:


  • 8+ years of experience in some combination of cloud-native software development, platform engineering, site reliability engineering, and/or cloud infrastructure, with a more recent focus on Kubernetes and the cloud-native ecosystem.
  • Strong expertise in Kubernetes and related CNCF projects (e.g., Argo CD/Workflows, Backstage, Envoy, CoreDNS, and more) and in simplifying complex cloud infrastructure for broader teams.
  • Operational experience at scale with technologies like Kafka and Airflow.
  • Proficient in common infrastructure languages like Golang, Python, and Terraform, with experience developing and operating production systems.
  • Extensive experience with AWS cloud infrastructure, networking, and security.
  • Proven experience with monitoring and observability tools (Datadog, Splunk, Prometheus, Grafana Cloud, etc.) and a strong understanding of system performance tuning.
  • Expertise in building abstractions over Kubernetes to simplify developer interaction with the platform.
  • Excellent communication skills, with the ability to collaborate across teams, build consensus, and drive initiatives in a high-pressure environment.
  • High level of empathy and patience, with a commitment to mentoring and helping others succeed, and the ability to incorporate feedback and turn it into actionable improvements.
  • Experience with infrastructure-as-code and automation (Terraform, Helm, Kustomize, etc.), with a focus on reducing toil and operational overhead.
  • A mindset focused on improving the developer experience and business alignment, with the flexibility to make decisions that may go against ideal technical preferences when necessary.

