Sr Staff Engineer, Platform

4 weeks ago


San Francisco, United States Amplitude Full time

Amplitude is a leading digital analytics platform that helps companies unlock the power of their products. More than 3,200 customers, including Atlassian, Jersey Mike's, NBCUniversal, Shopify, and Under Armour, rely on Amplitude to gain self-service visibility into the entire customer journey. Amplitude guides companies every step of the way as they capture data they can trust, uncover clear insights about customer behavior, and take faster action. When teams understand how people are using their products, they can deliver better product experiences that drive growth.


As an organization, we approach challenges with humility, take ownership of our contributions, and embrace a growth mindset that pushes us to constantly improve ourselves, each other, and the value we bring to customers and partners.


Amplitude's Commitment to Diversity, Equity & Inclusion (DEI): Amplitude believes that diversity enables the creation of better products, improves the ability to solve complex problems, and drives more powerful solutions. We strive to create an environment of inclusion - one focused on psychological safety, empathy, and human connection - that will allow employees of all backgrounds to thrive.

Senior Staff Platform Engineer (Kubernetes)


About the Role:


We are looking for a highly experienced and collaborative Senior Staff Platform Engineer to join our team. You will be responsible for the design, automation, and optimization of our Kubernetes-based platform, ensuring that it is scalable, easy to use, and reliable. This is a critical role for our company, and we are seeking someone who not only has deep technical expertise in cloud infrastructure and Kubernetes but also values mentoring, collaboration, and open communication. Your work will directly impact our developer productivity by building systems and abstractions that simplify the deployment of new workloads, making it easy for developers to focus on building features, not infrastructure.


In this role, you will help drive a cultural shift in how our Platform team operates, working to create positive relationships across the company, building trust, and making our platform easier to use. We value someone who listens, learns, and communicates effectively while still ensuring high technical standards and reliability.


Key Responsibilities:


  • Lead the design, implementation, and management of our Kubernetes-based platform, focusing on scalability, developer experience, and system reliability.
  • Architect and maintain automation around Kubernetes, ensuring that the platform is easy for developers to use and requires minimal toil to deploy or modify workloads in a self-service model.
  • Collaborate with cross-functional teams (developers, leaders, and other infrastructure teams) to gather requirements, build consensus, and deliver impactful solutions.
  • Integrate observability into the platform, using tools like Datadog, Prometheus, Grafana, New Relic, and Splunk to monitor system health and performance.
  • Drive infrastructure-as-code initiatives using tools like Kubernetes Operators, Helm, Kustomize, and Terraform, promoting automation, repeatability, and reliability.
  • Ensure that the platform integrates seamlessly with CI/CD pipelines (using Argo CD / Workflows / Rollouts, GitHub Actions, Jenkins, or similar) and continuously improve developer workflows.
  • Contribute to the operational excellence of the platform, including on-call responsibilities and incident management, while building self-healing capabilities where possible.
  • Act as a mentor to other engineers on the team, promoting growth and knowledge sharing, ensuring that the team thrives even in the absence of specific individuals.
  • Foster a culture of collaboration, empathy, and trust within the team and across departments, helping to bridge gaps between engineering and other business functions.
  • Take a hands-on approach to problem-solving, sometimes submitting PRs to resolve issues in codebases or providing detailed solutions when teams need assistance.

What We're Looking For:


  • 8+ years of experience in some combination of cloud-native software development, platform engineering, site reliability engineering, and/or cloud infrastructure, with a more recent focus on Kubernetes and the cloud-native ecosystem.
  • Strong expertise in Kubernetes and related CNCF projects (e.g., Argo CD/Workflows, Backstage, Envoy, CoreDNS, and more) and in simplifying complex cloud infrastructure for broader teams.
  • Operational experience at scale with technologies like Kafka and Airflow.
  • Proficiency in common infrastructure languages like Golang, Python, and Terraform, with experience developing and operating production systems.
  • Extensive experience with AWS cloud infrastructure, networking, and security.
  • Proven experience with monitoring and observability tools (Datadog, Splunk, Prometheus, Grafana Cloud, etc.) and a strong understanding of system performance tuning.
  • Expertise in building abstractions over Kubernetes to simplify developer interaction with the platform.
  • Excellent communication skills, with the ability to collaborate across teams, build consensus, and drive initiatives in a high-pressure environment.
  • High level of empathy and patience, with a commitment to mentoring and helping others succeed, and the ability to incorporate feedback and turn it into actionable improvements.
  • Experience with infrastructure-as-code and automation (Terraform, Helm, Kustomize, etc.), with a focus on reducing toil and operational overhead.
  • A mindset focused on improving the developer experience and business alignment, with the flexibility to make decisions that may go against ideal technical preferences when necessary.
