Staff Site Reliability Engineer, Kubernetes

Found in: beBee jobs US - 2 weeks ago


New York, New York, United States Peloton Interactive Full time

ABOUT THE ROLE

Peloton is seeking an outstanding Staff Site Reliability Engineer with an EKS (Kubernetes) focus. This person will work with teams across the Platform organization to help build and maintain a multi-cluster, multi-region, reliable, and highly scalable Kubernetes platform. In this role, you will have a rare opportunity to work with groundbreaking technologies that drive innovation and keep workloads running reliably in a flexible, scalable, and secure way. You will be a technical leader within your team, influencing and driving technical investments across partner teams with a "Platform Thinking" approach.

YOUR DAILY IMPACT AT PELOTON

You will help others in design, execution, and problem-solving.
Host critical infrastructure that ensures our developers have the best experience possible on thousands of Kubernetes pods across multiple clusters.
Develop and lead our Container Orchestration Platform, overseeing all aspects of a diverse ecosystem of over 2,000 applications. This includes Multi-Cluster/Multi-tenant Kubernetes with 15+ clusters per environment, Istio Multi-cluster Mesh, and an AWS multi-account structure.
Architect, develop, test, release, and support CI/CD systems such as ArgoCD, Jenkins, GitHub Actions, Gradle, and Artifactory.
Adhere to standard methodologies in architectural design, testing (unit, integration, visual, and regression), and Scrum.
Evaluate developer platform designs, technical decisions, and code to ensure all are high quality, efficient, and well documented.
Assist in planning, execution, and updating of technical roadmaps.
Drive fast, automatic scaling for teams across Peloton, including services powering the Peloton Connected Fitness devices (Bike, Row, Tread, Guide), Peloton digital experiences, and the eCommerce platform.
Design, enhance, and implement additional services for our centralized Observability Platforms, ensuring efficient log management based on Splunk and effective monitoring and alerting powered by Datadog and PagerDuty.
Design, build, and automate new solutions centered around the Kubernetes container orchestration platform and its ecosystem of projects
Provide a platform for machine learning (and other exciting workloads)
Allow developers to move quickly and experiment, without getting in the way
Promote standard methodologies for building and operating highly reliable systems
Automate everything, from infrastructure down to day-to-day tasks
Drive incident management processes, following industry practices, conducting timely post-mortems of infrastructure incidents, and exercising sound judgment in knowing when to triage and when to dive into a root-cause analysis
Assist with operational security and compliance: seek out potential threats to security and reliability and advocate solutions
Participate in a rotating on-call duty schedule, providing support and assistance for the services within our team's responsibility

YOU BRING TO PELOTON

Master's degree in Computer Science, Engineering, or a similar field of study or equivalent work experience
8+ years of experience in software engineering, with a proven understanding of Kubernetes and Infrastructure as Code
4+ years of systems configuration and automation experience (e.g. Ansible, Chef, Puppet, Terraform)
Extensive knowledge and hands-on experience in AWS Cloud infrastructure and services, including CI/CD and IaC provisioning tools such as Jenkins, ArgoCD, Scalr, Terraform, and GitHub Actions
Experience in a cloud environment like AWS or GCP, and familiarity with running containerized services
Experience with a programming language like Python, Golang, or Java.
Knowledge of best practices in observability and monitoring for Kubernetes clusters at scale with experience in cost optimization tools like Kubecost, Goldilocks, etc.
Knowledge of standard processes for securing a Kubernetes cluster and its deployments at scale

BONUS

Passion for working with development teams making the transition to a container-native world
Passion for reliable, scalable, observable software with a sense of ownership
Experience crafting and operating large, reliable, and scalable distributed systems
Knowledge of network infrastructure basics, including DNS, DHCP, firewalling, and load balancing, to facilitate multi-functional collaboration.

Peloton Interactive focuses on Hardware, Retail, Fitness, Video Streaming, and Android. The company has offices in New York City and a very large team. To date, Peloton Interactive has raised $1.041B in funding; its latest round closed in August 2018 at a valuation of $4.125B.