Senior Site Reliability Engineer

3 weeks ago


Palo Alto, United States Plume Design Inc Full time

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 50 million locations globally and have managed over 2.5 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Communications Service Providers (CSPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

Opportunity

We’re looking for infrastructure engineers who are driven, independent, and like to learn, exchange ideas, and collaborate. Advanced experience with our tooling and tech stack is not essential; familiarity with standard infrastructure concepts and production troubleshooting will be a differentiator.

What You’ll Do 

  • Focus on Production operations/matters and on-call 
  • Provide live customer support as needed
  • Provision and scale multi-datacenter Kubernetes Infrastructure and Applications (EKS)
  • Deploy Software in multiple Production Environments
  • Own monitoring and alerting for production systems, including improvements and changes
  • Contribute improvements to the current automation 
  • Contribute improvements to our on-call process and alerting

What You’ll Bring

  • Availability to join the on-call rotation for Production issues
  • Availability to work with a distributed team across different time zones

Desired Skill Set

  • 4+ Years of experience with Production Troubleshooting
  • 1+ years of experience operating Kubernetes
  • 1+ years of basic Terraform knowledge
  • Experience both setting up and utilizing Monitoring and observability tools
    • e.g. New Relic, Nagios/Icinga, Grafana, Prometheus
  • 2+ years of Programming/Scripting experience in at least one language
    • e.g. Perl, Python, PHP, GoLang, Java
  • 8+ years of experience with modern Linux Operating systems (Enterprise Linux or Debian based)
  • 6+ years of experience with modern cloud infrastructure, preferably AWS
  • Bachelor’s degree in related field or equivalent experience

Differentiators

  • Troubleshooting production performance/service degradation or outage issues at scale
  • Experience with Infrastructure Troubleshooting in VMs and/or Bare Metal (ssh/Linux)
  • Experience with direct customer support
  • Advanced Kubernetes knowledge
  • Advanced Terraform knowledge
  • Experience operating NoSQL Databases in Production
  • Experience operating Relational Databases in Production
  • General Configuration Management experience

The total compensation package includes an anticipated salary range of $144,000.00 to $169,500.00, plus bonus, equity, and benefits. Benefits include a 401k plan with a company match, basic life insurance, and unparalleled health, dental, vision, and other benefits and perks. An employee’s base salary and position within the range may depend on a number of factors including job-related knowledge, education, skills, experience, and other business-related considerations. Published ranges are provided in good faith at the time of posting.

This is NOT a remote role. This is HYBRID and requires someone to work in our Palo Alto, CA office 3 days a week. We are unable to offer relocation assistance at this time. 

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for CSPs and their subscribers, Plume partners with over 350 CSP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows CSPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Backed by investors such as Insight Partners and SoftBank Vision Fund 2, Plume is now valued at $2.6B, having added over $500M in funding in 2021 alone.

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.


