Site Reliability Engineer

5 days ago


Denver, United States RingCentral, Inc Full time

Say hello to opportunities.

It‘s not every day that you consider starting a new career. We‘re RingCentral, and we‘re happy that someone as talented as you is considering this role. First, a little about us, we‘re a $2 Billion annual revenue company with double digit Annual Recurring Revenue (ARR) and a $93 Billion market opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more than $250 million annually to ensure our AI-enabled technology and platforms meet or exceed the needs of our customers.

The RingCentral Collaboration Group includes the Messaging Backend, Front End Client Apps, various parts of our overall AI Features and several internal tools.

This is where you and your skills come in.

We‘re currently looking for: An experienced Site Reliability Engineer (SRE) to join the RingCentral Collaboration team. As a SRE, you will be responsible for maintaining and improving uptime and availability across several of our services. You will play a crucial role in ensuring the reliability, performance, and availability of our services by identifying potential issues, and proactively resolving them. The ideal candidate should have a background in various service observability platforms as well as experience with containerization using Kubernetes, message queuing systems like Kafka, and SQL/NoSQL databases. Programming experience is desired for the role.

Job Duties:

  • Collaborate with development and operations teams to integrate monitoring solutions into the software development lifecycle and operational processes.

  • Define, propose, and drive efforts to continually improve monitoring, troubleshooting, and self-healing for our services.

  • Design and implement redundancy, failover mechanisms, and load-balancing strategies to ensure system reliability.

  • Conduct risk assessments and identify potential points of failure in the infrastructure and propose solutions to fix it.

  • Respond to (on-call) and take actions to mitigate incidents and outages.

  • Be on top of capacity requirements in a growing environment.

  • Actively work with various teams‘ codebases to extend observability and improve uptime.

  • Represent the team in global incidents resolution, and participate in on-call rotation


To succeed in this role you must have experience in:

  • Proven experience as an SRE or similar role of 6+ years.

  • Problem-solving and troubleshooting skills.

  • Linux in-depth knowledge.

  • Knowledge of one of the programming languages (see Preferable technology stack).

  • Experience with cloud platforms.

  • Knowledge of one or more of the configuration management tools.

  • Ability to work in a diverse multicultural environment, communicating with globally distributed teams.

  • Team player with self-start ability and strong drive to dig deeply and solve problems.

  • Fluent in spoken and written English.

Preferable Technology Stack:

  • OS: Linux (CentOS/RedHat/Oracle/Amazon Linux)

  • Programming languages: Python, JavaScript, Go, Java

  • Cloud: AWS, Azure, GCP

  • Containerization: Kubernetes

  • Distributed Log: Kafka, ELK stack

  • Monitoring: Zabbix, Prometheus, Alertmanager, Grafana

  • DBs: VictoriaMetrics, MongoDB, PostgreSQL, MySQL

  • IaaC: Ansible, Terraform

  • GitOps: ArgoCD

  • CI: Gitlab CI, Jenkins

  • VCS: GitLab

  • HA: Nginx Proxy



Desired Qualifications:

  • B.S in Computer Engineering, Computer Science, or equivalent experience with 4+ years of related experience

  • Proven experience with influencing the software engineering of cloud/SaaS services

  • Familiarity with AI, LLM, and various related technologies

  • Deep understanding of the DevOps Lifecycle and application of it within organizations

What we offer:

  • Comprehensive medical, dental, vision, disability, life insurance

  • Health Savings Account (HSA), Flexible Spending Account (FSAs) and Commuter benefits

  • 401K match and ESPP

  • Paid time off and paid sick leave

  • Wellness programs including 1:1 coaching and meditation guidance

  • Paid parental and pregnancy leave and new parent gift boxes

  • Family-forming benefits (IVF, Preservation, Adoption etc.)

  • Emergency backup care (Child/Adult/Pets)

  • Pet insurance and Pet Telehealth

  • Employee Assistance Program (EAP) with counseling sessions available 24/7

  • Free legal services that provide legal advice, document creation and estate planning

  • Employee bonus referral program

  • Student loan refinancing assistance

  • Employee perks and discounts program

RingCentral‘s Engineering team works on high-complexity projects that set the standard for performance and reliability at massive scale. What kind of scale? Millions of users today and hundreds of millions tomorrow. This is your chance to help imagine, develop and deliver products that raise the technological bar, and power human connections. If you‘re a talented, ambitious, creative thinker, RingCentral is the perfect environment to join a world class team and bring your ideas to life.

RingCentral‘s work culture is the backbone of our success. And don‘t just take our word for it: we are recognized as a Best Place to Work by Glassdoor, the Top Work Culture by Comparably and hold local BPTW awards in every major location. Bottom line: We are committed to hiring and retaining great people because we know you power our success. RingCentral offers on-site, remote and hybrid work options optimized for the ways we work and live now.

About RingCentral

RingCentral, Inc. (NYSE: RNG) is a leading provider of business cloud communications and contact center solutions based on its powerful Message Video Phone(MVP) global platform. More flexible and cost effective than legacy on-premises PBX and video conferencing systems that it replaces, RingCentral empowers modern mobile and distributed workforces to communicate, collaborate, and connect via any mode, any device, and any location. RingCentral is headquartered in Belmont, California, and has offices around the world.

RingCentral is an equal opportunity employer that truly values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We are committed to providing reasonable accommodations for individuals with disabilities during our application and interview process. If you require such accommodations, please click on the following link to learn more about how we can assist you.

If you are hired in Colorado the compensation range for this position is between $107,100 and $153,000 for full-time employees, in addition to eligibility for variable pay, equity, and benefits. Benefits may include, but are not limited to, health and wellness, 401k, ESPP, vacation, parental leave, and more The salary may vary depending on your location, skills, and experience.



  • Denver, United States Top Secret Clearance Jobs Full time

    About the job Site Reliability Engineer Top Secret Clearance Jobs is dedicated to helping those with the most exclusive security clearance find their next career opportunity and get interviews within 48 hours. Altamira Technologies has a long and successful history providing innovative solutions throughout the U.S. National Security community. Headquartered...


  • Denver, United States Altamira Technologies Full time

    Description Altamira Technologies has a long and successful history providing innovative solutions throughout the U.S. National Security community. Headquartered in McLean, Virginia, Altamira serves the defense, intelligence and homeland security communities worldwide by focusing on creating innovative solutions leveraging common standards in architecture,...


  • Denver, Colorado, United States Oracle Full time

    About the JobWe are looking for an experienced Site Reliability Engineer - SRE Lead to join our team at Oracle. As an SRE, you will be responsible for ensuring the reliability and scalability of our cloud-based services.Key Responsibilities:Design and implement scalable cloud architecturesDevelop and maintain monitoring and alerting systemsCollaborate with...


  • Denver, United States Saxon Global Full time

    Title: Principal Site Reliability EngineerLocation: Denver, CO - On-Site Required Duration: 6 MonthsDescription: Responsible for providing the primary management, administration, support, and ongoing maintenance of production platforms within a 24x7x365 environment and data center environment. This role must be able to understand, execute, and document...


  • Denver, Colorado, United States Lumen Inc Full time

    About LumenLumen is a leading technology company that connects people, data and applications – quickly, securely, and effortlessly. Our mission is to ignite business growth by delivering innovative solutions that meet the evolving needs of our customers.We're committed to creating a culture and company from the people up – built on teamwork, trust and...


  • Denver, CO, United States LeoVegas Group Full time

    ABOUT THE ROLE Site Reliability Engineering (SRE) is a critical part of our platform strategy. As a member of the SRE team, you will focus on providing technical expertise and support to our engineering teams to enable them to deliver high-quality software solutions efficiently. This includes helping teams with technical continuous effort, overcoming major...

  • Reliability Engineer

    6 hours ago


    Denver, United States Aramco Full time

    Aramco energizes the world economy. If the following job requirements and experience match your skills, please ensure you apply promptly. Aramco occupies a unique position in the global energy industry. We are the world's largest producer of hydrocarbons (oil and gas), with the lowest upstream carbon intensity of any major producer.  With our significant...

  • Reliability Engineer

    12 hours ago


    Denver, CO, United States Aramco Full time

    Aramco energizes the world economy. Aramco occupies a unique position in the global energy industry. We are the world's largest producer of hydrocarbons (oil and gas), with the lowest upstream carbon intensity of any major producer.  With our significant investment in technology and infrastructure, we strive to maximize the value of the energy we produce...


  • Denver, Colorado, United States Bimbo Bakeries USA Full time

    About the RoleWe are seeking a highly skilled Reliability Engineering Specialist to join our team at Bimbo Bakeries USA. In this role, you will play a critical part in ensuring the reliability and efficiency of our equipment and processes.Key ResponsibilitiesDevelop and execute training programs for maintenance personnel to enhance equipment reliability and...

  • Reliability Engineer

    12 hours ago


    Denver, CO, United States Aramco Full time

    Aramco energizes the world economy. Aramco occupies a unique position in the global energy industry. We are the world's largest producer of hydrocarbons (oil and gas), with the lowest upstream carbon intensity of any major producer.  With our significant investment in technology and infrastructure, we strive to maximize the value of the energy we...


  • denver, United States Aramco Full time

    Aramco energizes the world economy. Aramco occupies a unique position in the global energy industry. We are the world's largest producer of hydrocarbons (oil and gas), with the lowest upstream carbon intensity of any major producer.  With our significant investment in technology and infrastructure, we strive to maximize the value of the energy we produce...


  • Denver, Colorado, United States SCRAM Systems Full time

    Job SummaryWe are seeking a skilled Cloud Reliability Engineer to join our team at SCRAM Systems. This position requires expertise in managing cloud and on-premises environments, ensuring seamless application delivery, scalability, and reliability.


  • Denver, United States Bimbo Bakeries USA Full time

    Maintenance Reliability Engineerreq41652 Employment Type: Regular Location: DENVER,CO Have you ever enjoyed Arnold, Brownberry or Oroweatbread? A Thomas English muffin or bagel? Or perhaps snacked on a Sara Lee,Entenmann s or Marinela cake or donut? If the answer is yes, then you knowBimbo Bakeries USA!More than 20,000 associates in bakeries, sales...

  • Reliability Engineer

    2 weeks ago


    Denver, Colorado, United States Saint-Gobain Full time

    About Saint-GobainSaint-Gobain is a leading international manufacturer of construction materials. We aim to deliver innovative solutions for our customers while ensuring the well-being of our employees and the communities we serve.Salary InformationThe estimated salary range for this position is $87,500 to $135,500 per year. In addition to base salary, this...


  • Denver, Colorado, United States Expert Executive Recruiters (EER Global) Full time

    A leading global company in the printing industry is seeking a Site Reliability Expert to deliver on-site preventative and corrective maintenance services.Key Responsibilities:Maintain company equipment, prepare sites for installations, and train customersProvide ongoing support to customers, ensuring smooth operation of productsAdminister customer calls,...


  • Denver, United States Amazon Development Center U.S., Inc. Full time

    Seeking a Leader to expand the AWS Region Reliability team in Denver! AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the...


  • Denver, Colorado, United States Oracle Full time

    Job OverviewSenior Site Reliability Engineer at Oracle is a pivotal role that empowers healthcare workers by normalizing Electronic Health Records (EHRs). This position involves designing, deploying, and optimizing Oracle Health applications, leveraging generative AI and modernized technologies.About the RoleWe are seeking a highly skilled engineer to join...


  • Denver, CO, United States S&P Global Full time

    About the Role: Grade Level (for internal use): 11 The Team: SRE team members work together with the Business, RSO, Developers and Product team members to enhance the stability of our Pega based workflow applications. Although we do not develop code, we have the ability to look at the existing code, replicate issues in lower environments, and provide...


  • Denver, United States Amazon Development Center U.S., Inc. Full time

    The AWS Intelligence Initiative program is hiring cleared builders for a unique opportunity to work with some of the best and brightest engineers and technical leaders. Builders undergo an intensive 3-6 month training and development program to learn about operational culture, enhance their technical skills through a hands-on job path curriculum, to obtain a...


  • Denver, Colorado, United States Fiore & Sons, Inc. Full time

    About Fiore & Sons, Inc.We are a family-owned business that extends our family approach to everyone who works with us. Our unparalleled retention rate is founded on caring leadership, healthy communication, and a very competitive benefits package. We strive to provide stability and opportunity for our people and are committed to career growth, with a strong...