Site Reliability Engineer

1 week ago


Denver, United States RingCentral Full time

Say hello to opportunities. It’s not every day that you consider starting a new career. We’re RingCentral, and we’re happy that someone as talented as you is considering this role. First, a little about us, we’re a $2 Billion annual revenue company with double digit Annual Recurring Revenue (ARR) and a $93 Billion market opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more than $250 million annually to ensure our AI-enabled technology and platforms meet or exceed the needs of our customers. The RingCentral Collaboration Group includes the Messaging Backend, Front End Client Apps, various parts of our overall AI Features and several internal tools. This is where you and your skills come in. We’re currently looking for an experienced Site Reliability Engineer (SRE) to join the RingCentral Collaboration team. As an SRE, you will be responsible for maintaining and improving uptime and availability across several of our services. You will play a crucial role in ensuring the reliability, performance, and availability of our services by identifying potential issues, and proactively resolving them. The ideal candidate should have a background in various service observability platforms as well as experience with containerization using Kubernetes, message queuing systems like Kafka, and SQL/NoSQL databases. Programming experience is desired for the role. Job Duties: Collaborate with development and operations teams to integrate monitoring solutions into the software development lifecycle and operational processes. Define, propose, and drive efforts to continually improve monitoring, troubleshooting, and self-healing for our services. Design and implement redundancy, failover mechanisms, and load-balancing strategies to ensure system reliability. Conduct risk assessments and identify potential points of failure in the infrastructure and propose solutions to fix it. Respond to (on-call) and take actions to mitigate incidents and outages. Be on top of capacity requirements in a growing environment. Actively work with various teams’ codebases to extend observability and improve uptime. Represent the team in global incidents resolution, and participate in on-call rotation. To succeed in this role you must have experience in: Proven experience as an SRE or similar role of 6+ years. Problem-solving and troubleshooting skills. Linux in-depth knowledge. Knowledge of one of the programming languages (see Preferable technology stack). Experience with cloud platforms. Knowledge of one or more of the configuration management tools. Ability to work in a diverse multicultural environment, communicating with globally distributed teams. Team player with self-start ability and strong drive to dig deeply and solve problems. Fluent in spoken and written English. Preferable Technology Stack: OS: Linux (CentOS/RedHat/Oracle/Amazon Linux) Programming languages: Python, JavaScript, Go, Java Cloud: AWS, Azure, GCP Containerization: Kubernetes Distributed Log: Kafka, ELK stack Monitoring: Zabbix, Prometheus, Alertmanager, Grafana DBs: VictoriaMetrics, MongoDB, PostgreSQL, MySQL IaaC: Ansible, Terraform GitOps: ArgoCD CI: Gitlab CI, Jenkins VCS: GitLab HA: Nginx Proxy Desired Qualifications: B.S in Computer Engineering, Computer Science, or equivalent experience with 4+ years of related experience. Proven experience with influencing the software engineering of cloud/SaaS services. Familiarity with AI, LLM, and various related technologies. Deep understanding of the DevOps Lifecycle and application of it within organizations. What we offer: Comprehensive medical, dental, vision, disability, life insurance. Health Savings Account (HSA), Flexible Spending Account (FSAs) and Commuter benefits. 401K match and ESPP. Paid time off and paid sick leave. Wellness programs including 1:1 coaching and meditation guidance. Paid parental and pregnancy leave and new parent gift boxes. Family-forming benefits (IVF, Preservation, Adoption etc.). Emergency backup care (Child/Adult/Pets). Pet insurance and Pet Telehealth. Employee Assistance Program (EAP) with counseling sessions available 24/7. Free legal services that provide legal advice, document creation and estate planning. Employee bonus referral program. Student loan refinancing assistance. Employee perks and discounts program. RingCentral’s Engineering team works on high-complexity projects that set the standard for performance and reliability at massive scale. This is your chance to help imagine, develop and deliver products that raise the technological bar, and power human connections. If you’re a talented, ambitious, creative thinker, RingCentral is the perfect environment to join a world-class team and bring your ideas to life. RingCentral’s work culture is the backbone of our success. We are recognized as a Best Place to Work by Glassdoor, the Top Work Culture by Comparably and hold local BPTW awards in every major location. We are committed to hiring and retaining great people because we know you power our success. RingCentral offers on-site, remote and hybrid work options optimized for the ways we work and live now. About RingCentral RingCentral, Inc. (NYSE: RNG) is a leading provider of business cloud communications and contact center solutions based on its powerful Message Video Phone (MVP) global platform. More flexible and cost-effective than legacy on-premises PBX and video conferencing systems that it replaces, RingCentral empowers modern mobile and distributed workforces to communicate, collaborate, and connect via any mode, any device, and any location. RingCentral is headquartered in Belmont, California, and has offices around the world. RingCentral is an equal opportunity employer that truly values diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We are committed to providing reasonable accommodations for individuals with disabilities during our application and interview process. If you require such accommodations, please click on the following link to learn more about how we can assist you. If you are hired in Colorado the compensation range for this position is between $107,100 and $153,000 for full-time employees, in addition to eligibility for variable pay, equity, and benefits. Benefits may include, but are not limited to, health and wellness, 401k, ESPP, vacation, parental leave, and more The salary may vary depending on your location, skills, and experience. #J-18808-Ljbffr



  • Denver, Colorado, United States Prove Full time

    About Prove Embracing the shift towards a mobile-first economy, businesses are seeking to modernize their approach to engaging and empowering consumers. Prove offers innovative solutions through phone-centric identity tokenization and cryptographic authentication. With a focus on reducing friction, enhancing security, and accelerating revenues, Prove serves...


  • Denver, United States Entrust Full time

    Position Overview: The IFI Cloud Service includes a wide array of components including web services, application servers, and databases. The Site Reliability Engineer (SRE) will be responsible for deploying and maintaining the IFIaaS applications in Hybrid Cloud environments. Ultimately, the candidate will be responsible for the functional management of all...


  • Denver, United States Entrust Full time

    Position Overview:The IFI Cloud Service includes a wide array of components including web services, application servers, and databases. The Site Reliability Engineer (SRE) will be responsible for deploying and maintaining the IFIaaS applications in Hybrid Cloud environments. Ultimately, the candidate will be responsible for the functional management of all...


  • Denver, United States Entrust Full time

    Position Overview:The IFI Cloud Service includes a wide array of components including web services, application servers, and databases. The Site Reliability Engineer (SRE) will be responsible for deploying and maintaining the IFIaaS applications in Hybrid Cloud environments. Ultimately, the candidate will be responsible for the functional management of all...


  • Denver, United States DAT Freight Solutions Full time

    About DAT DAT is an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on...


  • Denver, Colorado, United States RingCentral Full time

    Say hello to opportunities.It's not every day that you consider starting a new career. We're RingCentral, and we're happy that someone as talented as you is considering this role. First, a little about us, we're a $2 Billion annual revenue company with double digit Annual Recurring Revenue (ARR) and a $93 Billion market opportunity in UCaaS, Contact Center...


  • Denver, Colorado, United States S&P Global Full time

    About the Role:As a Site Reliability Engineer at S&P Global, you will be part of a dynamic team that works closely with the Business, RSO, Developers, and Product teams to enhance the stability of our Pega-based workflow applications. Although we do not develop code, we have the ability to analyze existing code, replicate issues in lower environments, and...


  • Denver, United States Dat Services Inc Full time

    About DAT DATis an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on...


  • Denver, Colorado, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key ResponsibilitiesDesign and implement robust cloud infrastructure using innovative principlesCollaborate with application development...


  • Denver, Colorado, United States RingCentral Full time

    About the RoleWe are seeking a highly skilled Cloud Site Reliability Engineer to join our team at RingCentral. As a key member of our Cloud Engineering team, you will be responsible for ensuring the high availability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign and Implement High Availability Solutions: Collaborate...


  • Denver, Colorado, United States Vertafore Full time

    Position OverviewWe are seeking a dedicated and enthusiastic individual to join our team as a Junior Site Reliability Engineer. This role is crucial in ensuring the continuous availability and reliability of our software solutions.CompensationSalary range: $65,000 - $75,000 + Performance BonusCompany BackgroundVertafore is a prominent technology firm that is...


  • Denver, United States ALSTOM Full time

    Req ID: 464509 At Alstom, we understand transport networks and what moves people. From high-speed trains, metros, monorails, and trams, to turnkey systems, services, infrastructure, signalling and digital mobility, we offer our diverse customers the broadest portfolio in the industry. Every day, more than 80,000 colleagues lead the way to greener and...


  • Denver, United States Cisco Full time

    We Are Cisco We're so happy you're thinking of joining us. Follow us on social @WeAreCisco to learn more about what employees say about why we love where we work, or check Cisco out on Glassdoor for the latest reviews. What You'll Do Think back on the latest significant internet outages and how they reinvented everyday life – even a few hours can halt...


  • Denver, Colorado, United States Tipico - North America Full time

    Company DescriptionFounded in Europe in 2004, Tipico is now a licensed U.S. Sportsbook operating in New Jersey, Iowa, Ohio, and Colorado. Renowned in Germany and globally, Tipico offers online betting across 30 sports. Guided by values such as innovation and inclusion, Tipico focuses on creating top-notch mobile sports betting and casino products. Recently...

  • Reliability Engineer

    2 weeks ago


    Denver, United States Alstom Full time

    Req ID:464509    At Alstom, we understand transport networks and what moves people. From high-speed trains, metros, monorails, and trams, to turnkey systems, services, infrastructure, signalling and digital mobility, we offer our diverse customers the broadest portfolio in the industry. Every day, more than 80 000 colleagues lead the way to...


  • Denver, United States VIZIO Full time

    About the Team: We live and breathe big data. On a daily basis, we ingest and extract useful information from hundreds of live TV channels as well as collect, analyze and report on information from millions of TVs. Today, with over 23 million devices and operating at a massive scale leveraging modern architecture, design and technologies. As any organization...


  • Denver, Colorado, United States Fruition Full time

    About the RoleFruition, a leading software development company, is seeking an experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in improving our Continuous Integration/Continuous Deployment (CI/CD) process, ensuring the smooth delivery of high-quality web solutions to our clients.Key ResponsibilitiesCI/CD...


  • Denver, Colorado, United States Salesforce Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Salesforce. As a Site Reliability Engineer, you will play a critical role in ensuring the health and performance of our cloud infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud infrastructure to support our business...


  • Denver, Colorado, United States Xcel Energy Full time

    Are you looking for an exciting job where you can put your skills and talents to work at a company you can feel proud to be a part of? Do you want a workplace that will challenge you and offer you opportunities to learn and grow? A position at Xcel Energy could be just what you're looking for.This position is based on site in Denver, Colorado, Senior or...


  • Denver, United States S&P Global Full time

    About the Role: Grade Level (for internal use): 11 The Team: SRE team members work together with the Business, RSO, Developers and Product team members to enhance the stability of our Pega based workflow applications. Although we do not develop code, we have the ability to look at the existing code, replicate issues in lower environments, and provide...