Senior Site Reliability Engineer

2 days ago

Remote, Oregon, United States D-Wave Full time $124,545 per year

D-Wave (NYSE: QBTS), D-Wave is a leader in the development and delivery of quantum computing systems, software, and services. We are the world's first commercial supplier of quantum computers, and the only company building both annealing and gate-model quantum computers. Our mission is to help customers realize the value of quantum, today. Our quantum computers — the world's largest — feature QPUs with sub-second response times and can be deployed on-premises or accessed through our quantum cloud service, which offers 99.9% availability and uptime. More than 100 organizations trust D-Wave with their toughest computational challenges. With over 200 million problems submitted to our quantum systems to date, our customers apply our technology to address use cases spanning optimization, artificial intelligence, research and more. Learn more about realizing the value of quantum computing today and how we're shaping the quantum-driven industrial and societal advancements of tomorrow:

You can read more about our company and our innovations in the pages of The Wall Street Journal, Time Magazine, Fast Company, MIT Technology Review, Forbes, Inc. Magazine, Wired and across many whitepapers.

At D-Wave, we're helping customers realize the value of quantum computing today and are shaping the quantum-driven industrial and societal advancements of tomorrow.

About the role

We are seeking a talented and experienced Senior Site Reliability Engineer (SRE) to join our DevOps team. As a key member of the team, you will be responsible for the reliability of our SaaS product, our research laboratory, and the infrastructure supporting our production quantum computers worldwide. You will play a critical role in ensuring the reliability, scalability, and performance of our company's systems and infrastructure. The ideal candidate will have a strong background in systems administration, automation and troubleshooting complex distributed systems.

What you'll do

Refine, refactor, and evolve monitoring systems and related tools covering our workloads in AWS, GCP, on-premises, and remote field systems across the world
Work with teams including software and hardware engineering, processor development, cryogenics, and customer support to elicit requirements, collect and store metrics, analyze trends, and provide dashboards and other tooling to enable observability across the organization
Own the alerting with other SREs to support infrastructure and on-call management systems and ensure alerting is reliable and scalable
Work closely with the DevOps on and Test Engineering teams to enable instrumenting builds and deploys to ensure reliability through every step of the software development lifecycle

About You

4+ years of experience operating and troubleshooting SaaS/PaaS applications and environments on a major cloud platform – AWS and GCP preferred – including platform-specific monitoring technologies like Cloudwatch and Stackdriver
4+ years of experience with high level SRE work including incident management, process design, managing on-call rotations (with PagerDuty), and cross-training new and existing employees
Experience with on-premises compute, including servers, storage, power, virtualization, and networking equipment, including specifically using SNMP to monitor networked devices
4+ years of experience with AOS/Elasticsearch/Loki or similar log management tools
Experience with time series databases like Prometheus/InfluxDB, document stores like MongoDB, and classic relational databases like PostgreSQL, AWS Redshift, etc.
Proficiency in InfluxQL and PromQL
Significant expertise supporting and integrating analytics and monitoring systems such as ELK, Grafana, Prometheus, Zabbix, LibreNMS, Intermapper, etc.
At least two years of programming experience in Python, Go, Bash, Ruby, or equivalent
Degree in Computing Science, Engineering or equivalent education and experience
Excellent oral and written communication skills – you like to document your work

Bonus Points

3+ years specific experience with Elasticsearch / AWS OpenSearch, Fluent, Grafana Cloud
Experience with Kubernetes monitoring
Experience with producing synthetic metrics and instrumenting existing applications and platforms to extract metrics for analysis
Experience with OpenTelemetry
Proven record of cross-training and evangelizing observability as a critical aspect of all systems

A D-Waver's DNA

We look at the future and say "why not"; we see possibilities where others see problems or routines. We show the way ahead and are committed to achieving ambitious goals.
We practice straight talk and listen generously to each other with empathy. We value different opinions and points of views. We ensure that we connect outside as well as inside to learn from others and inspire each other.
We hold ourselves accountable for delivering results. We make decisions & take responsibility so that we can act & support each other.
As leaders we motivate & engage our teams to undertake beyond what they originally thought possible, by developing our teams & creating the conditions for people to grow and empower themselves through enabling & coaching.

Our Compensation Philosophy is Simple but Powerful:

We believe providing D-Wavers with company ownership, competitive pay, and a range of meaningful benefits is the start of creating a culture where people want to give the best they've got — not because they're simply making money, but because they've fallen in love with our vision, mission, values, and team.

During the interview process, your Recruiter will review our total rewards (base, equity, bonus, perks, benefit, culture) offerings. The final offer is determined by your proficiencies within this level.

Inclusion:

We celebrate diverse perspectives to drive innovation in our pursuit. Our employees range from distinguished domain experts with decades of experience in their respective fields, to bright and motivated graduates eager to make their mark. Our diverse and innovative team will make you feel appreciated, supported and empower your career growth at D-Wave.

The Fine Print:

No 3rd party candidates will be accepted

It is D-Wave Systems Inc. policy to provide equal employment opportunity (EEO) to all persons regardless of race, color, religion, sex, national origin, age, sexual orientation, gender identity, genetic information, physical or mental disability, protected veteran status, or any other characteristic protected by federal, state/provincial, local law.

The base pay range for this role is:

124, ,545 USD (Remote, United States)

124, ,545 CAD (Remote, Canada)

Senior Site Reliability Engineer II

19 hours ago

Remote, Oregon, United States Shutterfly Full time $106,000 - $151,000 per year

At Shutterfly, we make life's experiences unforgettable. We believe there is extraordinary power in the self-expression. That's why our family of brands helps customers create products and capture moments that reflect who they uniquely are.Shutterfly is looking for a Senior Site Reliability Engineer to join our team. Shutterfly is undergoing a comprehensive...
Senior Site Reliability Engineer

2 weeks ago

Remote, Oregon, United States Maxihost Full time $120,000 - $180,000 per year

About 's global computing platform was launched in 2019, enabling businesses to programmatically deploy single-tenant Bare Metal instances in different parts of the world. We are a team of passionate individuals about hardware, software, and network infrastructure looking to build the fastest, easiest-to-use, developer-centric single-tenant Cloud...
Senior Site Reliability Engineer

2 weeks ago

Remote, Oregon, United States Jellyvision Full time $145,000 - $175,000 per year

Senior Site Reliability EngineerWho we areJellyvision ALEX, is on a mission to improve lives by helping people choose and use their benefits. We are raising the bar—for benefits and the employee experience (for our employees and those of the customers we serve) – by scaling personalization, compassion and an earnest intent to be helpful in all that we...
Site Reliability Engineer

2 weeks ago

Remote, Oregon, United States 2Prod Technologies Corp. Full time $145,000 - $210,000 per year

About 2Prod2Prod Technologies Corp. supports the federal government in delivering secure, scalable cloud solutions that advance critical national missions.Position Summary2Prod Technologies Corp. is seeking a Site Reliability Engineer (SRE) with strong GitLab expertise to support and enhance enterprise platforms. This role will focus primarily on GitLab...
Staff Software Engineer, Site Reliability

2 weeks ago

Remote, Oregon, United States BABYLIST Full time $199,200 - $239,040 per year

Who We AreBabylist is the leading registry, e-commerce, and content platform for growing families. More than 9 million people shop with Babylist every year, making it the go-to destination for seamless purchasing, trusted guidance, and expert product recommendations for new parents and the people who love them. What began as a universal registry has grown...
Principal Site Reliability Engineer

2 weeks ago

Remote, Oregon, United States Blue River Technology Full time $166,000 - $293,000 per year

We're Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters - to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small...
Reliability Engineer

21 hours ago

Remote, Oregon, United States Prolim global corporation Full time $98,000 - $118,304 per year

Reliability Engineer (Steel Manufacturing) – Remote / Lewisville, OHLocation: Lewisville, Ohio, USA (Remote option available)Experience: 7–10 yearsAbout the RoleWe are seeking an experienced Reliability Engineer with a strong background in Steel Manufacturing to join our team. The ideal candidate will lead reliability initiatives, perform risk-based...
Senior DevOps Engineer

10 hours ago

Remote, Oregon, United States South Geeks Full time $90,000 - $120,000 per year

Hi there :)Thanks for checking in to find out about our open position. We´ll provide as much information as possible, but please feel free to reach us if you have further questions. We´ll be happy to see your application, even if there are skills you don't quite masterAbout UsAt South Geeks, we power the evolution of technology by connecting elite LATAM...
Senior Software Engineer

2 weeks ago

Remote, Oregon, United States SentinelOne Full time $150,000 - $250,000 per year

What Are We Looking For?SentinelOne is seeking a Senior Software Engineer to join the Observo AI team, our cutting-edge AI-driven data pipeline optimization platform. This role will be responsible for designing, developing, and scaling high-performance systems that process massive volumes of telemetry data while reducing costs and improving insights for...
Senior Site Project Manager

3 days ago

Remote, Oregon, United States FORTNA Full time $115,900 - $173,800

FORTNA partners with the world's leading brands to transform omnichannel and parcel distribution operations. Known world-wide for enabling companies to keep pace with digital disruption and growth objectives, we design and deliver solutions, powered by intelligent software, to optimize fast, accurate and cost-effective order fulfillment and last mile...

Americas

Europe

Asia / Oceania

Africa

Senior Site Reliability Engineer