Site Reliability Engineer

2 weeks ago


Portland, United States Cerbo Full time

 The Company

Cerbo is a high-growth healthcare SaaS company, doing our part in the medical market to support holistic lifestyles and personalized medicine. Our software – Cerbo EHR – is a cloud-based electronic health records (EHR) and patient portal software system. Healthcare offices across the country – and some around the world – use Cerbo for most everything they do in their day-to-day operations. Cerbo originally started as a developer’s nights-and-weekends project. And has grown into one of the leading EHR systems for functional or “root cause” medicine and membership- or cash-based clinics. Because of our unique origins, we often approach things a bit differently. That is, success for us is not just about the bottom line. It’s more about providing a great product, operating with integrity, and supporting our clients and our team. During the past four years our team has grown, and thousands of practitioners and patients use our product. To this end, we’re looking for a Site Reliability Engineer to join our growing team.

What You’ll Do

As the Site Reliability Engineer (SRE), you will play a pivotal role managing the future of our technology. You will work with our current SRE and engineering team to tune, optimize and enhance our Amazon Web Services Infrastructure. If you're passionate about building and maintaining highly available, scalable systems and thrive in a fast-paced environment, we'd love to hear from you

Primary Responsibilities

  • Design, implement, and maintain scalable and reliable cloud infrastructure on AWS
  • Manage and optimize Kubernetes clusters using Amazon EKS
  • Develop and maintain Infrastructure as Code using Terraform
  • Implement and improve CI/CD pipelines using GitHub Actions and ArgoCD
  • Ensure system security and implement best practices
  • Monitor and optimize system performance using Grafana and Prometheus
  • Track our AWS spending and suggest ways to cut operating costs
  • Troubleshoot and resolve complex issues in production environments
  • Collaborate with development teams to improve application reliability and performance
  • Participate in On Call rotation with other SREs and engineering team members

Required Skills

  • Extensive experience with AWS services and best practices
  • Proficiency in managing Kubernetes clusters, particularly Amazon EKS
  • Strong knowledge of Helm for Kubernetes package management
  • Extensive experience with Infrastructure as Code, specifically Terraform
  • Familiarity with CI/CD pipelines, particularly GitHub Actions
  • Advanced Linux administration skills
  • Solid understanding of networking concepts and protocols
  • Experience in implementing and maintaining security best practices
  • Proficiency in using monitoring and observability tools, especially Grafana and Prometheus
Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 3+ years of experience in a Site Reliability Engineering or similar role
  • Strong problem-solving skills and attention to detail
  • Excellent communication skills and ability to work in a team environment
  • Certifications in AWS, Kubernetes, or other relevant technologies are a plus

Compensation & Benefits

  • Competitive compensation based on experience
  • Comprehensive health, dental and vision benefits
  • 401(k) plan with matching company contribution
  • Short-term disability & long-term disability insurance
  • Paid Time Off and company holidays 
  • Full suite of remote working tools and processes

 Location: 100% Remote

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.



  • Portland, United States Fiveonefour Full time

    About Fiveonefour We believe that data is the key to unleashing human potential. We've seen firsthand how data helps bridge art and science to create delightful experiences, impactful insights, and seamless automation. We're an early-stage, venture-backed company on a mission to get data out of specialized silos and adopted across all facets of a company....


  • portland, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • portland, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • Portland, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • South Portland, United States FieldStack Full time

    The primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...


  • Portland, OR, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • South Portland, United States FieldStack Full time

    The primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...


  • south portland, United States FieldStack Full time

    The primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...


  • south portland, United States FieldStack Full time

    The primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...


  • South Portland, ME, United States FieldStack Full time

    The primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...


  • Portland, United States CPS Full time

    An excellent opportunity for a Corporate Reliability Engineer has become available near Portland, OR. This leading manufacturer offers exceptional benefits and endless growth opportunities. Responsibilities for the Corporate Reliability Engineer include: Conducting reliability assessments and failure analyses to identify issues within systems and...


  • Portland, Oregon, United States EVRAZ North America Full time

    At EVRAZ North America, we are seeking an experienced Reliability Engineering Manager to join our team in Portland, Oregon. This role is responsible for identifying and managing asset reliability risks, through the application of predictive and preventative procedures that support the safety and efficiency of our plant.Key ResponsibilitiesProvide leadership...


  • Portland, Oregon, United States PacifiCorp Full time

    About UsPacifiCorp is a leading energy company dedicated to delivering excellent customer service, environmental sustainability, and diversity, equity, and inclusion.We are seeking a talented Reliability Engineer Leader to join our team. The estimated annual salary for this position is $110,000 - $140,000, depending on experience.Job DescriptionAs a...


  • Portland, Oregon, United States PacifiCorp Full time

    PacifiCorp is seeking a skilled Power System Reliability Engineer to join our team.About the RoleThis is a challenging and rewarding opportunity for a highly motivated individual to contribute to the development and implementation of reliable power systems. The successful candidate will be responsible for ensuring the safe and efficient operation of our...


  • Portland, Oregon, United States EVRAZ North America Full time

    We are seeking a skilled Maintenance Manager to join our team at EVRAZ North America. As a key member of our maintenance department, you will be responsible for identifying and managing asset reliability risks through predictive and preventative procedures. This role requires strong problem-solving skills, the drive to achieve results, and an unwavering...


  • Portland, OR, United States CPS Inc Full time

    CPS, Inc. Corporate Reliability Engineer Portland, OR US Posted: 10/12/2024 Industry: Electrical & Mechanical Engineering Job Number: RM10.9.24 Remote Friendly: Remote Job Description An excellent opportunity for a Corporate Reliability Engineer has become available near Portland, OR. This leading manufacturer offers exceptional benefits and endless...


  • Portland, Oregon, United States Jobot Full time

    About the Opportunity","Jobot is seeking an accomplished Civil Engineering Lead to oversee all aspects of site development projects, from conception to completion. This permanent position offers a unique opportunity to apply technical skills, creativity, and leadership abilities to shape engineering projects.","Key Responsibilities","","Lead and manage a...


  • Portland, Oregon, United States PacifiCorp Full time

    Job OverviewPacifiCorp is seeking a highly experienced and strategic Senior Engineering Standards Manager to lead our substation standards team. The successful candidate will have a proven track record in developing and implementing technical solutions, engineering guidelines, and policies to drive reliability and efficiency of the electric grid.The Senior...

  • Civil Engineer

    1 week ago


    Portland, Oregon, United States VLMK Engineering + Design Full time

    Job OverviewVLMK Engineering + Design is seeking a highly skilled Civil Engineer to lead our site development team in Portland. This is an excellent opportunity for a passionate professional to expand their career and experience.


  • Portland, Oregon, United States PacifiCorp Full time

    Job DescriptionPacifiCorp seeks an experienced Distributed Power Systems Engineer to join our team. As a reliability expert, you will play a crucial role in ensuring the safe and efficient operation of our power transmission and distribution systems.ResponsibilitiesInvestigate System Outages: Conduct thorough analyses to identify root causes of system...