Site Reliability Engineer
2 weeks ago
The Company
Cerbo is a high-growth healthcare SaaS company, doing our part in the medical market to support holistic lifestyles and personalized medicine. Our software – Cerbo EHR – is a cloud-based electronic health records (EHR) and patient portal software system. Healthcare offices across the country – and some around the world – use Cerbo for most everything they do in their day-to-day operations. Cerbo originally started as a developer’s nights-and-weekends project. And has grown into one of the leading EHR systems for functional or “root cause” medicine and membership- or cash-based clinics. Because of our unique origins, we often approach things a bit differently. That is, success for us is not just about the bottom line. It’s more about providing a great product, operating with integrity, and supporting our clients and our team. During the past four years our team has grown, and thousands of practitioners and patients use our product. To this end, we’re looking for a Site Reliability Engineer to join our growing team.
What You’ll Do
As the Site Reliability Engineer (SRE), you will play a pivotal role managing the future of our technology. You will work with our current SRE and engineering team to tune, optimize and enhance our Amazon Web Services Infrastructure. If you're passionate about building and maintaining highly available, scalable systems and thrive in a fast-paced environment, we'd love to hear from you
Primary Responsibilities
- Design, implement, and maintain scalable and reliable cloud infrastructure on AWS
- Manage and optimize Kubernetes clusters using Amazon EKS
- Develop and maintain Infrastructure as Code using Terraform
- Implement and improve CI/CD pipelines using GitHub Actions and ArgoCD
- Ensure system security and implement best practices
- Monitor and optimize system performance using Grafana and Prometheus
- Track our AWS spending and suggest ways to cut operating costs
- Troubleshoot and resolve complex issues in production environments
- Collaborate with development teams to improve application reliability and performance
- Participate in On Call rotation with other SREs and engineering team members
Required Skills
- Extensive experience with AWS services and best practices
- Proficiency in managing Kubernetes clusters, particularly Amazon EKS
- Strong knowledge of Helm for Kubernetes package management
- Extensive experience with Infrastructure as Code, specifically Terraform
- Familiarity with CI/CD pipelines, particularly GitHub Actions
- Advanced Linux administration skills
- Solid understanding of networking concepts and protocols
- Experience in implementing and maintaining security best practices
- Proficiency in using monitoring and observability tools, especially Grafana and Prometheus
- Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
- 3+ years of experience in a Site Reliability Engineering or similar role
- Strong problem-solving skills and attention to detail
- Excellent communication skills and ability to work in a team environment
- Certifications in AWS, Kubernetes, or other relevant technologies are a plus
Compensation & Benefits
- Competitive compensation based on experience
- Comprehensive health, dental and vision benefits
- 401(k) plan with matching company contribution
- Short-term disability & long-term disability insurance
- Paid Time Off and company holidays
- Full suite of remote working tools and processes
Location: 100% Remote
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
-
Site Reliability Engineer
2 weeks ago
Portland, United States Fiveonefour Full timeAbout Fiveonefour We believe that data is the key to unleashing human potential. We've seen firsthand how data helps bridge art and science to create delightful experiences, impactful insights, and seamless automation. We're an early-stage, venture-backed company on a mission to get data out of specialized silos and adopted across all facets of a company....
-
Site Reliability Engineer
2 months ago
portland, United States Matlen Silver Full timeCompensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...
-
Site Reliability Engineer
2 months ago
portland, United States Matlen Silver Full timeCompensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...
-
Site Reliability Engineer
2 months ago
Portland, United States Matlen Silver Full timeCompensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...
-
Site Reliability Engineer
2 months ago
South Portland, United States FieldStack Full timeThe primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...
-
Site Reliability Engineer
1 month ago
Portland, OR, United States Matlen Silver Full timeCompensation: $70 - $75/HourHybrid: 2 Days Onsite Portland, OregonDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...
-
Azure Site Reliability Engineer
2 months ago
South Portland, United States FieldStack Full timeThe primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...
-
Azure Site Reliability Engineer
2 months ago
south portland, United States FieldStack Full timeThe primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...
-
Azure Site Reliability Engineer
1 month ago
south portland, United States FieldStack Full timeThe primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...
-
Azure Site Reliability Engineer
1 month ago
South Portland, ME, United States FieldStack Full timeThe primary role of the Site Reliability Engineer is to design and oversee our cloud infrastructure which powers our unified commerce SaaS solution. This role works within a standard 9-5 schedule, however off hours work may be required when issues arise outside standard business hours. Essential Functions: Design, implement, and maintain Azure cloud...
-
Corporate Reliability Engineer
2 weeks ago
Portland, United States CPS Full timeAn excellent opportunity for a Corporate Reliability Engineer has become available near Portland, OR. This leading manufacturer offers exceptional benefits and endless growth opportunities. Responsibilities for the Corporate Reliability Engineer include: Conducting reliability assessments and failure analyses to identify issues within systems and...
-
Reliability Engineering Manager
2 weeks ago
Portland, Oregon, United States EVRAZ North America Full timeAt EVRAZ North America, we are seeking an experienced Reliability Engineering Manager to join our team in Portland, Oregon. This role is responsible for identifying and managing asset reliability risks, through the application of predictive and preventative procedures that support the safety and efficiency of our plant.Key ResponsibilitiesProvide leadership...
-
Reliability Engineer Leader
1 week ago
Portland, Oregon, United States PacifiCorp Full timeAbout UsPacifiCorp is a leading energy company dedicated to delivering excellent customer service, environmental sustainability, and diversity, equity, and inclusion.We are seeking a talented Reliability Engineer Leader to join our team. The estimated annual salary for this position is $110,000 - $140,000, depending on experience.Job DescriptionAs a...
-
Power System Reliability Engineer
1 week ago
Portland, Oregon, United States PacifiCorp Full timePacifiCorp is seeking a skilled Power System Reliability Engineer to join our team.About the RoleThis is a challenging and rewarding opportunity for a highly motivated individual to contribute to the development and implementation of reliable power systems. The successful candidate will be responsible for ensuring the safe and efficient operation of our...
-
Industrial Reliability Expert
2 weeks ago
Portland, Oregon, United States EVRAZ North America Full timeWe are seeking a skilled Maintenance Manager to join our team at EVRAZ North America. As a key member of our maintenance department, you will be responsible for identifying and managing asset reliability risks through predictive and preventative procedures. This role requires strong problem-solving skills, the drive to achieve results, and an unwavering...
-
Corporate Reliability Engineer
5 days ago
Portland, OR, United States CPS Inc Full timeCPS, Inc. Corporate Reliability Engineer Portland, OR US Posted: 10/12/2024 Industry: Electrical & Mechanical Engineering Job Number: RM10.9.24 Remote Friendly: Remote Job Description An excellent opportunity for a Corporate Reliability Engineer has become available near Portland, OR. This leading manufacturer offers exceptional benefits and endless...
-
Site Development Engineering Manager
7 days ago
Portland, Oregon, United States Jobot Full timeAbout the Opportunity","Jobot is seeking an accomplished Civil Engineering Lead to oversee all aspects of site development projects, from conception to completion. This permanent position offers a unique opportunity to apply technical skills, creativity, and leadership abilities to shape engineering projects.","Key Responsibilities","","Lead and manage a...
-
Senior Engineering Standards Manager
1 week ago
Portland, Oregon, United States PacifiCorp Full timeJob OverviewPacifiCorp is seeking a highly experienced and strategic Senior Engineering Standards Manager to lead our substation standards team. The successful candidate will have a proven track record in developing and implementing technical solutions, engineering guidelines, and policies to drive reliability and efficiency of the electric grid.The Senior...
-
Civil Engineer
1 week ago
Portland, Oregon, United States VLMK Engineering + Design Full timeJob OverviewVLMK Engineering + Design is seeking a highly skilled Civil Engineer to lead our site development team in Portland. This is an excellent opportunity for a passionate professional to expand their career and experience.
-
Distributed Power Systems Engineer
2 weeks ago
Portland, Oregon, United States PacifiCorp Full timeJob DescriptionPacifiCorp seeks an experienced Distributed Power Systems Engineer to join our team. As a reliability expert, you will play a crucial role in ensuring the safe and efficient operation of our power transmission and distribution systems.ResponsibilitiesInvestigate System Outages: Conduct thorough analyses to identify root causes of system...