Site Reliability Engineer

2 months ago


Remote, Oregon, United States Abarca Full time

What you'll do

In a few words...

Abarca is igniting a revolution in healthcare. We built our company on the belief that with smarter technology we are redefining pharmacy benefits, but this is just the beginning...

Our Site Reliability Engineering team leverages software engineering and infrastructure operations to create highly reliable and scalable software systems. The team is responsible for ensuring that Abarca's infrastructure operates efficiently by assisting with the design, build, and maintenance of software systems that automate and optimize the deployment, monitoring, and performance of Abarca's systems. By focusing on improving the reliability and availability of software systems through engineering best practices and tools, we manage complex distributed systems to meet our external Service Level Agreements and internal Operating Level Agreements.

As our Site Reliability Engineer, you will be responsible for collaborating on the design, build, and maintenance of reliable and scalable infrastructure and software systems. This will be accomplished by tracking error budgets against service level agreements in order to meet and maintain compliance. You will also be collaborating with our Infrastructure, Software Engineering and Security teams to identify and implement reliability and performance improvements across our systems.

The fundamentals for the job...

  • Manage error budgets while ensuring that service level agreements are being met while keeping our stakeholders satisfied and reducing penalties associated with performance issues.
  • Monitor systems for potential performance and reliability issues, proactively taking measures to prevent their occurrence and minimize service disruption.
  • Promptly troubleshoot and resolve production issues while also identifying opportunities for improvement in terms of reliability, to ensure timely resolution and mitigate future occurrences.
  • Collaborate with Software Development, among other teams, continuously improving systems and processes to increase efficiency, minimize downtime, and optimize overall system reliability.
  • Develop and maintain automation tools to improve system observability, reliability, and performance.
  • Design and implement disaster recovery plans to ensure business continuity.

What we expect of you

The bold requirements...

  • Bachelor's or Master's Degree in Information Technology, Computer Science or a related field. (In lieu of a degree equivalent experience may be considered).
  • 3+ years of experience as a site reliability engineer or within related areas.
  • Experience managing error budgets as well as service level agreements.
  • Experience programming with, but not limited to: .Net, C#, JavaScript, PyScript, T-SQL/SQL.
  • Experience with containerization technologies (e.g. Docker and Kubernetes).
  • Experience with cloud infrastructure platforms (e.g. AWS, Azure, or GCP).
  • Experience with monitoring and alerting tools (e.g. DataDog, AppDynamics, Dynatrace, Prometheus, SolarWinds, Grafana, or Nagios)
  • Participate in on-call rotation to provide 24/7 support for critical systems. Availability to work rotating or irregular shifts, including weekends and certain holidays, per business or operational needs.
  • Some travel required to Puerto Rico location 15-20%.
  • Excellent oral and written communication skills.
  • We are proud to offer a flexible hybrid work model which will require certain on-site work days (Puerto Rico Location Only)

Nice to haves...

  • Experience with automation tools (e.g. Ansible, PowerShell scripting).
  • Certified SRE Foundation (SREF).

Physical requirements...

  • Must be able to access and navigate each department at the organization's facilities.
  • Sedentary work that primarily involves sitting/standing.

At Abarca we value and celebrate diversity. Diversity, equity, inclusion, and belonging are guiding principles of Abarca and ensure Abarca's workforce reflects the communities it serves. We are proud to provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, medical condition, genetic information, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.

Abarca Health LLC is an equal employment opportunity employer and participates in E-Verify. "Applicant must be a United States' citizen. Abarca Health LLC does not sponsor employment visas at this time"

The above description is not intended to limit the scope of the job or to exclude other duties not mentioned. It is not a final set of specifications for the position. It's simply meant to give readers an idea of what the role entails.

#LI-MH1 #LI-REMOTE



  • Remote, Oregon, United States Hypori Inc. Full time

    Hypori Inc, a leading provider of SaaS cybersecurity solutions, is transforming secure mobility for federal and commercial customers, including the United States Army. Hypori's secure virtual workspace enables users to access critical data and apps from any mobile device without compromising user privacy. From commercial IP to national security level intel,...


  • Remote, Oregon, United States Xero Full time

    Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive. At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their...


  • Remote, Oregon, United States Brooksource Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Brooksource. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesManage and Optimize Linux-Based Systems and Servers: Ensure high availability,...


  • Remote, Oregon, United States Neon Full time

    Neon is aiming to be the go-to platform if you need a serverless Postgres with additional features like branching and scaling, to name a couple. Currently we are serving 750k databases and we want to grow that number, along with delivering more features without compromising from reliability and scalability. This is where our SRE team comes into the...


  • Remote, Oregon, United States Henry Meds Full time

    About Henry Meds:Tens of millions of Americans are unable to manage their chronic conditions with commercial medications. Using specialized compounded formulas tailored to individual patient needs, Henry helps people who have been left behind by the commercial market, all while remaining easy, accessible, and affordable. Our customers get access to the care...


  • Remote, Oregon, United States Business Wire Full time

    Business Wire, a Berkshire Hathaway company, is the global market leader in press release distribution and regulatory disclosure. We are on a mission to redefine how organizations connect with their audiences - and that's just the beginningOrganizations, large and small, depend on us to accurately publicize market-moving news and multimedia, and generate...


  • Remote, Oregon, United States Brooksource Full time

    Job DescriptionBrooksource is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key Responsibilities:Linux System Administration: Manage and optimize Linux-based systems and servers to...


  • Remote, Oregon, United States Comcast Advertising Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...


  • Remote, Oregon, United States DFIN Full time

    Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As...


  • Remote, Oregon, United States Sparksoft Corporation Full time

    Join us at Sparksoft, where we're not just another tech company—we're a catalyst for change. Our mission isn't just to offer IT solutions; it's to revolutionize the way you work. Here, passion isn't just a buzzword; it's the fuel behind groundbreaking ideas and transformative technologies. We serve a wide range of government clients, delivering impact...


  • Remote, Oregon, United States Tyk Full time

    DescriptionWho are Tyk, and what do we do?The Tyk API Management platform is helping to drive the connected world and power new products and services. We're changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across the retail,...


  • Remote, Oregon, United States Own Company Full time

    Own is the leading data platform trusted by thousands of organizations to protect and activate SaaS data to transform their businesses. Own empowers customers to ensure the availability, security and compliance of mission-critical data, while unlocking new ways to gain deeper insights faster. By partnering with some of the world's largest SaaS ecosystems...


  • Remote, Oregon, United States Katmai Full time

    ABOUT KATMAIKatmai is pioneering the future of virtual experiences and hybrid work. The platform brings people together inside an easy-to-navigate 3D environment, enabling natural communication & collaboration, spontaneous interactions, and a sense of place that's been missing from the digital world. The simplicity of the user experience means no headsets...


  • Remote, Oregon, United States Dutchie Full time

    About DutchieFounded in 2017, Dutchie is a comprehensive technology platform powering dispensary operations, while providing consumers with safe and easy access to cannabis. Dutchie aims to further support the positive societal change the cannabis industry brings to the world through wellness benefits, social justice, and empowering local communities through...


  • Remote, Oregon, United States Symbotic Full time

    Company OverviewSymbotic is at the forefront of transforming supply chain logistics through its advanced A.I.-driven robotic technology platform. Our intelligent software coordinates sophisticated robots within a high-density, comprehensive system, revolutionizing warehouse automation to enhance efficiency, speed, and adaptability.Position SummaryThe Lead...


  • Remote, Oregon, United States Symbotic Full time

    About the Role:The Site Installation Manager will lead the installation of Symbotic's automation systems on customer sites, ensuring timely, within-budget, and defect-free delivery of the system to the operations team.Key Responsibilities:Manage and lead site-specific Requests for Information (RFIs) and respond to contractor and vendor RFIs in accordance...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleWe are seeking a highly skilled Site Installation Manager to join our Implementation organization within Symbotic. This individual will be responsible for leading the installation of our automated equipment on customer sites, ensuring timely completion, within budget, and delivered without defect.Key ResponsibilitiesManage and lead...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleWe are seeking a highly skilled Senior Site Quality Engineer to join our team at Symbotic. As a key member of our Quality team, you will be responsible for ensuring the highest standards of quality, safety, and reliability in our products and processes.Key ResponsibilitiesProvide quality leadership at customer installation sites, focusing on...

  • Cloud Engineer

    3 days ago


    Remote, Oregon, United States Brooksource Full time

    Job DescriptionBrooksource is seeking a highly skilled and experienced Senior Site Reliability Engineer to join our cloud environment team.The ideal candidate will have a strong background in AWS and be well-versed in cloud-native principles. This role requires participation in an on-call rotation, with the assurance that it won't drastically impact personal...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleThe Senior Site Quality Engineer will work closely with our customer sites and installation teams to ensure that quality standards are met in all aspects of installations. This role will act as a catalyst for continuous improvement in our products and systems.Key ResponsibilitiesInterface with suppliers, sub-tiers, engineering, program team,...