Site Reliability Engineer

1 week ago


Remote, Oregon, United States Brooksource Full time
Job Description

Brooksource is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.

Key Responsibilities:
  • Linux System Administration: Manage and optimize Linux-based systems and servers to ensure high availability, performance, and security of critical services.
  • Monitoring and Logging: Implement monitoring, alerting, and logging solutions to proactively identify and mitigate potential issues, and help the team stabilize monitoring and improve observability.
  • Cloud Monitoring: Enhance platform monitoring by utilizing tools such as Dynatrace and Splunk, specifically for applications running on Ruby on Rails.
  • Azure Cloud Services: Implement and manage Azure cloud monitoring to gain comprehensive visibility into infrastructure and application health, ensuring swift resolution of issues.
  • Infrastructure Design: Design, implement, and maintain highly available and scalable infrastructure solutions to support applications and services on the Azure cloud platform.
  • Release Management: Collaborate with software engineering teams to define and implement reliable deployment pipelines and release processes using GitHub and Azure Pipelines for CI/CD.
  • Automation: Develop automation scripts and tools using PowerShell and other languages to automate repetitive tasks and streamline operational workflows.
  • Disaster Recovery: Lead disaster recovery planning and testing efforts to ensure business continuity and minimize downtime in case of system failures or disasters.
  • Capacity Planning: Perform capacity planning and resource optimization to ensure optimal performance and cost-effectiveness of infrastructure.
  • Incident Response: Participate in incident response and resolution, including root cause analysis and post-incident reviews.
Requirements:
  • Education: Bachelor's degree in Computer Science, Engineering, or a related field.
  • Experience: 5+ years of experience in site reliability engineering, DevOps, or a similar role.
  • Linux Experience: Extensive experience in Linux environment.
  • Application Monitoring: Extensive application monitoring experience using platforms such as Azure Monitor, Dynatrace, or Splunk.
  • Scripting and Programming: Proficiency in scripting and programming languages such as PowerShell, Ruby on Rails, Python, Bash, or Go.
  • Azure Cloud Services: Hands-on experience with Azure cloud services and technologies.
  • CI/CD Tools: Experience with GitHub and Azure Pipelines for CI/CD.
  • Containerization: Strong understanding of containerization technologies and orchestration frameworks like Kubernetes.
  • Configuration Management: Experience with configuration management tools such as Terraform, Ansible, or Puppet.
  • Release Automation: Familiarity with release automation tools like release-please.
  • Problem-Solving: Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
  • Communication: Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.


  • Remote, Oregon, United States Abarca Full time

    What you'll doIn a few words...Abarca is igniting a revolution in healthcare. We built our company on the belief that with smarter technology we are redefining pharmacy benefits, but this is just the beginning...Our Site Reliability Engineering team leverages software engineering and infrastructure operations to create highly reliable and scalable software...


  • Remote, Oregon, United States Hypori Inc. Full time

    Hypori Inc, a leading provider of SaaS cybersecurity solutions, is transforming secure mobility for federal and commercial customers, including the United States Army. Hypori's secure virtual workspace enables users to access critical data and apps from any mobile device without compromising user privacy. From commercial IP to national security level intel,...


  • Remote, Oregon, United States Xero Full time

    Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive. At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their...


  • Remote, Oregon, United States Brooksource Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Brooksource. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesManage and Optimize Linux-Based Systems and Servers: Ensure high availability,...


  • Remote, Oregon, United States Neon Full time

    Neon is aiming to be the go-to platform if you need a serverless Postgres with additional features like branching and scaling, to name a couple. Currently we are serving 750k databases and we want to grow that number, along with delivering more features without compromising from reliability and scalability. This is where our SRE team comes into the...


  • Remote, Oregon, United States Henry Meds Full time

    About Henry Meds:Tens of millions of Americans are unable to manage their chronic conditions with commercial medications. Using specialized compounded formulas tailored to individual patient needs, Henry helps people who have been left behind by the commercial market, all while remaining easy, accessible, and affordable. Our customers get access to the care...


  • Remote, Oregon, United States Business Wire Full time

    Business Wire, a Berkshire Hathaway company, is the global market leader in press release distribution and regulatory disclosure. We are on a mission to redefine how organizations connect with their audiences - and that's just the beginningOrganizations, large and small, depend on us to accurately publicize market-moving news and multimedia, and generate...


  • Remote, Oregon, United States Comcast Advertising Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...


  • Remote, Oregon, United States DFIN Full time

    Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As...


  • Remote, Oregon, United States Sparksoft Corporation Full time

    Join us at Sparksoft, where we're not just another tech company—we're a catalyst for change. Our mission isn't just to offer IT solutions; it's to revolutionize the way you work. Here, passion isn't just a buzzword; it's the fuel behind groundbreaking ideas and transformative technologies. We serve a wide range of government clients, delivering impact...


  • Remote, Oregon, United States Tyk Full time

    DescriptionWho are Tyk, and what do we do?The Tyk API Management platform is helping to drive the connected world and power new products and services. We're changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across the retail,...


  • Remote, Oregon, United States Own Company Full time

    Own is the leading data platform trusted by thousands of organizations to protect and activate SaaS data to transform their businesses. Own empowers customers to ensure the availability, security and compliance of mission-critical data, while unlocking new ways to gain deeper insights faster. By partnering with some of the world's largest SaaS ecosystems...


  • Remote, Oregon, United States Katmai Full time

    ABOUT KATMAIKatmai is pioneering the future of virtual experiences and hybrid work. The platform brings people together inside an easy-to-navigate 3D environment, enabling natural communication & collaboration, spontaneous interactions, and a sense of place that's been missing from the digital world. The simplicity of the user experience means no headsets...


  • Remote, Oregon, United States Dutchie Full time

    About DutchieFounded in 2017, Dutchie is a comprehensive technology platform powering dispensary operations, while providing consumers with safe and easy access to cannabis. Dutchie aims to further support the positive societal change the cannabis industry brings to the world through wellness benefits, social justice, and empowering local communities through...


  • Remote, Oregon, United States Symbotic Full time

    Company OverviewSymbotic is at the forefront of transforming supply chain logistics through its advanced A.I.-driven robotic technology platform. Our intelligent software coordinates sophisticated robots within a high-density, comprehensive system, revolutionizing warehouse automation to enhance efficiency, speed, and adaptability.Position SummaryThe Lead...


  • Remote, Oregon, United States Symbotic Full time

    About the Role:The Site Installation Manager will lead the installation of Symbotic's automation systems on customer sites, ensuring timely, within-budget, and defect-free delivery of the system to the operations team.Key Responsibilities:Manage and lead site-specific Requests for Information (RFIs) and respond to contractor and vendor RFIs in accordance...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleWe are seeking a highly skilled Site Installation Manager to join our Implementation organization within Symbotic. This individual will be responsible for leading the installation of our automated equipment on customer sites, ensuring timely completion, within budget, and delivered without defect.Key ResponsibilitiesManage and lead...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleWe are seeking a highly skilled Senior Site Quality Engineer to join our team at Symbotic. As a key member of our Quality team, you will be responsible for ensuring the highest standards of quality, safety, and reliability in our products and processes.Key ResponsibilitiesProvide quality leadership at customer installation sites, focusing on...

  • Cloud Engineer

    3 days ago


    Remote, Oregon, United States Brooksource Full time

    Job DescriptionBrooksource is seeking a highly skilled and experienced Senior Site Reliability Engineer to join our cloud environment team.The ideal candidate will have a strong background in AWS and be well-versed in cloud-native principles. This role requires participation in an on-call rotation, with the assurance that it won't drastically impact personal...


  • Remote, Oregon, United States Symbotic Full time

    About the RoleThe Senior Site Quality Engineer will work closely with our customer sites and installation teams to ensure that quality standards are met in all aspects of installations. This role will act as a catalyst for continuous improvement in our products and systems.Key ResponsibilitiesInterface with suppliers, sub-tiers, engineering, program team,...