Senior Site Reliability Engineer

1 month ago


Remote, Oregon, United States | DFIN | Full time

Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions at every stage of your business and investment lifecycles. As markets fluctuate, regulations evolve and technology advances, we're there. And through it all, we deliver confidence with the right solutions in moments that matter.
Summary:
We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.
The Senior Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SREs at DFIN take on availability, performance, change management, monitoring and response, and are guardians of non-functional requirements.
You either have an infrastructure background with a programmatic, automation-first mindset or come from a software engineering background with infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate the manual work of keeping our products up, running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.
Responsibilities:

  • Champion and implement a culture of SRE to maintain a high-quality platform infrastructure
  • Champion and implement application and infrastructure monitoring and alerting that prevents client-impacting issues, ensuring system availability, performance and scalability so that SLOs and SLAs are met
  • Optimize application performance at scale
  • Automate everything including system operational runbooks
  • Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
  • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
  • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
  • Learn continuously and apply lessons learned
  • Evangelize best practices, eliminate bottlenecks, and improve process

Qualifications:

  • 5+ years of experience writing software in a modern programming language such as C# (.NET) or Java
  • 5+ years of experience creating automated deployments with tools such as Azure DevOps Pipelines, Ansible or Jenkins, or with scripting languages, to manage infrastructure, software builds and deployments in a continuous integration (CI) / continuous delivery (CD) environment
  • 5+ years of experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows and Linux environments
  • 5+ years of experience implementing production performance, availability, and scalability monitoring and alerting best practices using a tool such as New Relic, Dynatrace, DataDog or AppDynamics
  • 5+ years of experience as a global admin of Azure, including cloud cost management
  • 5+ years of experience supporting public, client-facing, revenue-generating systems
  • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
  • Experience monitoring and preventing issues with databases and database queries (SQL, Cosmos) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
  • Experience planning, coordinating, developing and executing all stages of test scripts
  • Experience securing Windows or Linux systems in a 24x7 production environment
  • Experience with containerization and managing Kubernetes clusters
  • Experience with common networking, firewall and load balancing protocols
  • BS in Computer Science or equivalent work experience

It is the policy of Donnelley Financial Solutions to select, place and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access as a result of your disability. You can request a reasonable accommodation by sending an email to .


