Senior Site Reliability Engineer

2 weeks ago


Chicago, United States DASH2 Full time

The Senior/Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE’s here take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.


You either have an infrastructure background with a programmatic, automated mindset or are someone that comes with a software engineering background with infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.


Responsibilities

  • Champion and implement a culture of SRE to maintain a high-quality platform infrastructure
  • Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
  • Optimize application performance at scale
  • Automate everything including system operational runbooks
  • Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
  • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
  • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
  • Learn continuously and apply lessons learned
  • Evangelize best practices, eliminate bottlenecks, and improve process


Qualifications

  • BS in Computer Science or equivalent work experience.
  • 10+ years demonstrating hands-on technical leadership and business impact in combining software skills with systems to solve complex automation and reliability challenges
  • 5+ years working with various cloud providers, containerization technologies, automated deployment frameworks, orchestration frameworks, monitoring, logging, alerting, system internals, networking, databases, distributed systems, and service-oriented architecture
  • 5+ years of experience supporting public client facing revenue generating systems
  • 3+ years of experience writing software in any modern software language such as C#.NET, Java, Javascript, Node.js, React.
  • 3+ years of experience creating automation with tools such as Azure DevOps, Ansible, Terraform, PowerShell, Python/Bash to build and deploy in a continuous integration (CI) environment and to manage infrastructure as code
  • You have proven track record to implement load, stress, performance and reliability testing standards at scale to improve service, platform and infrastructure resiliency
  • You are experienced in leading efforts in securing systems in 24x7 production environments


  • Chicago, United States Dell Full time

    Senior Engineer Site ReliabilityDell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...


  • Chicago, United States Dell Full time

    Senior Engineer Site ReliabilityDell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...


  • Chicago, Illinois, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up.As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, United States DASH2 Full time

    The Senior/Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE's here take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.You either have an infrastructure background with a programmatic, automated mindset...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.  For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, Illinois, United States Adyen Full time

    This is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.  For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...


  • Chicago, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...


  • Chicago, United States Deere & Company Full time

    Advanced Options 28 open jobs. Use your resume to get matched with the right job. Senior Platform Engineer (Chicago, Visa Sponsorship available) Reliability Engineer Dubuque, Iowa, United States Reliability Engineer Dubuque, Iowa, United States Senior Software Engineer - DevOps eCommerce (Chicago) SOFTWARE ENGINEER (Chicago, IL or Moline, IL - Hybrid) SAP...


  • Chicago, Illinois, United States Spectraforce Technologies Full time

    Title: Senior Associate Software Engineer/Senior Lead Software EngineerLocation: Chicago, IL Onsite 3 days per weekDuration: 6 Month Contract to Hire Must Haves:5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...


  • Chicago, Illinois, United States Spectraforce Technologies Full time

    Title :Senior Associate Software Engineer/Senior Lead Software Engineer Location :Chicago, IL Onsite 3 days per week Duration : 6 Month Contract to Hire Must Haves: 5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...


  • Chicago, United States Saxon Global Full time

    Northern Trust Site Reliability Engineer (Azure) Location : Downtown Chicago - Onsite 2 days/week - 181 W Madison St Duration : 12+ month contract w/extension/conversion Overview The Goals Driven Wealth Management platform is a showcase product for Northern Trusts Wealth Management business and we must demonstrate our ability to deliver and...


  • Chicago, United States Saxon Global Full time

    Northern Trust Site Reliability Engineer (Azure) Location : Downtown Chicago - Onsite 2 days/week - 181 W Madison St Duration : 12+ month contract w/extension/conversion Overview The Goals Driven Wealth Management platform is a showcase product for Northern Trusts Wealth Management business and we must demonstrate our ability to deliver and...


  • Chicago, United States Synergy Interactive Full time

    As a Site Reliability Engineer you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and applications. You will collaborate closely with our engineering, operations, and development teams to design, implement, and maintain robust systems and processes that support our mission-critical services.Key...