Senior Site Reliability Engineer

4 weeks ago


Madison, United States THINKalpha Full time

Location: 100% Remote. The working timezone is EU/GMT. ThinkAlpha is looking for a Senior Site Reliability Engineer to work in the core infrastructure team supporting our data analytics platform and transactional trading engine. Responsibilities: Configure and maintain observability tooling with Datadog and PagerDuty (Slack channels) Contribute to our IaC codebase by creating and maintaining Terraform and Ansible modules, and participate in the review process for the IaC developed by the other SRE engineers Help developers with their needs when it comes to infrastructure updates and accounts management Support our CI/CD infrastructure and be familiar enough with the workflow to maintain/update our reusable workflows and GitHub actions to reflect the dev team needs Keep the infrastructure systems updated and secure by performing scheduled updates to the operating systems, software packages, and services Run software releases for our production environments, support the SW Development team with testing and validation, and perform rollbacks when necessary Be part of the on-call rotation schedule and ensure services are healthy and performing as expected, especially during the market opening hours Ideal candidates: Ideal candidates want to build sustainable code, build systems that are resilient and well-tested, and want to work with a group of people that hold each other to high standards. Qualifications: Must have: Minimum 5 years of professional experience Extensive experience and deep understanding of infrastructure as code (Terragrunt/Terraform) Strong understanding of Docker, Kubernetes, Linux and data pipelines and stream processing tools Experience with both on-premise/colocated servers as well as cloud infrastructure, and hybrid deployments spanning both types of environments Experience with observability platforms (e.g., DataDog) and alarm systems (e.g., PagerDuty) Nice to have: Coding background in at least one language (Node, JavaScript, Python, C++, etc) Understanding of mesh networking with Kubernetes clusters (Istio, Linkerd, or similar), ArgoCD Familiarity managing and configuring services that rely on: Git, S3, SQL, Mongo Familiarity with CI and GitHub Actions #J-18808-Ljbffr



  • Madison, United States Formula Recruitment Full time

    ```html Senior Site Reliability Engineer Salary: Up to £120,000 Location: Fully Remote Type: Permanent, Full Time We are partnered with a leading Web3 and Blockchain start-up company who aim to disrupt the crypto eco-system and move away from a chain centric worldview towards an account centric worldview. They are currently looking for a Senior Site...


  • Madison, United States Talented Recruitment Group Full time

    ```html Are you passionate about crafting robust, fault-tolerant systems that power unforgettable travel experiences? Do you thrive in an environment where innovation and collaboration are valued? If so, we have an incredible opportunity for you! About the Company: We are working with a leading global travel company dedicated to providing exceptional...


  • Madison, United States Palmer Group Full time

    One of the leading appliance manufacturers in the world is searching for a Senior Reliability Engineer. This person will be responsible for establishing design assurance and reliability standards to ensure products consistently meet customer expectations for quality and performance throughout their lifecycle. This role requires specific responsibilities,...


  • Madison, United States Palmer Group Full time

    One of the leading appliance manufacturers in the world is searching for a Senior Reliability Engineer. This person will be responsible for establishing design assurance and reliability standards to ensure products consistently meet customer expectations for quality and performance throughout their lifecycle. This role requires specific responsibilities,...


  • Madison, United States Xcede Full time

    Site Reliability Engineering Manager is required by a global financial technology organisation. In this newly created role, the Site Reliability Engineering Manager will be responsible for deploying and managing a suite of enterprise-wide tools used for provisioning, automation, and monitoring as well as technical team leadership. Site Reliability...


  • Madison, United States Sub-Zero & Wolf Appliance Full time

    We welcome you to join Sub-Zero, Wolf, and Cove as a Senior Reliability Engineer in Madison, WI location. Sub-Zero, Wolf, and Cove the leading manufacturer of luxury kitchen appliances is a longstanding, family-owned company in the Madison area. Icons of design and paragons of performance and quality, Sub-Zero, Wolf, and Cove are the refrigeration, cooking,...


  • Madison, United States IC Resources Full time

    Our Cambridge based client is currently searching for a Senior Reliability Engineer to be responsible for all reliability aspects of new devices and ICs. The role will involve designing the reliability experiments as well as developing and analysing reliability models. An important part of the role will be to develop new electrical reliability tests in...


  • Madison, United States Sub-Zero & Wolf Appliance Full time

    We welcome you to join Sub-Zero, Wolf, and Cove as a Senior Reliability Engineer in Madison, WI location. Sub-Zero, Wolf, and Cove the leading manufacturer of luxury kitchen appliances is a longstanding, family-owned company in the Madison area. Icons of design and paragons of performance and quality, Sub-Zero, Wolf, and Cove are the refrigeration, cooking,...


  • Madison, United States Fetch Full time

    What we’re building and why we’re building it. There’s a reason Fetch is ranked top 10 in Shopping in the App Store. Every day, millions of people earn Fetch Points buying brands they love. From the grocery aisle to the drive-through, Fetch makes saving money fun. We’re more than just a build-first tech unicorn. We’re a revolutionary shopping...


  • Madison, United States Fetch Full time

    What we’re building and why we’re building it. There’s a reason Fetch is ranked top 10 in Shopping in the App Store. Every day, millions of people earn Fetch Points buying brands they love. From the grocery aisle to the drive-through, Fetch makes saving money fun. We’re more than just a build-first tech unicorn. We’re a revolutionary shopping...


  • Madison, Wisconsin, United States Xcede Full time

    Position Overview:The Manager of Site Reliability Engineering is sought by a leading global financial technology firm. In this pivotal role, you will oversee the deployment and management of a comprehensive suite of enterprise tools designed for provisioning, automation, and monitoring, alongside providing technical leadership to your team.Key...


  • Madison, United States Canonical - Jobs Full time

    Job DescriptionJob DescriptionThis role is an opportunity for a hands-on, but literally hands-off, senior technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have experience of IT operations automation, Infrastructure as Code and a passion for technology,...


  • Madison, United States TekStream Solutions Full time

    Overview Our client is a remote-first company with team members across the globe! Offering a SaaS-based Learning Management System powering the world's leading education programs. Our client helps large brands and fast-moving companies increase revenue, improve customer retention, and decrease support costs through external education. The platform includes...


  • Madison, United States Peaple Talent Full time

    Hello Site Reliability Engineers! Having an average day? Well, luckily you've come across an opportunity that might just change that. For this one - you will be part of a team that is building & designing a new serverless architecture. Therefore, you will be comfortable deploying with Terraform, while understanding observability principles. Really know your...


  • Madison, United States Redline Group Full time

    ```html Job Opportunity: Senior Reliability Engineer - Electronics The Redline Group have a fantastic new opportunity for a Senior Reliability Engineer - Electronics , based in Central London. My client is a leading software development company, developing within the UK's fastest growing EV market. This opportunity presents itself to highly motivated and...


  • Madison, United States Palmer Group Full time $110,000

    Job DescriptionJob DescriptionOne of the leading appliance manufacturers in the world is searching for a Senior Reliability Engineer. This person will be responsible for establishing design assurance and reliability standards to ensure products consistently meet customer expectations for quality and performance throughout their lifecycle. This role requires...


  • Madison, United States Total Administrative Services Corporation Full time

    Job DescriptionJob DescriptionAbout Us:Xformative Payment Systems is at the cutting edge of the Fintech industry, specializing in cloud-native payment processing solutions. We are a dynamic, fast-growing company with a small, agile team that thrives in a startup environment. Here, every team member has the opportunity to drive and create impactful work. Our...


  • Madison, Wisconsin, United States TekStream Solutions Full time

    OverviewTekStream Solutions is a remote-first organization with a diverse team spread across the globe, specializing in a SaaS-based Learning Management System that empowers leading educational initiatives worldwide.We assist prominent brands and agile companies in boosting revenue, enhancing customer loyalty, and reducing support expenses through external...


  • Madison, Wisconsin, United States THINKalpha Full time

    Location:100% Remote. The working timezone is EU/GMT. THINKalpha is seeking a Senior Site Reliability Engineer to join our core infrastructure team, dedicated to enhancing our data analytics platform and transactional trading engine.Key Responsibilities:• Configure and oversee observability tools such as Datadog and PagerDuty (including Slack channels)•...


  • Madison, Wisconsin, United States Talented Recruitment Group Full time

    Are you driven by the challenge of building resilient and dependable systems that enhance travel experiences? Do you excel in a collaborative and innovative atmosphere? If this resonates with you, we have an exciting opportunity for you.About Talented Recruitment Group:We partner with a premier global travel organization committed to delivering outstanding...