Site Reliability Engineer

2 days ago


Austin, Texas, United States Liquibase Full time
Job Description

We are seeking a highly skilled Site Reliability Engineer to join our team at Liquibase. As a key member of our DevOps team, you will be responsible for designing, implementing, and maintaining highly resilient and secure infrastructure for our SaaS platform using AWS services.

Key Responsibilities:

  • Design and implement secure and scalable infrastructure using AWS services such as API Gateway, Lambda, Aurora Serverless, and OpenSearch Serverless.
  • Develop and maintain robust monitoring and alerting solutions to ensure the reliability and performance of our SaaS platform.
  • Ensure best-in-class security of the application using AWS security services such as WAF, Shield, and GuardDuty.
  • Facilitate and drive incident response, triage, and resolution to maintain the reliability and uptime of our platform.
  • Lead incident post-mortem/retrospectives to surface reliability improvements and drive to completion.
  • Implement strategies to increase system resilience and performance through on-call rotation and process optimization.
  • Strong understanding of SRE principles, including error budgets, SLOs, SLIs, and SLAs.
  • Build and maintain infrastructure as code using Terraform.
  • Provide input and expertise for system architecture and feature development.
  • Engage and collaborate with stakeholders to ensure work is properly defined, prioritized, and executed.

Requirements:

  • Prior SRE experience supporting a cloud-native SaaS platform with AWS.
  • Bachelor's degree in Computer Science, Software Engineering, or a related field (or equivalent work experience).
  • AWS Solutions Architect and/or AWS DevOps Professional Certifications.
  • 5+ years of hands-on experience in site reliability engineering roles.
  • Expert knowledge of AWS services, specifically API Gateway, Lambda, Aurora Serverless, OpenSearch Serverless, Secrets Manager, and FusionAuth.
  • Expertise in AWS security services, including WAF, Shield, GuardDuty, and a deep understanding of cloud security practices.
  • Strong experience with monitoring and alerting tools such as CloudWatch, Prometheus, Grafana, or similar.
  • Proven ability to design and implement effective monitoring strategies to ensure system reliability and performance.

Perks of Life at Liquibase:

  • Remote culture with potential for company-wide in-person gatherings.
  • Home office allowance for remote workers.
  • Meaningful equity (US only).
  • Comprehensive health, vision, and dental benefits - country dependent.
  • Generous paid time off and paid holidays.
  • 401K matching (US only).
  • No punks, no jerks culture.
  • Growth opportunities and ability to move up within the company.


  • Austin, Texas, United States Unreal Gigs Full time

    Job Title: Site Reliability EngineerAt Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems.Key Responsibilities:Design and implement monitoring, logging, and alerting...


  • Austin, Texas, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy automation tools to improve the efficiency and reliability of our cloud...


  • Austin, Texas, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy software to improve the availability, scalability, and efficiency of Oracle...


  • Austin, Texas, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve the reliability and...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and security of our cloud-based services.Key Responsibilities:Collaborate with project managers and service delivery managers to analyze traffic...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team in Austin, TX. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our CTE product line and solutions for deployment in various environments, including on-premises, multiple clouds, and big data and...


  • Austin, Texas, United States Thales Full time

    Job Title: Site Reliability EngineerThales is seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our cloud-based infrastructure and applications.Key Responsibilities:Collaborate with project managers and service delivery managers to analyze...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...


  • Austin, Texas, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement automated solutions to improve infrastructure stability and scalabilityCollaborate with...


  • Austin, Texas, United States Terminal Industries Full time

    About UsTerminal Industries is a cutting-edge technology company that's revolutionizing the logistics industry with its innovative software solutions.Our platform leverages machine learning and IoT technology to digitize, index, and automate warehouse operations, providing warehouse operators with the intelligence needed to optimize their usage of trucks,...


  • Austin, Texas, United States Apple Full time

    About the RoleWe are seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a key member of our team, you will design, build, and maintain our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple customers.Key ResponsibilitiesCollaborate with...


  • Austin, Texas, United States Apple Full time

    Site Reliability Engineering ManagerAt Apple, we're committed to delivering exceptional customer experiences through innovative products and services. As a Site Reliability Engineering Manager, you'll play a critical role in ensuring the reliability and scalability of our cloud services.Key ResponsibilitiesLead a team of SRE engineers in establishing and...


  • Austin, Texas, United States Oxford Knight Full time

    Database Site Reliability EngineerOxford Knight is seeking an experienced Database Site Reliability Engineer to join our Trading Systems Infrastructure team. As a key member of our team, you will be responsible for designing, building, and maintaining our diverse production database infrastructure, focusing on bare metal performance, scalability, and...


  • Austin, Texas, United States Terminal Industries Full time

    About UsTerminal Industries is a pioneering company that leverages cutting-edge machine learning to digitize, index, and automate the yard. Our platform empowers warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel. These fundamental operating assets of commerce represent the last...


  • Austin, Texas, United States Info Way Solutions Full time

    Splunk Administration and SRE ExpertiseWe are seeking a highly skilled Splunk administrator with strong expertise in Site Reliability Engineering (SRE) and DevOps to join our team at Info Way Solutions.Key Responsibilities:Administer and optimize Splunk infrastructure for maximum performance and efficiencyDevelop and implement SRE practices to ensure high...


  • Austin, Texas, United States Oracle Full time

    Job Title: Site Reliability DeveloperOracle is seeking a highly skilled Site Reliability Developer to join our team. As a Site Reliability Developer, you will be responsible for designing, building, and deploying software to improve the availability, scalability, and efficiency of Oracle products and services.Key Responsibilities:Design and develop software...


  • Austin, Texas, United States Terminal Industries Full time

    About UsTerminal Industries is a software company that leverages machine learning to digitize, index, and automate the yard. Our platform provides warehouse operators with the intelligence needed to optimize their usage of trucks, trailers, chassis, containers, and personnel.OverviewOur world-class vision engineering team has built an engine that can process...


  • Austin, Texas, United States Publishing Full time

    Job DescriptionAt Publishing, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure to support our growing business.ResponsibilitiesDesign and implement scalable cloud...


  • Austin, Texas, United States Weedmaps Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesLeverage your engineering expertise to build, monitor, and improve our...


  • Austin, Texas, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based services.Key ResponsibilitiesDesign, develop, and deploy software to improve the availability, scalability, and efficiency of Oracle...