Principal Site Reliability Engineer

1 month ago


Dallas, Texas, United States | CARE | Full time

About CARE

CARE is a consumer tech company with a mission to solve a universal challenge: finding great care for the ones we love. We're a team of entrepreneurs, self-starters, and big thinkers united behind a common cause. Our culture and products reflect our values of empathy, innovation, and collaboration.

Work Environment

CARE offers a hybrid work environment, with in-office days on Monday, Wednesday, and Friday. Our locations include Salt Lake City, Austin, and Dallas.

Job Summary

We're seeking a Principal Site Reliability Engineer. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our critical systems. You'll lead incident response, manage releases, improve observability, and collaborate across development and operations teams to drive continuous improvements.

Key Responsibilities

  • Coordinate releases for applications, ensuring efficient deployment and smooth rollbacks.
  • Lead incident management, facilitate root cause analysis, and continuously update response processes.
  • Implement proactive monitoring, create dashboards, and set up real-time alerts for critical services.
  • Ensure system stability during critical post-release periods, monitoring performance and preventing incidents.
  • Work closely with developers and QA teams to ensure performance benchmarks and observability goals are met.
  • Define and measure service levels for key workflows and APIs, ensuring alignment with business expectations.
  • Continuously assess and improve observability practices across teams, yielding data-driven insights.

Requirements

  • 6+ years of experience in SRE or DevOps roles with a focus on monoliths and distributed microservices in cloud environments (AWS, GCP).
  • Proficiency in CI/CD and infrastructure automation tools (Jenkins, Terraform, Ansible).
  • Strong experience with Kubernetes, Docker, and JVM-based monoliths.
  • Expertise in monitoring and analytics tools (SignalFx, Splunk, Amplitude) and production incident management.
  • Scripting skills (Python, Bash, or Groovy).
  • Strong understanding of cloud-based systems and containerization.
  • Excellent communication skills and a collaborative approach to working cross-functionally.
  • Experience optimizing large-scale, customer-facing platforms in fast-paced environments.

Perks and Benefits

CARE offers a competitive salary range of $180,000 to $200,000, as well as a variety of benefits, including health insurance coverage, life and disability insurance, a generous 401(k) employer matching program, paid holidays, and paid time off (PTO).

CARE is an equal opportunity employer and recognizes the power of a diverse and inclusive workforce. We encourage applications from individuals with varied experiences, perspectives, and backgrounds.