Site Reliability Engineering Director

3 hours ago


Plano, Texas, United States Toyota Full time
Job Summary

We are seeking a highly skilled Director of Site Reliability Engineering to lead our new SRE team at Toyota Financial Services. As a key member of our organization, you will be responsible for building and managing a team of engineers to ensure the reliability, performance, and scalability of our systems and applications.

Key Responsibilities
  • Support engineers with hands-on coding, debugging, and implementation of automation to support a stable and robust application environment.
  • Foster a collaborative team culture and support professional development.
  • Define and implement strategies for system reliability, performance, and scalability.
  • Develop Service Level Objectives (SLOs) and Service Level Agreements (SLAs) aligned with business goals.
  • Design and deploy monitoring, alerting, and incident management systems.
  • Implement and refine disaster recovery and business continuity plans.
  • Lead major incident responses and coordinate with stakeholders for resolution.
  • Conduct post-incident reviews and drive continuous improvement.
  • Identify and implement automation opportunities to streamline operations.
  • Oversee the development and implementation of monitoring and incident management tools.
  • Work with engineering, product, and infrastructure teams on reliability goals.
  • Participate in architectural reviews, providing input on reliability and scalability.
  • Recruit, build, and lead the new SRE team with clear objectives and metrics.
Requirements
  • 7+ years of experience in Site Reliability Engineering, DevOps, or a related field, with at least 3 years in a leadership role.
  • Demonstrated experience in building and managing teams, with a proven track record of achieving high system reliability and performance.
  • Deep understanding of cloud platforms (e.g., AWS, GCP, Azure) and container orchestration technologies (e.g., Kubernetes).
  • Proficiency in scripting and automation (e.g., Python, Bash) and familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
  • Strong leadership capabilities, with excellent problem-solving and decision-making skills.
  • Effective communication skills, with the ability to convey complex technical concepts to diverse audiences.
What We Offer
  • A work environment built on teamwork, flexibility, and respect.
  • Professional growth and development programs to help advance your career, as well as tuition reimbursement.
  • Team Member Vehicle Purchase Discount.
  • Toyota Team Member Lease Vehicle Program (if applicable).
  • Comprehensive health care and wellness plans for your entire family.
  • Flextime and virtual work options (if applicable).
  • Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute.
  • Paid holidays and paid time off.
  • Referral services related to prenatal services, adoption, childcare, schools and more.
  • Tax Advantaged Accounts (Health Savings Account, Health Care FSA, Dependent Care FSA)
  • Relocation assistance (if applicable)


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly skilled and experienced Director of Site Reliability Engineering to lead our new SRE team at Toyota North America. As a key member of our organization, you will be responsible for building and managing a high-performing team that ensures the reliability, performance, and scalability of our systems and applications.Key...


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly skilled Director of Site Reliability Engineering to join our team at Toyota North America. As a key member of our organization, you will be responsible for building and leading a high-performing SRE team that ensures the reliability, performance, and scalability of our systems and applications.Key ResponsibilitiesSupport...


  • Plano, Texas, United States Toyota Full time

    About the RoleWe are seeking a highly skilled Director of Site Reliability Engineering to lead our new SRE team at Toyota Financial Services. As a key member of our organization, you will be responsible for building and establishing robust processes to ensure the reliability, performance, and scalability of our systems and applications.Key...


  • Plano, Texas, United States Toyota Full time

    About ToyotaToyota is a world-renowned brand that is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve.Job SummaryWe are seeking a highly skilled and experienced Director of Site Reliability Engineering to spearhead our new SRE team. As a key member of our team, you will...


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly experienced Site Reliability Engineering Director to lead our new SRE team at Toyota North America. As a key member of our organization, you will be responsible for building and managing a high-performing team that ensures the reliability, performance, and scalability of our systems and applications.Key...


  • Plano, Texas, United States Trident Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "Trident Consulting is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for leading the development and implementation of geospatial application performance monitoring strategies. Key Responsibilities: * Lead the development and...


  • Plano, Texas, United States Hispanic Technology Executive Council Full time

    About UsAt Hispanic Technology Executive Council, we are driven by a shared purpose to harness the power of technology to drive innovation and growth. Our team is dedicated to creating a workplace that is inclusive, diverse, and supportive of our employees' well-being.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a...


  • Plano, Texas, United States Bank of America Full time

    About the RoleAt Bank of America, we are committed to delivering exceptional service and support to our customers. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our enterprise security solutions, including Crowdstrike Falcon.Key ResponsibilitiesPartner with engineering and technology teams to...


  • Plano, Texas, United States Hispanic Technology Executive Council Full time

    About the RoleWe are seeking a highly skilled Director of Technical Program Management to lead our Site Reliability Engineering team. As a key member of our Enterprise product and platform organization, you will be responsible for driving large-scale enterprise initiatives in the data management and engineering space.About the TeamOur team is dedicated to...

  • Platform Engineer

    5 days ago


    Plano, Texas, United States Capital One Full time

    Job Title: Platform Engineer - Site Reliability EngineeringCapital One is seeking a highly skilled Platform Engineer to join our Site Reliability Engineering (SRE) team. As a Platform Engineer, you will be responsible for designing, developing, and deploying scalable and reliable cloud-based systems.Key Responsibilities:Collaborate with product owners to...


  • Plano, Texas, United States AT&T Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at AT&T. As a key member of our Consumer Technology experience team, you will be responsible for delivering innovative and reliable technology solutions to power differentiated, simplified customer experiences.The ideal candidate will have a strong background in...


  • Plano, Texas, United States Dexian - DISYS Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian - DISYS. As a key member of our engineering team, you will be responsible for designing, building, and maintaining cloud native applications and infrastructure.Key Responsibilities:Establish frameworks and best practices for...


  • Plano, Texas, United States Dexian Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key...


  • Plano, Texas, United States Dexian Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key...


  • Plano, Texas, United States MSRCOSMOS Full time

    Job DescriptionMSRCOSMOS is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Site Reliability and Observability Engineering team, you will be responsible for ensuring the reliability and performance of our network and applications.Key Responsibilities:Design and implement automation solutions to improve...


  • Plano, Texas, United States AT&T Full time

    Job Title: Principal Site Reliability EngineerAT&T is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for ensuring the high availability, reliability, and resiliency of our customer-facing experiences and shared omnichannel platforms.Key...


  • Plano, Texas, United States Request Technology Full time

    Job Title: Sr. Director, Network Reliability EngineeringWe are seeking a highly experienced Sr. Director, Network Reliability Engineering to join our team at Request Technology. This is a full-time, permanent role that requires a strong background in network engineering and leadership.Responsibilities:Lead the development of a network services API-driven...


  • Plano, Texas, United States Capital One Full time

    Job Title: Lead Platform Engineer, Site Reliability EngineeringCapital One is seeking a highly skilled Lead Platform Engineer, Site Reliability Engineering to join our team. As a key member of our engineering organization, you will be responsible for designing, developing, and deploying scalable and reliable cloud-based systems.Key...


  • Plano, Texas, United States Bank of America Full time

    Senior Site Reliability EngineerAt Bank of America, we are committed to delivering exceptional customer experiences through the power of technology. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our cloud-based identity systems.Key Responsibilities:Collaborate with cross-functional teams to...


  • Plano, Texas, United States Request Technology Full time

    Job Title: Sr. Director, Network Reliability EngineeringWe are seeking a highly experienced Sr. Director, Network Reliability Engineering to join our team at Request Technology. This is a full-time, permanent role that requires a strong background in network engineering and leadership.Responsibilities:Lead the development of a network services API-driven...