Senior Site Reliability Engineer

1 day ago


Plano, Texas, United States MSRCOSMOS Full time
Job Description

MSRCOSMOS is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Site Reliability and Observability Engineering team, you will be responsible for ensuring the reliability and performance of our network and applications.

Key Responsibilities:

  • Design and implement automation solutions to improve network and service availability
  • Collaborate with cross-functional teams to identify and resolve technical issues
  • Develop and maintain a catalog of reliability scripts, tools, and libraries
  • Monitor and analyze network performance to identify areas for improvement
  • Act as a Tier 3 escalation for issues related to our observability platform

Requirements:

  • Bachelor's degree in Computer Science or related field
  • 3+ years of experience in scripting languages such as Python and JavaScript
  • 3+ years of experience in event-driven engineering and AIOps
  • 3+ years of experience in cloud platforms such as AWS, Azure, and GCP
  • 5+ years of technical experience in areas such as AWS Cloud Engineering, 5G ORAN, 5G Core, and Data and Transport Engineering

Preferred Skills:

  • Experience with tools such as DataDog, Grafana, ServiceNow, and Solarwinds
  • Experience with log analysis and system tracing
  • Intermediate understanding of RestAPIs, Apache Spark, and Kafka

About MSRCOSMOS:

MSRCOSMOS is a leading provider of network and application reliability solutions. We are committed to delivering high-quality products and services to our customers.



  • Plano, Texas, United States Dexian - DISYS Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian - DISYS. As a key member of our engineering team, you will be responsible for designing, building, and maintaining cloud native applications and infrastructure.Key Responsibilities:Establish frameworks and best practices for...


  • Plano, Texas, United States Dexian Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key...


  • Plano, Texas, United States Dexian Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Dexian. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key...


  • Plano, Texas, United States Bank of America Full time

    Senior Site Reliability EngineerAt Bank of America, we are committed to delivering exceptional customer experiences through the power of technology. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our cloud-based identity systems.Key Responsibilities:Collaborate with cross-functional teams to...


  • Plano, Texas, United States Pizza Hut Full time

    Job SummaryAs a Senior Manager, Engineering Site Reliability, you will lead a team of experienced engineers responsible for designing, implementing, and maintaining the infrastructure that supports our website, mobile app, and API. You will work closely with our Incident Management team to ensure that our infrastructure is reliable and scalable. You will...


  • Plano, Texas, United States Trident Consulting Full time

    {"h1": "Site Reliability Engineer", "p": "Trident Consulting is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for leading the development and implementation of geospatial application performance monitoring strategies. Key Responsibilities: * Lead the development and...


  • Plano, Texas, United States Hispanic Technology Executive Council Full time

    About UsAt Hispanic Technology Executive Council, we are driven by a shared purpose to harness the power of technology to drive innovation and growth. Our team is dedicated to creating a workplace that is inclusive, diverse, and supportive of our employees' well-being.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a...


  • Plano, Texas, United States Dexian - DISYS Full time

    Senior Site Reliability EngineerDexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. We are seeking a Senior Site Reliability Engineer to join our team.Key Responsibilities:Establish frameworks, best practices, and scope management for Incident Management as we transition into a Site...


  • Plano, Texas, United States Dexian - DISYS Full time

    Senior Site Reliability EngineerDexian - DISYS is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key...


  • Plano, Texas, United States Bank of America Full time

    About the RoleAt Bank of America, we are committed to delivering exceptional service and support to our customers. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our enterprise security solutions, including Crowdstrike Falcon.Key ResponsibilitiesPartner with engineering and technology teams to...


  • Plano, Texas, United States Dexian Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our Incident Management team. As a key member of our team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.Key Responsibilities:Partner...

  • Platform Engineer

    4 days ago


    Plano, Texas, United States Capital One Full time

    Job Title: Platform Engineer - Site Reliability EngineeringCapital One is seeking a highly skilled Platform Engineer to join our Site Reliability Engineering (SRE) team. As a Platform Engineer, you will be responsible for designing, developing, and deploying scalable and reliable cloud-based systems.Key Responsibilities:Collaborate with product owners to...


  • Plano, Texas, United States AT&T Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at AT&T. As a key member of our Consumer Technology experience team, you will be responsible for delivering innovative and reliable technology solutions to power differentiated, simplified customer experiences.The ideal candidate will have a strong background in...


  • Plano, Texas, United States Dexian - DISYS Full time

    We are seeking a Senior Site Reliability Engineer to join our team at Dexian - DISYS. As a key member of our Incident Management team, you will be responsible for establishing frameworks, best practices, and scope management as we transition Incident Management into a Site Reliability Engineering team.You will partner with Platform Engineering, Development,...


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly skilled Director of Site Reliability Engineering to join our team at Toyota North America. As a key member of our organization, you will be responsible for building and leading a high-performing SRE team that ensures the reliability, performance, and scalability of our systems and applications.Key ResponsibilitiesSupport...


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly skilled and experienced Director of Site Reliability Engineering to lead our new SRE team at Toyota North America. As a key member of our organization, you will be responsible for building and managing a high-performing team that ensures the reliability, performance, and scalability of our systems and applications.Key...


  • Plano, Texas, United States Toyota Full time

    About the RoleWe are seeking a highly skilled Director of Site Reliability Engineering to lead our new SRE team at Toyota Financial Services. As a key member of our organization, you will be responsible for building and establishing robust processes to ensure the reliability, performance, and scalability of our systems and applications.Key...


  • Plano, Texas, United States Toyota Full time

    About ToyotaToyota is a world-renowned brand that is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve.Job SummaryWe are seeking a highly skilled and experienced Director of Site Reliability Engineering to spearhead our new SRE team. As a key member of our team, you will...


  • Plano, Texas, United States Capital One Full time

    Job SummaryCapital One is seeking a skilled Platform Engineer to join our team. As a Platform Engineer, you will be responsible for designing, developing, testing, implementing, and supporting technical solutions across a full-stack of development tools and technologies.Key ResponsibilitiesCollaborate with product owners to understand desired application...


  • Plano, Texas, United States Toyota North America Full time

    About the RoleWe are seeking a highly experienced Site Reliability Engineering Director to lead our new SRE team at Toyota North America. As a key member of our organization, you will be responsible for building and managing a high-performing team that ensures the reliability, performance, and scalability of our systems and applications.Key...