Site Reliability Engineer

9 hours ago


Dallas, Texas, United States Diverse Lynx Full time
Site Reliability Manager

We are seeking a highly skilled Site Reliability Manager to join our team at Diverse Lynx LLC. As a key member of our technical team, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications.

Key Responsibilities:
  • Design and implement monitoring and alerting systems to notify on symptoms and not on outages.
  • Develop and maintain automation scripts to improve deployment processes, change management, and release management.
  • Debug production issues across services and levels of the stack.
  • Propose ideas and solutions to improve resiliency, availability, security, and performance of our systems.
  • Plan and execute configuration change operations at the application and infrastructure levels.
  • Actively look for opportunities to improve availability and performance by applying learnings from monitoring and observation.
  • Complete Root Cause Analysis (RCA) investigations and improve DevSecOps practices.
  • Assist in providing inputs to develop strategic technology roadmaps and respond to incidents.
Requirements:
  • Expertise in Azure, Dynatrace, GitHub, and other cloud native technologies.
  • Strong technical knowledge and skills in various hardware, software, and technology platforms.
  • Experience with infrastructure as a service (IaaS), platform as a service (PaaS) tools, and containers and container orchestration platforms.
  • Hands-on experience with open source logging infrastructure, Node JS, and GQL.
  • Ability to script automated performance testing scenarios for APIs and web front ends and embed in CI/CD pipelines.
Preferred Qualifications:
  • Terraform experience in Azure and on-prem infrastructure resources.
  • Load balancing experience, including proxies and CDN.
  • Experience with database and persistence frameworks, such as Mongo and Oracle.
  • Web services experience, including Graph QL and REST/SOAP.

Diverse Lynx LLC is an Equal Employment Opportunity employer. We promote and support a diverse workforce across all levels in the company.



  • Dallas, Texas, United States CV Library Full time

    {"title": "Site Reliability Engineer", "description": "Job SummaryWe are seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our applications and services.Key ResponsibilitiesMonitor and analyze system performance to identify areas...


  • Dallas, Texas, United States STIAOS Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at STIAOS Technologies in Dallas, TX. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our software systems.Key Responsibilities:Collaborate with cross-functional teams to identify and...


  • Dallas, Texas, United States Learfield Full time

    About LearfieldLearfield is a leading media and technology services company in intercollegiate athletics, unlocking the value of college sports for brands and fans through an omnichannel platform with innovative content and commerce solutions for fan engagement.Job SummaryWe are seeking an experienced Senior Site Reliability Engineer to join our team,...


  • Dallas, Texas, United States Goldman Sachs Full time

    About This RoleWe are seeking a highly skilled Site Reliability Engineering Specialist to join our team at Goldman Sachs. As a Site Reliability Engineer, you will play a critical role in ensuring the availability and reliability of our firm's most critical platform services.Key ResponsibilitiesDevelop and implement incident management processes to ensure...


  • Dallas, Texas, United States Signify Health Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our vibrant team at Signify Health. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and availability of our products.Key ResponsibilitiesDevelop and Implement Strategies to improve the performance and reliability of our...


  • Dallas, Texas, United States Learfield Full time

    Learfield is seeking a seasoned Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for ensuring the reliability, availability, and performance of our live services. Your expertise in Linux container technologies, public and private clouds, and cloud orchestration frameworks will be instrumental in...

  • Software Engineer

    4 days ago


    Dallas, Texas, United States Federal Reserve Bank Full time

    About the RoleThe Federal Reserve Bank of Dallas is seeking a highly motivated and experienced Software Engineer to join our Site Reliability Engineering (SRE) team. As a key member of our team, you will be responsible for designing, developing, and implementing scalable, highly available system architectures to handle increasing loads and user demands.Key...

  • Software Engineer

    12 hours ago


    Dallas, Texas, United States Federal Reserve Bank Full time

    About the RoleThe Federal Reserve Bank of Dallas is seeking a highly motivated and experienced Software Engineer to join our Site Reliability Engineering (SRE) team. As a key member of our team, you will be responsible for designing, developing, and implementing scalable, highly available system architectures to handle increasing loads and user demands.Key...


  • Dallas, Texas, United States Signify Health Full time

    About the Role:Signify Health is seeking a highly skilled Site Reliability Engineer II to join our vibrant team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and availability of our products.Your Key Responsibilities:Develop and Implement Strategies: Design and implement strategies to improve the...


  • Dallas, Texas, United States Diverse Lynx Full time

    Job SummaryDiverse Lynx LLC is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our cloud-based systems.Key ResponsibilitiesSystem Monitoring and Alerting: Develop and maintain monitoring tools and alerting systems to...


  • Dallas, Texas, United States Apple Full time

    Job SummaryApple is seeking a highly skilled Site Reliability Engineering Manager to lead a team responsible for providing a platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish.Key ResponsibilitiesEstablish and maintain SRE practices for a private cloud service to...


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Procmon Platform team at Goldman Sachs. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our systems, which manage hundreds of thousands of compute cores.ResponsibilitiesOwn technical operations for systems that...


  • Dallas, Texas, United States Wise Skulls llc Full time

    Job OverviewPosition: Site Reliability Engineer (Python)Location: Dallas, TX (On-site presence required)Contract Duration: 12 monthsPartnering Company: Wise Skulls LLCClient: ConfidentialKey Responsibilities:Minimum of 5 years of relevant experience in the field.Proficient in Python programming and familiar with frameworks such as Django or Flask.Mandatory...


  • Dallas, Texas, United States Hitachi Full time

    About the RoleWe're seeking a highly skilled Lead Application Support Site Reliability Engineer to join our team at Hitachi Digital Services. As a key member of our Site Reliability Engineering team, you will be responsible for ensuring the availability, reliability, and performance of our services and platforms in a highly transactional 24x7 environment.Key...


  • Dallas, Texas, United States Cognizant Full time

    Senior Site Reliability Engineer (Hybrid) Cognizant stands as a prominent global entity delivering IT solutions, encompassing digital transformation, technology services, consulting, and operational support. At Cognizant, we embrace innovative thinking and explore new concepts daily. Our mission is to assist leading enterprises in reimagining their...


  • Dallas, Texas, United States Saxon Global Full time

    About the RoleSaxon Global is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our Production Support/SRE team, you will work collaboratively with various teams to deliver high-quality engineering services and solutions to our stakeholders.Key ResponsibilitiesDesign, develop, and implement tools to improve the...


  • Dallas, Texas, United States Cambium Learning Group Full time

    Job Overview:We are seeking a highly skilled Senior Site Reliability Engineer to enhance our infrastructure monitoring capabilities and application performance. This role will be responsible for implementing best practices, leveraging appropriate tools and technologies, and collaborating with cross-functional teams to optimize system performance and...


  • Dallas, Texas, United States Diverse Lynx Full time

    Site Reliability ManagerWe are seeking a highly skilled Site Reliability Manager to join our team at Diverse Lynx LLC. As a key member of our organization, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement monitoring and alerting systems to notify on...


  • Dallas, Texas, United States The Goldman Sachs Group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer, VP to join our team at The Goldman Sachs Group. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining large-scale distributed systems that support our business operations.Key ResponsibilitiesOwn technical operations for systems...


  • Dallas, Texas, United States Diverse Lynx Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our technical team, you will be responsible for ensuring the reliability and scalability of our cloud-based eComm platform.Key ResponsibilitiesDesign and implement monitoring and support tools to ensure optimal system...