Site Reliability Manager

2 months ago


Dallas, United States Diverse Lynx Full time
Job Summary:
Top Qualifications:
1. Azure
2. Dynatrace
3. GitHub

Job Summary:
We are seeking a Site Reliability Manager with 8 to 12 years of experience to join our team. The ideal candidate will have expertise in Database Design, MySQL, Node.js, Kubernetes, iPaaS, Dynatrace, Azure CI, Moogsoft, GITHUB, MongoDB, and PostgreSQL. This role involves managing geospatial data projects, ensuring data integrity, and leveraging advanced technologies to drive business outcomes.

Required Skills: MySQL, Node.js, Kubernetes, iPaaS, Dynatrace, Azure CI, Moogsoft, GITHUB, MongoDB, PostgreSQL

Roles & Responsibilities :
Make monitoring and alerting notify on symptoms and not on outages.
Document so your findings turn into repeatable actions-and then into automation.
Improve the deployment process, change mgmt., release mgmt. processes to make it efficient and streamlined.
Debug production issues across services and levels of the stack.
Proposes ideas and solutions within the product team to improve resiliency, availability, security.
Plan and execute configuration change operations both at the application and the infrastructure level.
Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
Complete Root Cause Analysis (RCA) investigations
Improving DevSecOps practices and accelerating delivery and take a lead role in troubleshooting technical issues
Assist in providing inputs to develop strategic technology roadmaps
Respond to incidents and provide support for customer incidents

Must to have
- Implement GitHub, GitAction CI or CD and ADO cloud for automation
- Implementing monitoring, observability in AKS and Azure cloud, Kubernetes
- Monitoring and Metrics in Dynatrace, Prometheus, Grafana and integrations with Moogsoft or xMatters
- Open source Logging infrastructure
- Worked in an environment with Node JS and GQL with for 2 years of experience
- Hands-on experience with Infrastructure as a Service (IaaS), Platform as a Service (PaaS) tools and platforms, and containers and container orchestration platforms (aka Docker & Kubernetes)
- Expertise in one or more cloud native relational databases such as MySQL, PostgreSQL and NoSQL databases such as Cassandra and MongoDB highly desired
- Strong technical knowledge and skills that are broad and deep, covering various hardware, software, and technology platforms
- Develop, implement, and maintain applications and systems that integrate MongoDB
- Dynatrace
- Mezmo
- Security Vulnerabilities (remediation or compliance)

Good to have
- Terraform in Azure and on-prem infrastructure resources
- Load balancing the application including Proxies and CDN (automate)
- Able to script Automated performance testing scenarios for APIs and Web front ends and embed in CI/CD pipelines dashboarding/reporting query languages
- Airline Industry experience helpful
- Typescript, JavaScript
- Database and persistence frameworks: Mongo, Oracle, Object/Relational Mapping, Query performance tuning
- Experience with Mongo Schema Design and Mongo Aggregation Framework
- Web Services: Graph QL, REST/SOAP (JSON/WSDL/XML)
- DB Admin/SQL Server, Terraform, SysAdmin, Troubleshooting Network Issues, VM Management

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, Texas, United States Apple Full time

    Job SummaryApple is seeking a highly skilled Site Reliability Engineering Manager to lead a team responsible for providing a platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish.Key ResponsibilitiesEstablish and maintain SRE practices for a private cloud service to...


  • Dallas, United States Themesoft Inc. Full time

    The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. The roleMonitor application performance, take steps to improve overall application performance...


  • Dallas, United States Diamondpick Full time

    Hi,Hope you are doing well.Please find the below JD.Title: SRE EngineerLocation: Dallas, TX Type of Hire: Full TimeJob Description:The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a...


  • Dallas, United States Diamondpick Full time

    Hi,Hope you are doing well.Please find the below JD.Title: SRE EngineerLocation: Dallas, TX Type of Hire: Full TimeJob Description:The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a...


  • Dallas, United States Motion Recruitment Full time

    Dallas, TexasHybridFull Time$160k - $180kOur client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on...


  • Dallas, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Dallas, TX//Onsite Duration: Full Time-Only Job Description Responsible for ensuring the reliability of systems, minimizing downtime, and maintaining service-level objectives (SLOs). Developing, automating, and implementing automation tools to streamline processes, deploy applications, and manage...


  • Dallas, United States Motion Recruitment Full time

    Job Description Our client, an independent services business that focuses on delivering a unified operating model for cloud, data, IoT and managed services, is looking for a Site Reliability Engineer who will be accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. This...


  • Dallas, Texas, United States JPMorganChase Full time

    Job Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Infrastructure Platform, Web Hosting team, you will solve complex...


  • Dallas, United States Appspace Full time

    Your Role as a Site Reliability Engineer: Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will...


  • Dallas, United States VDart Inc Full time

    Job DescriptionJob DescriptionTitle: SRE / Site Reliability EngineerLocation: TX/Dallas Hybrid/OnsiteDuration: 1 YearSkillsHelp build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams.Apply automation and software to any tasks or parts of the system that would benefit from...


  • Dallas, United States Signify Health Full time

    How will this role have an Impact? Join Signify Health's vibrant Site Reliability Engineering team as a Site Reliability Engineer. We're seeking passionate individuals from diverse technical backgrounds. Reporting to the Manager of Site Reliability Engineering, we offer a collaborative environment that values each team member's unique contribution and...


  • Dallas, United States Signify Health Full time

    Job DescriptionJob DescriptionHow will this role have an Impact?Join Signify Health's vibrant Site Reliability Engineering team as a Site Reliability Engineer. We're seeking passionate individuals from diverse technical backgrounds. Reporting to the Manager of Site Reliability Engineering, we offer a collaborative environment that values each team...


  • Dallas, United States TheStaffed Full time

    Our client, a top tier IT Consulting firm if looking for several qualified Site Reliability Engineers to join a Top-Tier Investment Bank. Essential Requirements and Responsibilities :Proficiency in designing, deploying, and maintaining scalable infrastructure on cloud platforms (most specifically AWS) with automation tools like Terraform or Ansible. ...


  • Dallas, United States JPMorganChase Full time

    Job Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Enterprise technology, Infrastructure platforms team, you...


  • Dallas, Texas, United States JPMorganChase Full time

    Job Description There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Enterprise technology, Infrastructure platforms team, you will solve...


  • Dallas, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Galaxy i Technologies, Inc., is seeking the following. Apply via Dice today! Site Reliability Engineer Location: Dallas TX Onsite Full Time Skill: Site Reliability Engineer Ensures supported applications are functioning and available by minimizing downtime...


  • Dallas, United States Motion Recruitment Partners LLC Full time

    Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...