We have other current jobs related to this field that you can find below


  • Dallas, United States VDart Inc Full time

    Job DescriptionJob DescriptionTitle: SRE / Site Reliability EngineerLocation: TX/Dallas Hybrid/OnsiteDuration: 1 YearSkillsHelp build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams.Apply automation and software to any tasks or parts of the system that would benefit from...


  • Dallas, United States Apple Full time

    Site Reliability Engineering (SRE) Manager - Apple Service Engineering Austin, Texas, United States Software and Services Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Join...


  • Dallas, United States VIZIO Full time

    About the Team: VIZIO releases firmware & software for millions of customers in a time efficient manner. Our goal is to maintain 99.9% uptime for our customers. We are seeking a Site Reliability Engineer to join our expanding organization. The Site Reliability Engineer will report to the Manager, DevOps Security and will play a crucial role in enhancing the...


  • Dallas, United States Diamondpick Full time

    Hi,Hope you are doing well.Please find the below JD.Title: SRE EngineerLocation: Dallas, TX Type of Hire: Full TimeJob Description:The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a...


  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, United States Themesoft Inc. Full time

    Role: Site Reliability EngineerLocation: Dallas, TexasFull TimeSalary: $140,000 + Bonus+ BenefitsThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....


  • Dallas, United States Motion Recruitment Full time

    Job Description Our client, an independent services business that focuses on delivering a unified operating model for cloud, data, IoT and managed services, is looking for a Site Reliability Engineer who will be accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. This...


  • Dallas, United States Themesoft Inc. Full time

    The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. The roleMonitor application performance, take steps to improve overall application performance...


  • Dallas, United States Net2Source Inc. Full time

    This is W2 position only and Only Local Candidates will be required.Role-SRE (Site Reliability Engineer)Rate-$60/hr. on W2Oniste-3 days a week-Dallas, TXListed below is the JD:The roleThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability,...


  • Dallas, United States Net2Source Inc. Full time

    This is W2 position only and Only Local Candidates will be required.Role-SRE (Site Reliability Engineer)Rate-$60/hr. on W2Oniste-3 days a week-Dallas, TXListed below is the JD:The roleThe Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability,...


  • Dallas, United States Motion Recruitment Partners LLC Full time

    Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...


  • Dallas, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Galaxy i Technologies, Inc., is seeking the following. Apply via Dice today! Site Reliability Engineer Location: Dallas TX Onsite Full Time Skill: Site Reliability Engineer Ensures supported applications are functioning and available by minimizing downtime...


  • Dallas, United States Motion Recruitment Full time

    Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...


  • Dallas, United States Appspace Full time

    Your Role as a Site Reliability Engineer: Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will...


  • Dallas, United States Motion Recruitment Full time

    Job Description Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This individual will oversee the functionality and performance of their application, coming up with ideas to make it more stable and efficient, and leading the implementation of those...


  • Dallas, United States Diverse Lynx Full time

    Role : Site Reliability Engineer/Devops Engineer Location : Dallas TX (Onsite) Duration: Full-time Job Description Skill: Site Reliability Engineer Ensures supported applications are functioning and available by minimizing downtime and maximizing performance. Provides technical expertise to the stakeholders and end user ensuring continuous...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, Texas, United States American Airlines Full time

    IntroductionAre you ready to embark on a journey filled with opportunities, both professionally and personally? Become a part of the American Airlines family, where you can explore the globe, enhance your skills, and evolve into your best self. As you begin this exciting chapter, you will face challenges with adaptability and poise, acquiring new...


  • Dallas, United States PMG Full time

    Job DescriptionJob DescriptionPMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business...


  • Dallas, Texas, United States PMG Full time

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business transformation.WHO WE AREAgile. Authentic....

SRE (Site Reliability Engineer)

2 months ago


Dallas, United States Econosoft Inc. Full time

Title: Sr. SRE Observability Engineer Skills: 8+ years of experience in AWS, configuring alerts, monitoring, Open Telemetry framework, Terraform, and scripting. In-depth knowledge of observability tools such as Prometheus, Grafana, Splunk, Netcool, ELK, AIM, Sumologic, and New Relics. Strong understanding of licensing mechanisms and MELT. Experience with Cloud Platforms (AWS/Azure), Kubernetes, CI/CD (Jenkins), and Infrastructure as Code (Terraform). Ability to read and write code in Java, Python, Ruby, Node.js, and other relevant languages. Proven experience in creating dashboards, establishing design patterns, and understanding application flows in containerized/microservice environment. Excellent communication skills and the ability to work effectively across teams. Description: Implement and maintain observability solutions using Prometheus as the backend and GEM as the middle end. o Develop and manage Grafana dashboards for visualizing metrics and performance data. o Optimize and configure licensing mechanisms for observability tools. o Write and manage complex queries and alert definitions. o Bridge the gap between application development teams and SRE operations. o Manage and optimize OpenShift, Linux environments, and Grafana Enterprise Metrics. o Utilize MELT (Metrics, Events, Logs, and Traces) and plan for long-term data migration to AWS S3. o Configure and manage monitoring, alerts, and observability using a range of tools including Splunk, Netcool, ELK, and AIM. o Maintain deep technical knowledge and operational experience with tools like AppDynamics, DataDog, Dynatrace, NewRelic, Sumologic, Splunk, Prometheus, and Grafana. o Understand and write code (Java, Python, Ruby, Node.js, etc.), programs, config files, and complex queries. o Implement and manage Infrastructure as Code (IAC) using Terraform. o Manage and optimize cloud platforms (AWS/Azure) and Kubernetes environments. o Establish design patterns for monitoring and benchmarking application uptime and performance. o Provide thought leadership and strategy in implementing and maintaining observability solutions. o Onboard new teams and data sources into the observability solutions. o Create and maintain operational process documentation for observability solutions. o Optimize the Observability Suite for monitoring applications and infrastructure. o Write queries for alerts, dashboards, and reporting.

#J-18808-Ljbffr