We have other current jobs related to this field that you can find below
-
SRE / Site Reliability Engineer
3 weeks ago
Dallas, United States VDart Inc Full time. Job Description. Title: SRE / Site Reliability Engineer. Location: TX/Dallas, Hybrid/Onsite. Duration: 1 Year. Skills: Help build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams. Apply automation and software to any tasks or parts of the system that would benefit from...
-
Site Reliability Engineering
2 weeks ago
Dallas, United States Apple Full time. Site Reliability Engineering (SRE) Manager - Apple Service Engineering. Austin, Texas, United States. Software and Services. Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Join...
-
Site Reliability Engineer
2 months ago
Dallas, United States VIZIO Full time. About the Team: VIZIO releases firmware & software for millions of customers in a time-efficient manner. Our goal is to maintain 99.9% uptime for our customers. We are seeking a Site Reliability Engineer to join our expanding organization. The Site Reliability Engineer will report to the Manager, DevOps Security and will play a crucial role in enhancing the...
-
Site Reliability Engineer
10 hours ago
Dallas, United States Diamondpick Full time. Hi, hope you are doing well. Please find the JD below. Title: SRE Engineer. Location: Dallas, TX. Type of Hire: Full Time. Job Description: The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a...
-
Site Reliability Engineer
3 weeks ago
Dallas, United States Themesoft Inc. Full time. Role: Site Reliability Engineer. Location: Dallas, Texas. Full Time. Salary: $140,000 + Bonus + Benefits. The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment....
-
Site Reliability Engineer
1 week ago
Dallas, United States Motion Recruitment Full time. Job Description: Our client, an independent services business that focuses on delivering a unified operating model for cloud, data, IoT and managed services, is looking for a Site Reliability Engineer who will be accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. This...
-
Site Reliability Engineer
3 weeks ago
Dallas, United States Themesoft Inc. Full time. The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment. The role: Monitor application performance, take steps to improve overall application performance...
-
Site Reliability Engineer
1 month ago
Dallas, United States Net2Source Inc. Full time. This is a W2-only position, open to local candidates only. Role: SRE (Site Reliability Engineer). Rate: $60/hr on W2. Onsite: 3 days a week, Dallas, TX. Listed below is the JD. The role: The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability,...
-
Site Reliability Engineer
2 weeks ago
Dallas, United States Motion Recruitment Partners LLC Full time. Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...
-
Site Reliability Engineer
2 weeks ago
Dallas, United States Dice Full time. Dice is the leading career destination for tech experts at every stage of their careers. Our client, Galaxy i Technologies, Inc., is seeking the following. Apply via Dice today! Site Reliability Engineer. Location: Dallas, TX, Onsite. Full Time. Skill: Site Reliability Engineer. Ensures supported applications are functioning and available by minimizing downtime...
-
Site Reliability Engineer
2 weeks ago
Dallas, United States Motion Recruitment Full time. Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This person will be responsible for monitoring their application performance, making suggestions to improve performance and stability, and taking the lead on implementing those improvements. The ideal...
-
Site Reliability Engineer
1 day ago
Dallas, United States Appspace Full time. Your Role as a Site Reliability Engineer: Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will...
-
Site Reliability Engineer
2 weeks ago
Dallas, United States Motion Recruitment Full time. Job Description: Our client, a large manager service provider focused on digital solutions and transformation, is looking for a Site Reliability Engineer to join their team. This individual will oversee the functionality and performance of their application, coming up with ideas to make it more stable and efficient, and leading the implementation of those...
-
Site Reliability Engineer
2 months ago
Dallas, United States Diverse Lynx Full time. Role: Site Reliability Engineer/DevOps Engineer. Location: Dallas, TX (Onsite). Duration: Full-time. Job Description. Skill: Site Reliability Engineer. Ensures supported applications are functioning and available by minimizing downtime and maximizing performance. Provides technical expertise to the stakeholders and end user ensuring continuous...
-
Site Reliability Engineer
3 months ago
Dallas, United States Saxon Global Full time. As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep and a flair for technical communication and prioritization. You will work directly...
-
Dallas, Texas, United States American Airlines Full time. Introduction: Are you ready to embark on a journey filled with opportunities, both professionally and personally? Become a part of the American Airlines family, where you can explore the globe, enhance your skills, and evolve into your best self. As you begin this exciting chapter, you will face challenges with adaptability and poise, acquiring new...
-
Site Reliability Engineer
2 months ago
Dallas, United States PMG Full time. Job Description: PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States PMG Full time. PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business transformation. WHO WE ARE: Agile. Authentic....
SRE (Site Reliability Engineer)
2 months ago
Title: Sr. SRE Observability Engineer
Skills:
8+ years of experience in AWS, configuring alerts, monitoring, the OpenTelemetry framework, Terraform, and scripting.
In-depth knowledge of observability tools such as Prometheus, Grafana, Splunk, Netcool, ELK, AIM, Sumo Logic, and New Relic.
Strong understanding of licensing mechanisms and MELT.
Experience with Cloud Platforms (AWS/Azure), Kubernetes, CI/CD (Jenkins), and Infrastructure as Code (Terraform).
Ability to read and write code in Java, Python, Ruby, Node.js, and other relevant languages.
Proven experience in creating dashboards, establishing design patterns, and understanding application flows in containerized/microservice environments.
Excellent communication skills and the ability to work effectively across teams.
Description:
o Implement and maintain observability solutions using Prometheus as the backend and GEM as the middle end.
o Develop and manage Grafana dashboards for visualizing metrics and performance data.
o Optimize and configure licensing mechanisms for observability tools.
o Write and manage complex queries and alert definitions.
o Bridge the gap between application development teams and SRE operations.
o Manage and optimize OpenShift, Linux environments, and Grafana Enterprise Metrics.
o Utilize MELT (Metrics, Events, Logs, and Traces) and plan for long-term data migration to AWS S3.
o Configure and manage monitoring, alerts, and observability using a range of tools including Splunk, Netcool, ELK, and AIM.
o Maintain deep technical knowledge and operational experience with tools like AppDynamics, Datadog, Dynatrace, New Relic, Sumo Logic, Splunk, Prometheus, and Grafana.
o Understand and write code (Java, Python, Ruby, Node.js, etc.), programs, config files, and complex queries.
o Implement and manage Infrastructure as Code (IaC) using Terraform.
o Manage and optimize cloud platforms (AWS/Azure) and Kubernetes environments.
o Establish design patterns for monitoring and benchmarking application uptime and performance.
o Provide thought leadership and strategy in implementing and maintaining observability solutions.
o Onboard new teams and data sources into the observability solutions.
o Create and maintain operational process documentation for observability solutions.
o Optimize the Observability Suite for monitoring applications and infrastructure.
o Write queries for alerts, dashboards, and reporting.