SRE Engineer
2 weeks ago
Location: Austin [Hybrid]
Job Description:
We are currently seeking a highly skilled SRE hands-on Lead Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. As a crucial figure in this role, you will participate/help designing and implementing cutting-edge SRE solutions, driving the transformation of IT operations organizations to adopt an engineering-centric approach.
Responsibilities:
• Participate in design, architecture of reliable, scalable, and high-performance systems and services with a focus on operational excellence, availability, and performance.
• Primary skillset to be expertise in Observability as service, Telemetry data collection using Dynatrace APM, SolarWinds, Open-Source tools (Prometheus and Grafana), Log Aggregations (Kibana or Splunk) and AIOPS Tools
• Configure application performance monitoring (APM), infrastructure monitoring, synthetic monitoring, RUM, and log monitoring.
• Integrate Dynatrace with CI/CD pipelines, alerting tools, ITSM systems, and incident automation frameworks.
• Tune alert thresholds, baselines, and AI-driven anomaly detection to reduce noise and improve actionable insights.
• Deeper understanding of Login authentication mechanisms using Ping, ForgeRock and SiteMinder technologies (session management and cookie management)
• Correlation mechanisms and dashboards to have end to end visibility of requests from external to internal applications.
• Evangelize SRE evolution within IT operations and promoting a culture of engineering excellence and best practices.
• Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation.
• Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.
• Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identifying areas for optimization.
• Understanding incident and problem management process, post-mortems, and driving improvements to prevent future incidents.
• Analyze resource utilization patterns and forecasting future capacity needs to ensure optimal performance and cost-efficiency.
• Ensure that SRE practices align with security and compliance requirements and implementing measures to protect systems and data.
• Operational excellence with focus on automation and developing tools to streamline operational tasks and increase efficiency.
• Provide guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practice.
-
Software Development Engineer
5 days ago
Austin, TX, United States Info Way Solutions Full timeSoftware Development Engineer - SRE Preferred Location: Onsite, (Austin/SVC) Requirement Skilled at writing clean, high-performant and unit-testable code in Java Proficiency with the architecture, deployment, performance tuning, and troubleshooting large scale distributed systems on AWS Understanding of SRE principals including monitoring, alerting, error...
-
Software Development Engineer
1 week ago
Austin, TX, United States Info Way Solutions Full timeSoftware Development Engineer - SRE Preferred Location: Onsite, (Austin/SVC) Requirement Skilled at writing clean, high-performant and unit-testable code in Java Proficiency with the architecture, deployment, performance tuning, and troubleshooting large scale distributed systems on AWS Understanding of SRE principals including monitoring, alerting, error...
-
Software Development Engineer
1 week ago
Austin, TX, United States Info Way Solutions Full timeSoftware Development Engineer - SRE Preferred Location: Onsite, (Austin/SVC) Requirement Skilled at writing clean, high-performant and unit-testable code in Java Proficiency with the architecture, deployment, performance tuning, and troubleshooting large scale distributed systems on AWS Understanding of SRE principals including monitoring, alerting, error...
-
Compute SRE Software Engineer
7 days ago
Austin, TX, United States Apple Full timeRole Number: 200633176-0157 Summary People at Apple don’t just build products — they craft the kind of experience that has revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple...
-
Compute SRE Software Engineer
2 weeks ago
Austin, TX, United States Apple Full timeRole Number: 200633176-0157 Summary People at Apple don’t just build products — they craft the kind of experience that has revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple...
-
Python Developer with Kubernetes/ SRE
13 hours ago
Austin, TX, United States Yantran LLC Full timeJob Title: Python SRE Engineer Location: Austin, TX Salary Range: *** to 120,000/Annual for Vendors: *** ***/hr CTC Job Summary: We are seeking a skilled Python Developer with strong Kubernetes experience to join our team in Austin. The ideal candidate will have a solid foundation in software development and a keen interest in Site Reliability Engineering...
-
SRE - Site Reliability Engineer - Senior
2 weeks ago
Austin, TX, United States Manpower Group Inc. Full timeOur client, a leading organization in the technology and cloud services sector, is seeking a SRE - Site Reliability Engineer - Senior to join their team. As a SRE - Site Reliability Engineer - Senior, you will be part of the Infrastructure Support team supporting cloud-native application deployment and reliability. The ideal candidate will demonstrate strong...
-
SRE - Site Reliability Engineer - Senior
2 weeks ago
Austin, TX, United States Manpower Group Inc. Full timeOur client, a leading organization in the technology and cloud services sector, is seeking a SRE - Site Reliability Engineer - Senior to join their team. As a SRE - Site Reliability Engineer - Senior, you will be part of the Infrastructure Support team supporting cloud-native application deployment and reliability. The ideal candidate will demonstrate strong...
-
SRE - Site Reliability Engineer - Senior
1 week ago
Austin, TX, United States Manpower Group Inc. Full timeOur client, a leading organization in the technology and cloud services sector, is seeking a SRE - Site Reliability Engineer - Senior to join their team. As a SRE - Site Reliability Engineer - Senior, you will be part of the Infrastructure Support team supporting cloud-native application deployment and reliability. The ideal candidate will demonstrate strong...
-
SRE - Site Reliability Engineer - Senior
6 days ago
Austin, TX, United States Manpower Group Inc. Full timeOur client, a leading organization in the technology and cloud services sector, is seeking a SRE - Site Reliability Engineer - Senior to join their team. As a SRE - Site Reliability Engineer - Senior, you will be part of the Infrastructure Support team supporting cloud-native application deployment and reliability. The ideal candidate will demonstrate strong...