Site Reliability Engineer
2 weeks ago
Description Should be strong SRE, experience with java, AWS / DevOps / deployment strategy and monitoring tools.
Candidates should be with more hands-on experience with Dynatrace / Splunk / CICD / Grafana etc.
Looking for resource with very good application trouble shooting experience.
More on core SRE metrics before going to Prod. uptime vs availability, monitoring vs Observability, and incident and outage etc
Should be familiar with SLO, SLA, SLI or other SRE keywords or terms.
Experience with deploying using CICD pipeline and debugging/troubleshooting issues and coordinate with the application team such as Java, Spring Boot, Python, .Net, etc.
Ability to perform API performance testing using tools such as JMeter / Blazemeter
Experience on identifying RCA for any production issues on AWS environment with multiple microservices
Expertise in Terraform to manage infrastructure as code would be highly desirable
Job responsibilities:
Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
Leads initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to improve service levels
Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers
Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses
Documents and shares knowledge within your organization via internal forums and communities of practice
Required qualifications, capabilities, and skills
Formal training or certification on Software engineering concepts and 5+ years of applied experience
Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform
Fluency in JAVA programming
Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Splunk, Grafana, Dynatrace, Prometheus, Datadog
Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker)
Preferred qualifications, capabilities, and skills
Experience with infrastructure as code tools such as Terraform. also experience managing/supporting Cloud based applications, AWS preferred.
Excellent communications desired
Background Fin-tech experience may be helpful
Troubleshooting common networking technologies and issues
Education:
Bachelors Degree
Additional client information:
#J-18808-Ljbffr
-
Site Reliability Engineer
3 days ago
Plano, United States Dice Full timeDice is the leading career destination for tech experts at every stage of their careers. Our client, Fortis Talent, is seeking the following. Apply via Dice today! Fortis Talent is seeking a Site Reliability Engineer for a Contract to Hire opportunity with one of our top clients in Plano, TX. Required skills are as follows: This is a Contract to Hire on W2...
-
Site Reliability Engineer
3 days ago
Plano, United States Toyota Full timeOverview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for...
-
Plano, United States Tyler Technologies Full timeSite Reliability Engineer, Enterprise Justice Technical SupportPlano,TexasUnited States This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process. This...
-
Principal Site Reliability Engineer
3 days ago
Plano, United States Toyota Motor Sales, U.S.A., Inc. Full timeOverview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for...
-
Principal Site Reliability Engineer
3 days ago
Plano, United States Toyota Deutschland GmbH Full timeOverview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world’s most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We’re looking for...
-
Site Reliability Engineering
4 weeks ago
Plano, United States Forhyre Full timeJob DescriptionJob DescriptionForhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape. To be successful in this roleYou'll have the opportunity to design and implement major infrastructure...
-
Plano, United States Toyota Full timeOverview Who we are At Toyota, we are reimagining mobility through innovative, high-quality technology solutions designed to enhance lives and meet our company mission of "Producing Happiness for All." If you are interested in reimagining mobility with us in an inclusive environment built on teamwork that puts respect for people first, we want to talk to...
-
Site Reliability Engineer, Enterprise Justice
4 weeks ago
Plano, United States Tyler Technologies Full timeDescription This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process. This engineer provides technical guidance to team members and other development teams...
-
Plano, United States Tyler Technologies Full timeDescription This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process. This engineer provides technical guidance to team members and other development teams...
-
Plano, United States Toyota North America Full timeOverviewWho we areAt Toyota, we are reimagining mobility through innovative, high-quality technology solutions designed to enhance lives and meet our company mission of "Producing Happiness for All." If you are interested in reimagining mobility with us in an inclusive environment built on teamwork that puts respect for people first, we want to talk to...
-
Sr. Site Reliability Engineer
1 month ago
Plano, United States Pizza Hut Full time7100 Corporate Drive **Plano, TX 75023** **Sr. Site Reliability Engineer (Remote)** **Description:** Job Description - Site Reliability Engineer If so, you might be just the person we are looking for to fill our Senior Site Reliability Engineering role at Pizza Hut. Site Reliability Engineers are just as adept at software engineering as they are able to be...
-
Senior Manager Site Reliability Engineering
2 weeks ago
Plano, United States CarMax Full timeCarMax, the way your career should be! Who we are looking for:The Senior Technology Manager’s primary responsibility is to partner with their business and technology peers to provide solutions and services that help deliver CarMax’s strategic mission and plans. This position will direct and manage the strategic planning and implementation of enterprise...
-
Senior Manager Site Reliability Engineering
2 weeks ago
Plano, United States CarMax Full timeCarMax, the way your career should be! Who we are looking for:The Senior Technology Manager’s primary responsibility is to partner with their business and technology peers to provide solutions and services that help deliver CarMax’s strategic mission and plans. This position will direct and manage the strategic planning and implementation of enterprise...
-
Senior Manager Site Reliability Engineering
2 weeks ago
Plano, United States CarMax Full timeCarMax, the way your career should be! Who we are looking for:The Senior Technology Manager’s primary responsibility is to partner with their business and technology peers to provide solutions and services that help deliver CarMax’s strategic mission and plans. This position will direct and manage the strategic planning and implementation of enterprise...
-
Plano, United States Intuit Full timeCome join the team at Intuit as a Software Engineer in Site Reliability Engineering. Site Reliability Engineering works to ensure that TurboTax.com and other Intuit products are highly-available, scale without bottlenecks, and offer world-class performance. The team is looking for “full cycle” Software Engineers with a passion for optimization and...
-
Site Reliability Engineer
4 days ago
Plano, United States Hispanic Technology Executive Council Full timeAt Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates...
-
Site Reliability Engineer III
3 days ago
Plano, United States Trintech Full timeSummary This position seeks a versatile technical enthusiast who thrives in the world of technology. The ideal candidate will bridge the gap between business needs and team capabilities, fostering clarity and alignment. They'll engage with internal and external stakeholders, adeptly capturing and articulating requirements while crafting effective solutions....
-
Site Reliability Center Lead
3 days ago
Plano, United States TEKsystems Full timeTEKsystems Site Reliability Center Lead Pittsburgh , Pennsylvania Apply Now The client is seeking a Site Reliability Center Lead to do production support, not new development. The individual will troubleshoot highly technical problems which may require assessing source code to analyze and resolve problems. This requires advanced troubleshooting skills and be...
-
Site Reliability Center Lead
4 days ago
Plano, United States TEKsystems Full timeTEKsystems Site Reliability Center Lead Pittsburgh , Pennsylvania Apply Now The client is seeking a Site Reliability Center Lead to do production support, not new development. The individual will troubleshoot highly technical problems which may require assessing source code to analyze and resolve problems. This requires advanced troubleshooting skills and be...
-
Reliability and Monitoring Engineer
2 weeks ago
Plano, United States ClifyX Full timeReliability and Monitoring Engineer Plano, TX office 3 times a week Infosys/Toyot Bill rate: $75 openings - 5 Responsible for ensuring the availability, performance, and reliability of our cloud-based infrastructure and services. The primary focus of this role is designing, implementing, and managing robust monitoring and alerting systems to proactively...