Site Reliability Engineer

2 weeks ago


Plano, United States JobRialto Full time

Description:

Should be a strong SRE with experience in Java, AWS, DevOps, deployment strategy, and monitoring tools.

Candidates should have substantial hands-on experience with Dynatrace, Splunk, CI/CD, Grafana, etc.

Looking for a resource with very good application troubleshooting experience.

Should understand core SRE metrics and concepts before going to production: uptime vs. availability, monitoring vs. observability, incident vs. outage, etc.

Should be familiar with SLO, SLA, SLI, and other SRE terms.

Experience deploying through CI/CD pipelines, debugging and troubleshooting issues, and coordinating with application teams working in Java, Spring Boot, Python, .NET, etc.

Ability to perform API performance testing using tools such as JMeter or BlazeMeter

Experience performing root cause analysis (RCA) for production issues in an AWS environment with multiple microservices

Expertise in Terraform to manage infrastructure as code would be highly desirable

Job responsibilities:

Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team

Leads initiatives to improve the reliability and stability of your team's applications and platforms using data-driven analytics to improve service levels

Collaborates with team members to identify comprehensive service level indicators, and with stakeholders to establish reasonable service level objectives and error budgets with customers

Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise

Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses

Documents and shares knowledge within your organization via internal forums and communities of practice

Required qualifications, capabilities, and skills

Formal training or certification in software engineering concepts and 5+ years of applied experience

Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform

Fluency in Java programming

Proficiency and experience in observability, such as white-box and black-box monitoring, SLO alerting, and telemetry collection, using tools such as Splunk, Grafana, Dynatrace, Prometheus, and Datadog

Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)

Experience with containers and container orchestration (e.g., ECS, Kubernetes, Docker)

Preferred qualifications, capabilities, and skills

Experience with infrastructure-as-code tools such as Terraform, and experience managing or supporting cloud-based applications (AWS preferred)

Excellent communication skills desired

A fintech background may be helpful

Troubleshooting common networking technologies and issues

Education:

Bachelor's degree




  • Plano, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Fortis Talent, is seeking the following. Apply via Dice today! Fortis Talent is seeking a Site Reliability Engineer for a Contract to Hire opportunity with one of our top clients in Plano, TX. Required skills are as follows: This is a Contract to Hire on W2...


  • Plano, United States Toyota Full time

    Overview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for...


  • Plano, United States Tyler Technologies Full time

    Site Reliability Engineer, Enterprise Justice Technical Support Plano, Texas, United States This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process. This...


  • Plano, United States Toyota Motor Sales, U.S.A., Inc. Full time

    Overview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for...


  • Plano, United States Toyota Deutschland GmbH Full time

    Overview Who we are Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world’s most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We’re looking for...


  • Plano, United States Forhyre Full time

    Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape. To be successful in this role You'll have the opportunity to design and implement major infrastructure...


  • Plano, United States Toyota Full time

    Overview Who we are At Toyota, we are reimagining mobility through innovative, high-quality technology solutions designed to enhance lives and meet our company mission of "Producing Happiness for All." If you are interested in reimagining mobility with us in an inclusive environment built on teamwork that puts respect for people first, we want to talk to...


  • Plano, United States Tyler Technologies Full time

    Description This Site Reliability Engineer position is a technical role within the Technical and Cloud Services group that helps ensure the reliability, scalability, and performance of our infrastructure while driving automation and efficiency in our development process. This engineer provides technical guidance to team members and other development teams...


  • Plano, United States Toyota North America Full time

    Overview Who we are At Toyota, we are reimagining mobility through innovative, high-quality technology solutions designed to enhance lives and meet our company mission of "Producing Happiness for All." If you are interested in reimagining mobility with us in an inclusive environment built on teamwork that puts respect for people first, we want to talk to...


  • Plano, United States Pizza Hut Full time

    7100 Corporate Drive Plano, TX 75023 Sr. Site Reliability Engineer (Remote) Description: Job Description - Site Reliability Engineer If so, you might be just the person we are looking for to fill our Senior Site Reliability Engineering role at Pizza Hut. Site Reliability Engineers are just as adept at software engineering as they are able to be...


  • Plano, United States CarMax Full time

    CarMax, the way your career should be! Who we are looking for: The Senior Technology Manager’s primary responsibility is to partner with their business and technology peers to provide solutions and services that help deliver CarMax’s strategic mission and plans. This position will direct and manage the strategic planning and implementation of enterprise...


  • Plano, United States Intuit Full time

    Come join the team at Intuit as a Software Engineer in Site Reliability Engineering. Site Reliability Engineering works to ensure that TurboTax.com and other Intuit products are highly-available, scale without bottlenecks, and offer world-class performance. The team is looking for “full cycle” Software Engineers with a passion for optimization and...


  • Plano, United States Hispanic Technology Executive Council Full time

    At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates...


  • Plano, United States Trintech Full time

    Summary This position seeks a versatile technical enthusiast who thrives in the world of technology. The ideal candidate will bridge the gap between business needs and team capabilities, fostering clarity and alignment. They'll engage with internal and external stakeholders, adeptly capturing and articulating requirements while crafting effective solutions....


  • Plano, United States TEKsystems Full time

    TEKsystems Site Reliability Center Lead Pittsburgh, Pennsylvania The client is seeking a Site Reliability Center Lead to do production support, not new development. The individual will troubleshoot highly technical problems which may require assessing source code to analyze and resolve problems. This requires advanced troubleshooting skills and be...


  • Plano, United States ClifyX Full time

    Reliability and Monitoring Engineer Plano, TX office 3 times a week Infosys/Toyot Bill rate: $75 openings - 5 Responsible for ensuring the availability, performance, and reliability of our cloud-based infrastructure and services. The primary focus of this role is designing, implementing, and managing robust monitoring and alerting systems to proactively...