See more Collapse

Senior Site Reliability Engineer

1 month ago


Phoenix, United States Indotronix International Corporation Full time

Job Title: IT - Site Reliability Engineer Sr – Contractor

Location: Phoenix, AZ – Hybrid (3 Days A week in Office)

Duration: 03 Months Contract To Hire

Only W2

Targeting a 6/24 start date

Must be willing to work EST for a few months then it may change to MST - 8-5 EST

M-F 40 hours

OT: Should be able to provide 24/7 on-call support - will be on a monthly rotation basis

Travel: potentially but rare


Job Description

Which project will this need be supporting?

SRC Special Projects


Organizational Structure And Impact:

Impact/Function this role has within the bank/LOB:

SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to come up with policies and procedures, holding product vendors and the SRE tools team accountable


Team Background and Preferred Candidate History:

Candidate preferred industry background: Technology background required - banking nice to have - knowledge of SRE concepts required


Key responsibilities:

  • Monitor infrastructure, servers, middleware, databases, and batch jobs.
  • Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc.
  • Troubleshoot environment, data control and operational issues.
  • Create and Maintain documentation to ensure knowledge accessibility.
  • Automate and streamline process using scripts and scheduling tools.
  • Liaise with other application support teams and internal/external business and technical partners.
  • Provide ad hoc and on-demand reports.
  • Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production.
  • Lead problem resolution and conduct root cause analysis and establish processes that will help incident prevention.
  • Participates in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution and reporting.
  • Ensures that all production changes are processed according to Change Management policies and procedures.
  • Ensures that appropriate levels of Quality Assurance have been met for all new and existing products.
  • Support Sustained Resiliency, Disaster Recovery, and High Availability events.
  • Help Level 2 operation team with setting up monitoring and bridging the gaps in current monitoring setup.
  • Play key part in setting up reporting and be a key component in Monitor -> Report -> Improve principle
  • Coordinate incident management coverage, to ensure appropriate coverage.
  • Call facilitation, coordination and communications during critical outage situations.
  • Call documentation, queue management, ticket analysis and interface to impacting lines of business for incident impact analysis via the Production Assurance process.
  • End to end view of issues for objectivity.
  • Influence senior technology leads across organizations to ensure timely resolution of incidents
  • Problem Management:
  • Participate and ensure RCA (root cause analysis) activities on client impacting incidents are executed and action items are assigned / completed.
  • Provide expertise and support during critical incidents, interfacing with all impacted groups to better manage the message.
  • Chronic issue coordination and leadership.
  • Guidance to all staff involved and vendors in driving a coordinated approach for results.
  • Hygiene and Capacity Maintenance:
  • Responsible for data quality of PLM.
  • Work aggressively to make sure all servers are up to company standards as per uptimes, patch level etc.
  • Work on Capacity planning for applications, estimating and analyzing growth rates of vital infrastructure components and adding capacity pro actively as and when required.
  • Understand application code, work flow and business usage of application.
  • Understand DB component of application.
  • Understand the impacts of application based on seasonality of critical applications.
  • Document known errors and play important role in Knowledge transfer to Level 1 team.
  • Reduce escalations to Level 3 based on incremental learning about applications.


Must have technical skills/experience:

  • SRE - Network Engineering & Architecture
  • Technical Project management
  • Deep Understanding of Networking Protocols, security, switching & routing, wireless, VoIP, cloud networking, network management and monitoring
  • Understanding of SRE concepts and a proven experience working on automation or application development using any programming language.
  • Solid technical skills including knowledge of client server technology, networking basics, database technology, end to end understanding of 3-tier application architecture (frontend - application server - database).


Flex Skills:

  • Proven experience in incident/problem management with a good understanding of any of the tools used for this purpose.
  • Good understanding of both UNIX and Windows operating systems
  • Good understanding of web hosting technologies like Apache / Tomcat or other equivalent web/app servers.
  • Good understanding of Big Data & cloud concepts.
  • Good understanding of database technologies like ORACLE and SQL.
  • Good understanding of monitoring tools is an added advantage.
  • Solid understanding of the major functionality bundled into a release, both from a technology and business point of view.
  • Strong knowledge of relevant applications and development life cycles.
  • Experience working with geographically distributed and culturally diverse work-groups.
  • Strong desire to learn new technology.


Soft Skills:

  • Excellent communication skills, both verbal and written, with the ability to lead/manage large conference calls.
  • Comfortable providing clear problem descriptions and guidance to business users in a time critical environment.
  • Ability to be proactive with a strong bias for action, naturally inquisitive, and bias for continuous improvement of practices / processes.
  • Excellent influence, negotiation and presentation skills.
  • Experience in working with cross line of business teams, Outside Service Providers and Partner Organizations.
  • Outstanding interpersonal skills and ability to establish strong relationships with all levels of management.
  • Ability to work independently as a self-starter, and within a team environment.


Preferred bachelor's degree in a technical field or relevant work experience, certifications; CCNP | CCIE | JNCIS | JNCIE nice to have


Years of experience: Level 3 5-7 years


Role Differentiator: What is different about this specific role compared to other hiring with the same skill set?

Opportunity for growth, AZ tech hub is growing rapidly and is a good area to grow a candidates career


Interviews Process:

2 step interview

1st round with HM

2nd round panel ITV with engineering managers


We have other current jobs related to this field that you can find below


  • Phoenix, United States Motion Recruitment Full time

    Senior Site Reliability Engineer (SRE)Location: Phoenix, AZ, 85050 (Hybrid- 3 days onsite)Term: 06+ Months Contract (with a possible extension)Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer (SRE) to lead our release processes, manage infrastructure incidents, and optimize our CI/CD pipelines. The ideal candidate will have a deep...


  • Phoenix, United States Motion Recruitment Full time

    Senior Site Reliability Engineer (SRE)Location: Phoenix, AZ, 85050 (Hybrid- 3 days onsite)Term: 06+ Months Contract (with a possible extension)Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer (SRE) to lead our release processes, manage infrastructure incidents, and optimize our CI/CD pipelines. The ideal candidate will have a deep...


  • Phoenix, United States Charles Schwab Full time

    Your Opportunity At Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us "challenge the status quo" and transform the finance industry together. Charles Schwab's Technology Services Technical Manager's thrive in a leading-edge work culture while focusing on products that help Schwab...


  • Phoenix, United States Indotronix International Corporation Full time

    Indotronix is seeking a Hybrid-Site Reliability Engineer Senior in Phoenix - Biltmore location Position: Site Reliability Engineer Senior Location: Hybrid in Phoenix - Biltmore (3 days in office) Hybrid 3 days min in office/week Duration: Contract to hire Organizational Structure And Impact: Impact/Function this role has within the bank/LOB: SRC site...


  • Phoenix, United States Cloud BC Labs Full time

    Job DescriptionJob DescriptionPOSITIONSite Reliability EngineerLOCATIONHybrid- Phoenix, AZ (locals only)DURATION5+ Months possible ext or CTHINTERVIEW TYPEVideoVISA RESTRICTIONSMust convert perm without sponsorshipREQUIRED SKILLSExperience leading onshore/offshore teamsHands on building/troubleshooting experienceTransitioned from Prometheus to Mimir; Grafana...


  • Phoenix, United States Motion Recruitment Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and performance of our critical systems and services. The SRE will bridge the gap between development and operations, focusing on automation, monitoring, and incident management to maintain high service availability and seamless software...


  • Phoenix, United States Indotronix Avani Group Full time

    Position: Site Reliability Engineer Sr Hybrid in Phoenix , AZ (3 days in office)Targeting a 7/1 start date6 Months with a possibility of extension/contract to hireManager will only look at candidates that are open to converting to a full time employee. F-M 40 hours7PM-7AM Saturdays/Sundays MST - F and M 11PM-7AM MSTM- F 40 hours8-5 MSTOnly W2Impact/Function...


  • Phoenix, United States Indotronix Avani Group Full time

    Position: Site Reliability Engineer Sr Hybrid in Phoenix , AZ (3 days in office) Targeting a 7/1 start date 6 Months with a possibility of extension/contract to hire Manager will only look at candidates that are open to converting to a full time employee. F-M 40 hours 7PM-7AM Saturdays/Sundays MST - F and M 11PM-7AM MST M- F 40 hours 8-5 MST Only W2 ...


  • Phoenix, Arizona, United States Cloud BC Labs Full time

    Job DescriptionJob DescriptionPOSITIONSite Reliability Engineer (SRE)LOCATIONHybrid Phoenix, AZDURATION6 MonthsINTERVIEW TYPEVideoVISA RESTRICTIONSNoneREQUIRED SKILLSExperience leading onshore/offshore teamsHands on building/troubleshooting experienceTransitioned from Prometheus to Mimir; Grafana is still a must-haveSite Reliability/Observability dev...


  • Phoenix, United States Insight Global Full time

    Position: LEAD Site Reliability Engineer (SRE) Location: Phoenix, AZ (Hybrid 3X Per Week) Pay Range: $60-$70 an hour + Benefits Duration: 6 month contract to hire Desired Qualifications: -4-year degree (Computer Science, Information Systems, or relational functional field) and/or equivalent combination of education or work experience. **A DEGREE IS...


  • Phoenix, United States Insight Global Full time

    Position: LEAD Site Reliability Engineer (SRE)Location: Phoenix, AZ (Hybrid 3X Per Week)Pay Range: $60-$70 an hour + BenefitsDuration: 6 month contract to hireDesired Qualifications: -4-year degree (Computer Science, Information Systems, or relational functional field) and/or equivalent combination of education or work experience. **A DEGREE IS ABSOLUTELY...


  • Phoenix, United States Insight Global Full time

    Position: LEAD Site Reliability Engineer (SRE)Location: Phoenix, AZ (Hybrid 3X Per Week)Pay Range: $60-$70 an hour + BenefitsDuration: 6 month contract to hireDesired Qualifications: -4-year degree (Computer Science, Information Systems, or relational functional field) and/or equivalent combination of education or work experience. **A DEGREE IS ABSOLUTELY...


  • Phoenix, United States TWO95 International Full time

    Title: Site Reliability Engineer Location: Phoenix, AZ Job Type: Full Time Minimum Qualifications •BS or MS degree in computer science, computer engineering, or other technical discipline, or equivalent 3-6 years of work experience in DevOps - Java/J2EE/REACT JS applications •2+ years of hands on experience on configuring Splunk dashboards, Alerts setup...


  • Phoenix, United States Indotronix International Corporation Full time

    Indotronix is seeking a Hybrid- Site Reliability Engineer in Phoenix (AZ) location Position: Site Reliability Engineer Location: Hybrid in Phoenix (AZ) (3 days in office) Hybrid 3 days min in office/week Duration: Contract to hire Must be willing to work EST for a few months then it may change to MST - 8-5 EST M-F 40 hours Organizational Structure...


  • Phoenix, United States Indotronix International Corporation Full time

    Indotronix is seeking a Hybrid- Site Reliability Engineer in Phoenix (AZ) location Position: Site Reliability Engineer Location: Hybrid in Phoenix (AZ) (3 days in office) Hybrid 3 days min in office/week Duration: Contract to hire Must be willing to work EST for a few months then it may change to MST - 8-5 EST M-F 40 hours Organizational Structure...


  • Phoenix, United States American Express Full time

    American Express Director - Site Reliability Engineering Phoenix , Arizona Apply Now With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here,...


  • Phoenix, United States Mastech Digital Full time

    Mastech Digital Inc. is a (certified) Minority owned business certified by NMSDC. Public traded firm under MHH at NYSE, Established in 1986. Headquartered in Pittsburgh, PA our operations are spread across 11 Global Recruiting & Sales offices across US.Role: Site Reliability EngineerLocation: Pheonix AZDuration: FulltimeMust have:SRE - Network Engineering &...


  • Phoenix, United States Mastech Digital Full time

    Mastech Digital Inc. is a (certified) Minority owned business certified by NMSDC. Public traded firm under MHH at NYSE, Established in 1986. Headquartered in Pittsburgh, PA our operations are spread across 11 Global Recruiting & Sales offices across US.Role: Site Reliability EngineerLocation: Pheonix AZDuration: FulltimeMust have:SRE - Network Engineering &...


  • Phoenix, Arizona, United States Expert In Recruitment Solutions Full time

    Job Title: Site Reliability Engineer (SRE) Location: This is a hybrid onsite position, worker is required to work onsite 2-3 days per week in Phoenix, AZ.Hybrid Onsite: Worker is required to work onsite 3 days per week in Phoenix, AZ as they will be working cross functionally with 3 different teams.MAIN RESPONSIBILITIES" Experience in leading Observability...


  • Phoenix, United States Cloud BC Labs Full time

    Job DescriptionJob DescriptionPOSITIONSite Reliability Engineer (SRE)LOCATIONHybrid Phoenix, AZDURATION6 MonthsINTERVIEW TYPEVideoVISA RESTRICTIONSNoneREQUIRED SKILLSExperience leading onshore/offshore teamsHands on building/troubleshooting experienceTransitioned from Prometheus to Mimir; Grafana is still a must-haveSite Reliability/Observability dev...