SRE - Network Engineering & Architecture

1 month ago


Phoenix, United States Mastech Digital Full time

Organizational Structure And Impact:

Impact/Function this role has within the bank/LOB:

SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to come up with policies and procedures, holding product vendors and the SRE tools team accountable

Team Background and Preferred Candidate History:

Candidate preferred industry background: Technology background required - banking nice to have - knowledge of SRE concepts required

Candidate Technical and skills profile:

Key responsibilities:

• Monitor infrastructure, servers, middleware, databases, and batch jobs.

• Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc.

• Troubleshoot environment, data control and operational issues.

• Create and Maintain documentation to ensure knowledge accessibility.

• Automate and streamline process using scripts and scheduling tools.

• Liaise with other application support teams and internal/external business and technical partners.

• Provide ad hoc and on-demand reports.

• Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production.

• Lead problem resolution and conduct root cause analysis and establish processes that will help incident prevention.

• Participates in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution and reporting.

• Ensures that all production changes are processed according to Change Management policies and procedures.

• Ensures that appropriate levels of Quality Assurance have been met for all new and existing products.

• Support Sustained Resiliency, Disaster Recovery, and High Availability events.

• Help Level 2 operation team with setting up monitoring and bridging the gaps in current monitoring setup.

• Play key part in setting up reporting and be a key component in Monitor -> Report -> Improve principle

• Coordinate incident management coverage, to ensure appropriate coverage.

• Call facilitation, coordination and communications during critical outage situations.

• Call documentation, queue management, ticket analysis and interface to impacting lines of business for incident impact analysis via the Production Assurance process.

• End to end view of issues for objectivity.

• Influence senior technology leads across organizations to ensure timely resolution of incidents

• Problem Management:

• Participate and ensure RCA (root cause analysis) activities on client impacting incidents are executed and action items are assigned / completed.

• Provide expertise and support during critical incidents, interfacing with all impacted groups to better manage the message.

• Chronic issue coordination and leadership.

• Guidance to all staff involved and vendors in driving a coordinated approach for results.

• Hygiene and Capacity Maintenance:

• Responsible for data quality of PLM.

• Work aggressively to make sure all servers are up to company standards as per uptimes, patch level etc.

• Work on Capacity planning for applications, estimating and analyzing growth rates of vital infrastructure components and adding capacity pro actively as and when required.

• Understand application code, work flow and business usage of application.

• Understand DB component of application.

• Understand the impacts of application based on seasonality of critical applications.

• Document known errors and play important role in Knowledge transfer to Level 1 team.

• Reduce escalations to Level 3 based on incremental learning about applications.

Must have technical skills/experience (ask for alternative/tool/version):

- SRE - Network Engineering & Architecture

- Technical Project management

- Deep Understanding of Networking Protocols, security, switching & routing, wireless, voip, cloud networking, network management and monitoring

- Understanding of SRE concepts and a proven experience working on automation or application development using any programing language.

- Solid technical skills including knowledge of client server technology, networking basics, database technology, end to end understanding of 3-tier application architecture (frontend - application server - database).



  • Phoenix, United States Mastech Digital Full time

    Organizational Structure And Impact: Impact/Function this role has within the bank/LOB: SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to...


  • Phoenix, United States Mastech Digital Full time

    Organizational Structure And Impact: Impact/Function this role has within the bank/LOB: SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to...

  • SRE Engineer w Chaos

    2 months ago


    Phoenix, United States eTeam Full time

    Required skills: Required Experience: Hands on experience as SRE Engineer Good Java coding Skills and has experience building tools and frameworks using front-end and backend technologies Proven Hands-on experience working with modern container services (Docker/Kubernetes) Proven Hands-on experience working with Web Services and Databases Strong...

  • SRE Engineer w Chaos

    3 weeks ago


    Phoenix, United States eTeam Full time

    Required skills: Required Experience: Hands on experience as SRE Engineer Good Java coding Skills and has experience building tools and frameworks using front-end and backend technologies Proven Hands-on experience working with modern container services (Docker/Kubernetes) Proven Hands-on experience working with Web Services and Databases Strong...

  • SRE Engineer w Chaos

    4 weeks ago


    Phoenix, United States eTeam Full time

    Required skills: Required Experience: Hands on experience as SRE Engineer Good Java coding Skills and has experience building tools and frameworks using front-end and backend technologies Proven Hands-on experience working with modern container services (Docker/Kubernetes) Proven Hands-on experience working with Web Services and Databases Strong...


  • Phoenix, United States TEKsystems Full time

    Description: Monitor infrastructure, servers, middleware, databases, and batch jobs. • Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc. • Troubleshoot environment, data control and operational issues. • Create and Maintain documentation to ensure knowledge...


  • Phoenix, Arizona, United States TEKsystems Full time

    *Description:* Monitor infrastructure, servers, middleware, databases, and batch jobs. Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc. Troubleshoot environment, data control and operational issues. Create and Maintain documentation to ensure knowledge accessibility. Automate...


  • Phoenix, United States TEKsystems Full time

    *Description:* Monitor infrastructure, servers, middleware, databases, and batch jobs. Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc. Troubleshoot environment, data control and operational issues. Create and Maintain documentation to ensure knowledge accessibility. Automate...


  • Phoenix, United States TEKsystems Full time

    *Description:* Monitor infrastructure, servers, middleware, databases, and batch jobs. Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc. Troubleshoot environment, data control and operational issues. Create and Maintain documentation to ensure knowledge accessibility. Automate...

  • Network Architect

    1 month ago


    Phoenix, United States Mastech Digital Full time

    Organizational Structure And Impact: Impact/Function this role has within the bank/LOB: SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to...

  • Network Architect

    4 weeks ago


    Phoenix, United States Mastech Digital Full time

    Organizational Structure And Impact: Impact/Function this role has within the bank/LOB: SRC site reliability center supports all infrastructure within the bank, responsible for 24/7 operations of enterprise support, all technology and day to day operations - keeps the bank running, process pillars and observability, using tools and needs product owner to...


  • Phoenix, United States Indotronix Avani Group Full time

    Position: Site Reliability Engineer Sr Hybrid in Phoenix , AZ (3 days in office)Targeting a 7/1 start date6 Months with a possibility of extension/contract to hireManager will only look at candidates that are open to converting to a full time employee. F-M 40 hours7PM-7AM Saturdays/Sundays MST - F and M 11PM-7AM MSTM- F 40 hours8-5 MSTOnly W2Impact/Function...


  • Phoenix, United States Indotronix Avani Group Full time

    Position: Site Reliability Engineer Sr Hybrid in Phoenix , AZ (3 days in office) Targeting a 7/1 start date 6 Months with a possibility of extension/contract to hire Manager will only look at candidates that are open to converting to a full time employee. F-M 40 hours 7PM-7AM Saturdays/Sundays MST - F and M 11PM-7AM MST M- F 40 hours 8-5 MST Only W2 ...


  • Phoenix, United States Indotronix Avani Group Full time

    Position: Site Reliability Engineer Sr Hybrid in Phoenix , AZ (3 days in office)Targeting a 7/1 start date6 Months with a possibility of extension/contract to hireManager will only look at candidates that are open to converting to a full time employee. F-M 40 hours7PM-7AM Saturdays/Sundays MST - F and M 11PM-7AM MSTM- F 40 hours8-5 MSTOnly W2Impact/Function...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027) Hybrid: Onsite, 3 days/week Contract Duration: 6 months contract-to-hire Day to Day: A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027) Hybrid: Onsite, 3 days/week Contract Duration: 6 months contract-to-hire Day to Day: A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027)Hybrid: Onsite, 3 days/weekContract Duration: 6 months contract-to-hireDay to Day:A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027)Hybrid: Onsite, 3 days/weekContract Duration: 6 months contract-to-hireDay to Day:A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027)Hybrid: Onsite, 3 days/weekContract Duration: 6 months contract-to-hireDay to Day:A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...

  • Principle Lead SRE

    4 weeks ago


    Phoenix, United States Insight Global Full time

    Role: Principle Lead SRE Location: Phoenix, AZ (85027)Hybrid: Onsite, 3 days/weekContract Duration: 6 months contract-to-hireDay to Day:A large retail enterprise in Phoenix, AZ is looking for a SRE Engineer to help lead observability initiatives and assist in the development and implementation of build release pipelines with accountability for managing...