Lead Site Reliability Engineer in Phoenix

3 weeks ago


Phoenix, Arizona, United States | Indotronix International Corporation | Full time
Job Title: Lead Site Reliability Engineer in Phoenix

Indotronix International Corporation is seeking a highly skilled Lead Site Reliability Engineer to join our team in Phoenix, AZ. As a key member of our Hybrid Site Reliability Engineering team, you will be responsible for ensuring the smooth operation of our infrastructure and applications.

Key Responsibilities:
  • Monitor and troubleshoot infrastructure, servers, middleware, databases, and batch jobs.
  • Respond to service requests from business partners and support teams.
  • Create and maintain documentation to ensure knowledge accessibility.
  • Automate and streamline processes using scripts and scheduling tools.
  • Liaise with other application support teams and internal/external business and technical partners.
  • Provide ad hoc and on-demand reports.
  • Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production.
  • Lead problem resolution, conduct root cause analysis, and establish processes that help prevent incidents.
  • Participate in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution, and reporting.
  • Ensure that all production changes are processed according to Change Management policies and procedures.
  • Ensure that appropriate levels of Quality Assurance have been met for all new and existing products.
  • Support Sustained Resiliency, Disaster Recovery, and High Availability events.
  • Help the Level 2 operations team set up monitoring and close gaps in the current monitoring setup.
  • Play a key part in setting up reporting and in applying the Monitor -> Report -> Improve principle.
  • Coordinate incident management staffing to ensure appropriate coverage.
  • Facilitate calls and coordinate communications during critical outages.
  • Handle call documentation, queue management, and ticket analysis, and interface with impacted lines of business for incident impact analysis via the Production Assurance process.
  • Maintain an end-to-end view of issues for objectivity.
  • Influence senior technology leads across organizations to ensure timely resolution of incidents.
  • Problem Management:
    • Participate in RCA activities on client-impacting incidents and ensure they are executed and that action items are assigned and completed.
    • Provide expertise and support during critical incidents, interfacing with all impacted groups to better manage the message.
    • Coordinate and lead work on chronic issues.
    • Guide all involved staff and vendors in driving a coordinated approach to results.
  • Hygiene and Capacity Maintenance:
    • Take responsibility for the data quality of PLM.
    • Work aggressively to ensure all servers meet company standards for uptime, patch level, etc.
    • Work on capacity planning for applications, estimating and analyzing growth rates of vital infrastructure components and adding capacity proactively as required.
    • Understand the code, workflow, and business usage of each application.
    • Understand the DB component of each application.
    • Understand how seasonality affects critical applications.
    • Document known errors and play an important role in knowledge transfer to the Level 1 team.
    • Reduce escalations to Level 3 based on incremental learning about applications.

Requirements:

  • 7+ years of experience in a technical field or relevant work experience.
  • Bachelor's degree in a technical field or relevant work experience preferred; CCNP, CCIE, JNCIS, or JNCIE certifications are nice to have.
  • Strong technical skills, including knowledge of client-server technology, networking basics, database technology, and end-to-end understanding of 3-tier application architecture.
  • Proven experience in incident/problem management with a good understanding of any of the tools used for this purpose.
  • Good understanding of both UNIX and Windows operating systems.
  • Good understanding of web hosting technologies like Apache/Tomcat or other equivalent web/app servers.
  • Good understanding of Big Data & cloud concepts.
  • Good understanding of database technologies like Oracle and SQL.
  • Good understanding of monitoring tools is an added advantage.
  • Solid understanding of the major functionality bundled into a release, both from a technology and business point of view.
  • Strong knowledge of relevant applications and development life cycles.
  • Experience working with geographically distributed and culturally diverse work-groups.
  • Strong desire to learn new technology.

Soft Skills:

  • Excellent communication skills, both verbal and written, with the ability to lead/manage large calls.
  • Comfortable providing clear problem descriptions and guidance to business users in a time-critical environment.
  • Proactive and naturally inquisitive, with a strong bias for action and for continuous improvement of practices and processes.
  • Excellent influence, negotiation, and presentation skills.
  • Experience working with cross-line-of-business teams, Outside Service Providers, and Partner Organizations.
  • Outstanding interpersonal skills and ability to establish strong relationships with all levels of management.
  • Ability to work independently as a self-starter, and within a team environment.

Logistics:

  • 2-step interview process.
  • 1st round with the hiring manager.
  • 2nd round: panel interview with engineering managers.

Indotronix Commitment:

A Safe and Inclusive Workplace - Promoting a Culture of Inclusion, Respect, Equality, and Diversity: Ensuring Safety and Non-Discrimination.

We actively strive to attract, retain, and empower a diverse range of talented individuals, recognizing that diverse perspectives and experiences enhance our collective performance.

Breaking Barriers: Your Potential Knows No Limits. Embrace Your Potential, Apply Today.


