Site Reliability Engineer
3 days ago
Site Reliability Engineer in Wealth Management
Chicago (IL) / Tempe (AZ)
Onsite Job
ROLE:
This role will be Responsible for application observability, maintenance, and support, identifying and
implementing preventive measures proactively, evaluates and makes recommendation on techniques,
practices, or technologies that would enhance business needs. As a SRE associate you will collaborate
with Application Support and Development teams to implement business solution through agile practice
and manage production issues.
The ideal candidates will possess excellent leadership and communication skills coupled with a solid
understanding of modern cloud technologies preferably in the financial sector.
PRINCIPAL RESPONSIBILITIES:
• Lead production stability effort by preventing production issue and improve production
stability.
• Defining and enforcing Service Level Objectives (SLOs) and Agreements (SLAs), Error Budgets to
guarantee system reliability and availability.
• Attention to key performance indicators, such as response times, error rates, and uptime to align
operational performance with overarching business objectives.
• Proactively identify continuous improvement opportunities such as reducing manual effort,
automation of tasks/resolutions or decreasing production incidents.
• Involving in defining and deploying monitoring, metrics, and logging systems and developing
application dashboards.
• Ensure near-zero downtime with monitoring and alerting, self-healing automation, and
continuous improvement.
• Provide reactive, break-fix support and Communicate issue/resolution status (written and
verbal) to project team and management.
• Develop to become a Subject Matter Expert for assigned application domain.
• Should be able to interpret the alerts like SiteScope, Dynatrace and ELK etc. & refer to it while
doing the RCA of the issue.
• He Should be flexible for upskilling to new tech stack & should be ready to do hands on
development.
• Provide regular and high-quality updates to all the stakeholders on the progress of the work
including user stories and ITSM problems.
• Attend regular meetings with Project/Development teams to discuss production issues for
prioritization, fixes, and release.
SKILLS / EXPERIENCE:
• 5-6 plus years of application development experience using modern technologies and
architecture, including experience collaborating with technology teams.
• 2 plus years of Site Reliability Engineering experience.
• Good Understanding of at least one public cloud, preferably Microsoft Azure/ Pivotal Cloud
Foundry.
• Strong understanding of REST APIs and how to use them in practice.
• Strong Experience with continuous integration and collaboration tools like Azure DevOps, JIRA,
Bitbucket, GitHub and Confluence.
• Good knowledge and Hands on CLI - Bash, Linux, Azure CLI etc.,
• Experience in some of the following technologies: Java, J2EE, Pivotal Cloud Foundry, Cloud
Computing (IaaS, PaaS, and SaaS), RESTful interfaces, GIT, Gradle, Maven, NPM, Spring (Spring
Batch and Spring Boot), CSS3, HTML4, React.js, Node.js/JavaScript, Oracle PL/SQL, and Kafka.
NTAC:3NS-20
• Strong communication and interpersonal skills, along with a solid technical background are
essential as is the ability to multitask in a fast-paced environment. The ideal candidate can
explain technical issues in layman's terms and translate business needs to technology teams and
back.
• Experience working effectively with diverse groups around the world, including IT management,
technology staff, business partners, consultants, vendors, and clients.
-
Site Reliability Engineer
22 hours ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
4 days ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
1 week ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
1 week ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
2 weeks ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
4 days ago
Chicago, IL, United States Request Technology Full time***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...
-
Site Reliability Engineer
1 week ago
Chicago, IL, United States Request Technology Full time***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...
-
Site Reliability Engineer
22 hours ago
Chicago, IL, United States Request Technology Full time***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...
-
Site Reliability Engineer
4 days ago
Chicago, IL, United States CADDi Full timeFor security reasons, the candidate must be a US Citizen, or a Permanent Resident (Green Card) Overview As a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform with special attention to safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. You'll have strong ownership of...
-
Site Reliability Engineer
3 days ago
Chicago, IL, United States CADDi Full timeFor security reasons, the candidate must be a US Citizen, or a Permanent Resident (Green Card) Overview As a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform with special attention to safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. You'll have strong ownership of...