Lead Application Reliability Engineer
3 days ago
Overview of the Company:
Citi, the leading global bank, has approximately 200 million customer accounts and does business in more than 160 countries and jurisdictions. Citi provides consumers, corporations, governments, and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and investment banking, securities brokerage, transaction services, and wealth management.
As a bank with a brain and a soul, Citi creates economic value that is systemically responsible and in our clients' best interests. As a financial institution that touches every region of the world and every sector that shapes your daily life, our Enterprise Operations & Technology teams are charged with a mission that rivals any large tech company. Our technology solutions are the foundations of everything we do from keeping the bank safe, managing global resources, and providing the technical tools our workers need to be successful to designing our digital architecture and ensuring our platforms provide a first-class customer experience. We reimagine client and partner experiences to deliver excellence through secure, reliable, and efficient services.
Our commitment to diversity includes a workforce that represents the clients we serve from all walks of life, backgrounds, and origins. We foster an environment where the best people want to work. We value and demand respect for others, promote individuals based on merit, and ensure opportunities for personal development are widely available to all. Ideal candidates are innovators with well-rounded backgrounds who bring their authentic selves to work and complement our culture of delivering results with pride. If you are a problem solver who seeks passion in your work, come join us. We'll enable growth and progress together.
Overview of the Role:
The selected candidate will become the key engineer supporting and advancing the platform used for the threat-modeling process at Citi. Responsibilities include, among others, maintaining and supporting the threat-modeling application and developing relevant tools used throughout the threat-modeling process. The application comprises web servers and backend databases; supporting it requires an understanding of middleware, databases, containers, and the AWS cloud environment, as well as change-control and compliance processes.
We are seeking a highly skilled and dedicated Lead Application Reliability Engineer to ensure the continuous availability, optimal performance, and security of a critical threat-modeling application. This role is central to our operational excellence, involving comprehensive support and maintenance of a robust technology stack including middleware, databases, Linux, and AWS EKS, all within a strictly regulated and change-controlled financial environment. The ideal candidate will leverage modern DevOps principles to drive stability and efficiency.
Responsibilities:
Ensure high availability and optimal performance of the threat-modeling application through proactive monitoring, incident management, and efficient troubleshooting.
Perform routine and emergency application and infrastructure maintenance, including patching, upgrades, and configuration management, adhering strictly to change control procedures.
Conduct root cause analysis (RCA) for production incidents and implement preventative measures to minimize future occurrences.
Develop and maintain automation scripts and tools (e.g., using Python, Bash) to streamline operational tasks, improve monitoring, and facilitate efficient deployments.
Proactively identify, recommend, and implement enhancements to existing application maintenance practices, operational workflows, and system reliability.
Serve as a technology subject matter expert for internal and external stakeholders, contributing to technology domain roadmaps and firm-mandated controls and compliance initiatives.
Appropriately assess and mitigate risk in all technical decisions, ensuring compliance with applicable laws, rules, regulations, and internal policies, while escalating and reporting control issues with transparency.
Present technical work to senior stakeholders, the team, and other technical teams.
Mentor and train junior team members, fostering a culture of knowledge sharing and continuous improvement.
Qualifications:
6+ years of relevant experience in an Engineering role, preferably in Financial Services or a large, complex, and/or global environment.
Experience managing and troubleshooting Linux Operating Systems (e.g., Red Hat Enterprise Linux (RHEL), CentOS, Ubuntu), including System Administration Tasks like User Management, Service Restarts, and File System Checks – Must Have.
Proficiency in Scripting for Automation (e.g., Bash, Python) and with Configuration Management Tools (e.g., Ansible, Puppet, Chef) for system administration and infrastructure automation – Must Have.
Experience with container orchestration using Helm and Kubernetes on platforms like AWS EKS, GCP GKE, or OpenShift – Must Have.
Working knowledge of Relational Databases (e.g., PostgreSQL), including basic querying – Must Have.
Proven track record of maintaining applications and their technology stacks compliant with security and configuration requirements, including successfully passing internal and external security audits by demonstrating secure configuration of applications and infrastructure (e.g., implementing least privilege access, hardening OS, managing firewall rules) and ensuring continuous compliance with regulatory standards (e.g., SOX, GDPR) through automated checks and reporting – Must Have.
Demonstrated adherence to strict change control procedures, executing all changes (e.g., code deployments, infrastructure updates) through a formalized change management process (e.g., ITSM, ServiceNow) with proper documentation and approvals – Must Have.
Experience with Ticketing Systems (e.g., Jira, ServiceNow) – Must Have.
Working understanding of Middleware Components (e.g., Nginx, Tomcat or equivalents).
Familiarity with Development Concepts (e.g., Git, CI/CD, Pipelines, SDLC).
Strong communication skills, both written and verbal, for technical and non-technical audiences.
Demonstrated analytical and diagnostic skills, with an ability to identify process improvements and best practices.
Ability to work independently, manage multiple tasks, take ownership of initiatives, and operate effectively in a matrixed environment under pressure and tight deadlines.
Associate-Level Certification Required (at least one of the following):
Kubernetes and Cloud Native Associate (KCNA), Certified Kubernetes Application Developer (CKAD), Certified Kubernetes Administrator (CKA), Kubernetes and Cloud Native Security Associate (KCSA)
Red Hat Certified System Administrator or similar certification
AWS Certified Developer, AWS Certified SysOps Administrator
CompTIA Cloud+
Google Associate Cloud Engineer or other GCP certification
HashiCorp Certified: Terraform Associate
Associate Cybersecurity Certification (not required, but any of the following would be a plus):
GIAC Security Essentials (GSEC)
ISC2 Systems Security Certified Practitioner (SSCP)
CompTIA CySA+
Microsoft Certified: Security Operations Analyst Associate; Information Protection Administrator Associate
Education:
Bachelor's degree/University degree or equivalent experience
Job Family Group:
Technology
Job Family:
Systems & Engineering
Time Type:
Full time
Primary Location:
Irving, Texas, United States
Primary Location Full Time Salary Range:
$125,000.00 - $188,640.00
In addition to salary, Citi's offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit Available offerings may vary by jurisdiction, job level, and date of hire.
Most Relevant Skills
Please see the requirements listed above.
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
Anticipated Posting Close Date:
Nov 03, 2025
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi's EEO Policy Statement and the Know Your Rights poster.