Senior Lead Site Reliability Engineer

3 weeks ago


Dallas, United States JPMorgan Chase Full time

Job Description
Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.
As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the CORPORATE SECTOR in the INFRASTRUCTURE PLATFORMS, Runtime Compute Team, you are deemed as a force multiplier at both a line-of-business and firm wide level. Inspire your peers and the wider product line to deliver durable and resilient products and services to our customers, define firm wide strategies for reliability, and guide and entrust our teams to lead and execute those strategies.
Job responsibilities

  • Provide technical SRE leadership for multiple SRE teams, engineers, and managers throughout Runtime Compute who look to you for advice on the technical issues facing them.
  • You are a key influencer in the Runtime Compute strategic resiliency, observability, and toil reduction planning.
  • You drive continual improvement in resilience, quality of experience, security, monitoring, instrumentation, and automation.
  • You have successfully implemented SRE best practices in high-performance, stable, mission-critical applications with demonstrable positive outcomes.
  • Technologists in Runtime Compute look to you for advice on technical and business issues facing them.
  • You work with your fellow stakeholders to define common NFRs and availability targets for your product line, Runtime Compute, and ensure that SRE is practiced consistently across applications, products, and product lines.
  • You act in a blameless, data-driven manner, show high empathy, emotional intelligence, and can navigate difficult situations with composure and tact.
  • Direct the SRE teams in the product line throughout the lifecycle to help develop software for reliability and scale, ensuring consistency across the product line, and minimal refactoring or changes.
  • Direct the SRE teams in the product line to develop and measure the SLO/SLI for provisioning/deprovisioning, deployments, uptime, and other measures critical to products. Work with business partners to help educate on the product line SLO/SLI.
  • Identify gaps between applicable requirements and current procedures/controls; Drive resolution of mitigating controls. Develop and implement solutions that strengthen business operating models, enhance the client experience, and improve efficiency and controls.
  • Work with business partners to design and implement enhancements to existing processes and/or business applications, introduce new processes and/or toolsets, and engage in process re-engineering.


Required qualifications, capabilities, and skills

  • Formal training or certification on software engineering concepts and 5+ years applied experience with Industry standard Runtime solutions eg Kubernetes and Cloud Foundry.
  • Expertise in at least one technology stack designing, coding, testing and delivering software.
  • Proficiency in one or more technology domains, may be cross-domain expert to able to solve complex and mission critical problems within a business or across the firm. Software development experience in at least one general purpose programming language: Python, Java, C, C++, Go, Shell scripting.
  • Working knowledge infrastructure component ( E.g. Load balancer, cloud platforms and products, container systems, and runtime compute).
  • Excellent debugging and troubleshooting skills.
  • Strong organizational and prioritization skills, detail-oriented and strong interpersonal skills.
  • Be a team player and a leader who shows commitment and dedication, and can maintain a positive attitude and high-level of performance on high-profile/time-sensitive initiatives


Preferred qualifications, capabilities, and skills

  • Experience hiring, developing, and recognizing talent
  • Ability to work in a high paced environment, be flexible, follow tight deadlines, organize and prioritize work
  • Hands-on experience with cloud-based observability technologies and tools especially in deployment, monitoring and operations, such as Data Dog, Prometheus, Splunk, ElasticSearch, Grafana, appdynamics etc
  • Strong working knowledge of modern development technologies and tools such Agile, CI/CD, Git, Terraform and Jenkins
  • Deep knowledge of Internet protocols and web services technologies such as HTTP, DNS, TCP/UDP, SOAP, JSON and REST
  • Good understanding of networking protocols and cybersecurity best practices in cloud environment. Public Cloud certification is preferred
  • AI/ML knowledge is preferred to evaluate and choose models that help with SRE goals including automated root cause analysis, anomaly detection, and real-time insights and analytics into various products.


#LI-RB3
About Us
JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set, and location. For those in eligible roles, we offer discretionary incentive compensation which may be awarded in recognition of firm performance and individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase is an Equal Opportunity Employer, including Disability/Veterans
About the Team
Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we're setting our businesses, clients, customers and employees up for success.



  • Dallas, United States Tekwissen Full time

    Overview: TekWissen Group is a workforce management provider throughout the USA and many other countries in the world. Our client is an American multinational information technology services and consulting company and is a leading provider of information technology, consulting, and business process outsourcing services, dedicated helping the world's leading...


  • Dallas, United States Tekwissen Full time

    Overview: TekWissen Group is a workforce management provider throughout the USA and many other countries in the world. Our client is an American multinational information technology services and consulting company and is a leading provider of information technology, consulting, and business process outsourcing services, dedicated helping the world's leading...


  • Dallas, United States Saxon Global Full time

    Job Summary: We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross development and engineering teams to design and implement tools and processes to automate deployment, observability, and troubleshooting...


  • Dallas, United States Saxon Global Full time

    Job Summary: We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross development and engineering teams to design and implement tools and processes to automate deployment, observability, and troubleshooting...


  • Dallas, United States TekWissen LLC Full time

    Job DescriptionJob DescriptionOverview: TekWissen Group is a workforce management provider throughout the USA and many other countries in the world. Our client is an American multinational information technology services and consulting company and is a leading provider of information technology, consulting, and business process outsourcing services,...


  • Dallas, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Dallas, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Dallas, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Dallas, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Dallas, Texas, United States Cognizant Technology Solutions Full time

    Sr. Site Reliability Engineer (SRE)Cognizant's Digital Engineering practice is seeking a highly qualified Sr. Site Reliability Engineer with 10+ years plus experience developing and building high-performing, scalable, enterprise applications. You will be part of a digital software team that works on high-demand applications. Our engineers have a passion for...


  • Dallas, Texas, United States Cognizant Technology Solutions Full time

    Sr. Site Reliability Engineer (SRE)Cognizant's Digital Engineering practice is seeking a highly qualified Sr. Site Reliability Engineer with 10+ years plus experience developing and building high-performing, scalable, enterprise applications. You will be part of a digital software team that works on high-demand applications. Our engineers have a passion for...


  • Dallas, United States STIAOS Technologies Full time

    We are looking for Site Reliability Engineer for our client location in Dallas TX with following Skills: *Java Spring boot *Kubernetes *eCommerce experience Required. Key Responsbilities: *Working with the Applications, Engineering, Platform, Operations and infrastructure and Cloud teams to ensure we are a premier software delivery organization. *Drive...


  • Dallas, United States STIAOS Technologies Full time

    We are looking for Site Reliability Engineer for our client location in Dallas TX with following Skills: *Java Spring boot *Kubernetes *eCommerce experience Required. Key Responsbilities: *Working with the Applications, Engineering, Platform, Operations and infrastructure and Cloud teams to ensure we are a premier software delivery organization. *Drive...


  • Dallas, United States Veradigm (formerly Allscripts) Full time

    Welcome to Veradigm! Our Mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our Vision is a Connected Community of Health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Dallas, United States Saicon Consultants Full time

    Site Reliability Engineer (Buffer) Location:Dallas, TX Posted On: 11/08/2023 Requirement Code: 66074 Requirement Detail Job Description: Site Reliability Engineer (Buffer) • Bachelor's Degree in Computer Science or related; or equivalent combination of education and experience • 5~~@~~ yrs overall experience in Software Application Development &...


  • Dallas, United States PMG, Inc. Full time

    PMG is a digital company that helps marketers connect people with their brand. Focused on people and grounded in data, our award-winning culture fosters meaningful careers. Partnering with the most iconic brands in the world, we put people at the center of everything we do to deliver value, innovation, and business transformation. WHO WE ARE Agile....


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States JPMorgan Chase Full time

    Job Description DESCRIPTION:Duties: Develop creative engineering solutions to operations problems by combining software and systems engineering approaches. Lead problem resolutions for aligned lines of business (LOB), providing end-to-end resolution to ensure completion of all jobs within defined objectives. Troubleshoot priority incidents, identify Root...


  • Dallas, United States Saxon Global Full time

    As a member of the Production Support/SRE team you will work cross-functionally amongst a variety of teams and be a core contributor in every significant engineering service or solution that we deliver to our stakeholders. You'll excel if you have enthusiasm for digging deep, and a flare for technical communication, prioritization . You will work directly...


  • Dallas, United States Diverse Lynx Full time

    Description: Role: Director - Site Reliability Engineering Location: Dallas, TX (Day 1 Onsite) Long term role Job Description: Domain : Telecommunications Lead strategy and development of SRE functions for consumer applications including Digital, Assisted and Marketing Technology systems to implement best in class SRE practices to improve the reliability of...