Cloud Senior Site Reliability Engineer

4 weeks ago


Jersey City, New Jersey, United States Bank of America Full time
Job Description:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day.

One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We're devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.

Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.

Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us

Senior Site Reliability Engineering, Hybrid Cloud Container Platform, Enterprise Cloud Platforms

About Bank of America - Global Technology:

Global Technology delivers technology services globally across the bank's eight lines of business that serve individuals, companies, and institutions. The team also focuses on digital banking, payments, infrastructure, data management and technology that enhances cyber security, and risk and capital management. Innovation is at the heart of all Global Technology does.

Enterprise Cloud Platforms Team:

Enterprise Cloud Platforms team in the CTO organization offers Private and Public Cloud platforms for Bank of America's developers to drive faster time-to-market, innovation with private and public cloud capabilities, and reduce complexity with bult-in integrations. We believe in high quality engineering culture to engineer our platforms with customer and platform mindset, design for large enterprise scale and resilience, and accelerate market innovation into the technical platforms we deliver.

As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.

We are seeking an experienced Senior Cloud Site Reliability Engineer (SRE) to support and administration of our Hybrid Cloud Container (OpenShift /AKS) platform.

Our Cloud Service Reliability Engineers (cSREs) ensure that our Cloud services meet the reliability and uptime requirements of our demanding enterprise customers. This is achieved with, the best engineering practices and resilient design and through a well-defined and effective global on-call rotation that runs 24x7.

The role provides opportunity to work with wide range of technologies and unique perspective on how various services (on-prem/off-prem) interact with each other. You will work with colleagues that are as smart, hardworking, and driven as you. You will get an opportunity to work in a team that keeps growing, innovating, and giving you room to be proactive and creative.

Are you ready for the next step in your career? Then we'd love to hear from you

Position Summary:
  • Responsible for reliability and support of Container PaaS Platform on-prem/off-prem (Azure /AWS /Google)
  • Monitor and troubleshoot Container PaaS platform (Openshift) and Azure (AKS) environment performance issues, connectivity issues, security issues, etc.
  • Perform deep dives into systemic and latent reliability issues, Incident management, problem management
  • Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.
  • Perform blameless RCA, partner with engineering and operation teams across the organization to roll out fixes.
  • Identify and drive opportunities to improve automation for the PaaS services; scope and create automation for deployment, management, and visibility of our services.
  • Evaluating and automating the scaling and capacity requirements within PaaS environments
  • Partner with risk, and compliance teams to bring visibility and implement right controls and policies in the PaaS Platform
  • Ensure resiliency during implementation and identify/fix resiliency problems by collaborating with engineering teams
  • Be a key stakeholder in the design of cloud services and work with Architecture, engineering, product teams
  • Participate in 24x7 on-call coverage follow the sun model


Required Skills:
  • BS /MS degree in Computer Science or related technical field involving systems or equivalent practical experience.
  • Minimum 8+ years of hands-on experience supporting Kubernetes /Openshift / Container PaaS platform
  • Experience with Python, Ansible and shell scripting
  • Kubernetes /Openshift /Terraform certifications are a plus
  • Strong experience in major services related to Compute, Storage, Network and Security
  • Experience with monitoring tools like Prometheus and Dynatrace, as well as cloud native tools like Azure Monitor and Log Analytics
  • Strong understanding and background of working with a complex Active Directory and IAM controls
  • Advanced knowledge of DNS, DHCP, Kerberos and Windows Authentication
  • Experience with CI/CD tools git /Jenkins, GitOps model
  • Excellent understanding of Linux /Windows operating systems administration
  • Systematic problem-solving approach, sense of ownership and drive
  • Ability to juggle competing priorities and adapt to changes in project scope.
  • Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
  • Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities.


Desired Job Skills:
  • Experience in Openshift, managed Kubernetes services such as AKS, EKS, or GKE
  • Experience in Terraform, ArgoCD, Tekton, and K-native technologies
  • Experience in agile deployment methodologies (GitOps)
  • Knowledge of various container runtimes
  • Familiarity with the operator deployment pattern.
  • Experience working in a highly available multi-datacenter environment
  • Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools.
  • Understanding of cost management, inventory management, FinOps model


Shift:
1st shift (United States of America)

Hours Per Week:
40

  • Jersey City, New Jersey, United States Devexperts Full time

    Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Jersey City, New Jersey, United States tapwage Full time

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets /Aumni (A JP Morgan Chase Company), you will solve complex...


  • Jersey City, New Jersey, United States BAE Systems Full time

    Job Description This job is a Hybridposition, spending 50% of their time working out of BAE Systems' 65 River Road Location. The Cloud Datacenter IT Admin will play a crucial role in managing and supporting our Virtual Cloud Computing Center (VC3). The role requires someone with a deep understanding of datacenter management, automation, cloud services and...


  • Jersey City, New Jersey, United States S & N Invent AG Full time

    Als Cloud Software Engineer in Münster konzeptionieren, designen und implementieren Sie cloud-native Applikationen, Services und Funktionen in einem agilen UmfeldSie haben eine DevOps-Mentalität und sind offen für die Zusammenarbeit in Ihrem Team und über Ihr Team hinaus, auch für Aufgaben/Tools, die über das Programmieren hinausgehen wie Deployment,...


  • Jersey City, New Jersey, United States ATR International Full time

    Job Description:We are seeking a Salesforce Marketing Cloud Engineer for a very important client. Executes email marketing solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems Setup, customize and develop Salesforce Marketing Cloud and...

  • AWS Cloud Engineer

    1 month ago


    Jersey City, New Jersey, United States Purpledrive Technologies Full time

    Experience using AWS (that's just common sense) Experience designing and building web environments on AWS which includes working with services like EC2 ELB RDS and S3 Experience building and maintaining cloud-native applications A solid background in Linux/Unix and Windows server system administration Experience using DevOps tools in a cloud environment such...


  • Jersey City, New Jersey, United States Royal Bank of Canada Full time

    Job SummaryJob DescriptionSenior Software Engineer, RBC Capital Markets, LLC, Minneapolis, MN: Manage all aspects of implementation planning & coordination. Manages all aspects of testing and verification ensuring all tasks are performed for all interfaces. Identify technical and business opportunities to take advantage of cross project knowledge, best...

  • Software Engineer

    1 month ago


    Jersey City, New Jersey, United States MetaOption LLC Full time

    Software Engineer (AI/ML Engineer)Skills: Python, Data Science, ML libraries and frameworks, Gen AI / LLM, AWS, Microsoft cloud, MLOPsExperience level: Mid-senior Experience required: 8 Years Education level: Bachelor's degree Relocation assistance: NoHybrid work: 3 days a week onsiteLooking for local candidates in Jersey City NJ or Dallas TX or Tampa,...


  • Jersey City, New Jersey, United States BAE Systems Full time

    Job Description Job DescriptionSee what you're missing. Our employees work on the world's most advanced electronics – from detecting threats for F-35 pilots to illuminating the night for soldiers. Spanning air, land, sea, and space, we are developing the technology of tomorrow, delivered today. Drawing strength from our differences, we're innovating for...


  • Jersey City, New Jersey, United States Tiger Analytics, LLC Full time

    As a Principal Data Engineer (Azure), you would have hands on experience working on Azure as cloud, Databricks and some exposure/experience on Data Modelling.using different Open Source, Big Data, and Cloud technologies on Microsoft Azure.Experience in implementing Data Lake with technologies like Azure Data Factory (ADF), PySpark, Databricks,...


  • Jersey City, New Jersey, United States Royal Bank of Canada Full time

    Job SummaryJob DescriptionRBC Capital Markets seeks a Senior Low Latency Engineer in New York, NY to design and develop solutions to complex applications problems, system administration issues, or network concerns. Perform systems management and integration functions. Verify stability, interoperability, portability, security, or scalability of system...


  • Jersey City, New Jersey, United States Bank of America Full time

    Job Description:At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for...


  • Jersey City, New Jersey, United States BAE Systems Full time

    Job Description Because of the need for consistent, in-person collaboration and/or the requirement to perform all work onsite due to the nature of this particular role, it will be performed full-time on site. This means work will be conducted on location at a BAE Systems facility 100% of the time.If you're looking to return to the workplace after a career...


  • Jersey City, New Jersey, United States BAE Systems Full time

    Job Description Because of the need for consistent, in-person collaboration and/or the requirement to perform all work onsite due to the nature of this particular role, it will be performed full-time on site. This means work will be conducted on location at a BAE Systems facility 100% of the time.If you're looking to return to the workplace after a career...


  • Jersey City, New Jersey, United States Mitchell Martin Inc Full time

    Our client, one of the largest banks in the US with wealth management, investment banking, and international business, is seeking an Application Architect VLocation: Jersey City, NJPosition Type: ContractJob Summary:We are seeking an architect to help design and deliver a single compliant and secure service mesh solution spanning private and public Clouds...

  • Hardware Engineer

    2 weeks ago


    Jersey City, New Jersey, United States ATR International Full time

    Job Description:We are seeking a Lead Hardware/ Infrastructure Engineer for a very important client Job Description:We have an exciting and rewarding opportunity for you to take your Infrastructure engineering career to the next level.As a Lead Infrastructure Engineer within the Enterprise Technology and Infrastructure Platforms division, you will...

  • Hardware Engineer

    8 hours ago


    Jersey City, New Jersey, United States ATR International Full time

    Job Description:We are seeking a Lead Hardware/ Infrastructure Engineer for a very important client Job Description:We have an exciting and rewarding opportunity for you to take your Infrastructure engineering career to the next level.As a Lead Infrastructure Engineer within the Enterprise Technology and Infrastructure Platforms division, you will...

  • C++ Developer

    1 month ago


    Jersey City, New Jersey, United States Techmorgonite Software Solutions LLC Full time

    Job Title Senior C++ DeveloperDuration 12 monthsLocation Jersey City NJ (Initally Remote)Requirements5-8 years of solid software engineering experienceCompletely hands-on with 5+ years of experience in C or C++3+ years of experience in Perl & Shell Script on Unix/Linux platform.Good knowledge in relational database (Sybase/ Oracle) SQL's and store...


  • Jersey City, New Jersey, United States Mastech Inc Full time

    Role: Senior Database and Report DeveloperSenior Database and Report Developer10+ years of hands-on experience as a Business Objects and/or Power BI developer.Excellent Knowledge of Business Objects or Power BI.Business Objects developerOracle/SQL Server database developer.Bachelor's degree in Computer Science, Electrical/Electronic Engineering, Information...


  • Jersey City, New Jersey, United States BAE Systems Full time

    Job Description See what you're missing. Our employees work on the world's most advanced electronics – from providing the latest in Smart Munition advanced seekers to illuminating the night for soldiers. Spanning air, land, sea, and space, we are developing the technology of tomorrow, delivered today. Drawing strength from our differences, we're innovating...