Cloud Senior Site Reliability Engineer

2 days ago


Jersey City, New Jersey, United States Hispanic Technology Executive Council Full time
About the Role

We are seeking a highly skilled Senior Site Reliability Engineer to join our team and contribute to the design, build, and maintenance of our next-gen AWS platform. As a key member of our engineering team, you will work closely with colleagues to deliver secure, robust, highly available, and scalable solutions for our External Cloud Platform.

Key Responsibilities
  • Collaborate with engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available, and scalable solutions for our External Cloud Platform.
  • Work with software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration, and continuous delivery pipelines.
  • Own all aspects of reliability, collaborating with technical experts, key stakeholders, and team members to resolve complex problems and staying with each issue until you are sure it will not recur.
  • Apply a deep understanding of SRE practices, service level indicators (SLIs), and service level objectives (SLOs) to resolve issues before they impact customers.
  • Gather, analyze, synthesize, and develop visualizations and reporting from large, diverse data sets in service of continuous improvement of the platform.
  • Implement infrastructure, configuration, and network as code for the applications and platforms in your remit.
  • Identify opportunities to eliminate toil and automate the triage of issues to improve overall operational stability.
  • Collaborate with others to identify, analyze, and resolve platform vulnerabilities.
  • Proactively promote the adoption of site reliability engineering best practices within the team and organization.
  • Participate in 24x7 on-call coverage under a follow-the-sun model and perform blameless postmortems (RCAs) as needed.
Requirements
  • 15 years of combined experience in SRE, software development, or infrastructure engineering (10 years with an advanced degree in Computer Science or a related technical field).
  • 7+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
  • Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on Amazon Web Services.
  • Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as cloud-native tools such as AWS CloudWatch and CloudTrail or Azure Monitor and Log Analytics.
  • Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
  • Proficient in at least one programming language or framework such as Python, Java/Spring Boot, or .NET; 5+ years of applied experience in Python or Java.
  • Proficiency in implementing CI/CD pipelines with tools such as Git and Jenkins; familiarity with a GitOps model.
  • Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.).
  • Advanced understanding of Linux & Windows operating systems including shell scripting.
  • Excellent interpersonal, organizational, and communication (written, verbal, and presentation) skills are a must.
  • Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
Desired Qualifications
  • Strong experience working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and SSO solutions such as PingIdentity or Okta.
  • Proficiency in creating automation using Python, Terraform, or Ansible.
  • Proficiency in implementing, monitoring, and maintaining a Databricks, CosmosDB, or OpenAI platform.
  • Experience in implementing, monitoring, and maintaining a highly scalable and resilient enterprise platform on Microsoft Azure using native services related to compute, storage, networking, security, and observability.
  • Experience with containerization technologies such as EC2, EKS, Fargate, OpenShift, or Kubernetes.
  • Understanding of cost management, inventory management, and the FinOps model.
Skills
  • Architecture
  • Collaboration
  • Innovative Thinking
  • Result Orientation
  • Solution Design
  • Adaptability
  • Analytical Thinking
  • Influence
  • Stakeholder Management
  • Technical Strategy Development
  • Automation
  • DevOps Practices
  • Production Support
  • Project Management
  • Risk Management
Work Schedule

1st shift (United States of America)

Hours Per Week: 40



  • Jersey City, New Jersey, United States Bank of America Full time

    Job Description. Job Summary: We are seeking a highly skilled Cloud Senior Site Reliability Engineer to join our team at Bank of America. As a key member of our cloud infrastructure team, you will be responsible for designing, building, and maintaining our next-gen AWS platform. Key Responsibilities: Collaborate with a diverse set of engineers, architects, and...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description: Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership. As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Corporate Technology, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team's...


  • Jersey City, New Jersey, United States Hispanic Technology Executive Council Full time

    About the Role: We are seeking a highly skilled Cloud Senior Site Reliability Engineer to join our team at the Hispanic Technology Executive Council. As a key member of our engineering team, you will be responsible for designing, building, and maintaining our next-gen AWS platform. Key Responsibilities: Collaborate with a diverse set of engineers, architects, and...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job Summary: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Application Support team, you will be responsible for ensuring the reliability and performance of our applications and infrastructure. Key Responsibilities: Perform application production support, including off-hours...


  • Jersey City, New Jersey, United States Devexperts Full time

    Company Description: Devexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide. By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the Role: At The Goldman Sachs Group, we're seeking a highly skilled Site Reliability Engineering Specialist to join our Platforms team. As a key member of our global engineering team, you'll be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job Summary: We are seeking a highly skilled Lead Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Technology and Operations team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure solutions. Key Responsibilities: Design and implement monitoring and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    About the Role: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Fidelity Investments. As a key member of our Technical Operations team, you will play a critical role in designing, implementing, and maintaining our cloud infrastructure on AWS. Key Responsibilities: Design and implement highly available, secure, and scalable...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    About the Role: We are seeking a highly skilled Cloud Engineer to join our team at Fidelity Investments. As a Cloud Engineer, you will play a critical role in designing, building, and maintaining our cloud infrastructure, ensuring it is secure, scalable, and highly available. Key Responsibilities: Design and implement cloud infrastructure using AWS services such...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description: Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Community & Consumer Banking - Infrastructure & Production Management Team, you hold a leadership role...


  • Jersey City, New Jersey, United States Bank of America Full time

    Position Title: Senior Cloud Solutions Architect. Location: Various Locations. Company Overview: At Bank of America, we are committed to enhancing financial well-being through meaningful connections. Our ethos of Responsible Growth guides our operations, ensuring we deliver value to our clients, teammates, and communities. About the Role: We are seeking a seasoned...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description: Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact. As a Principal Site Reliability Engineer at JP Morgan Chase within the Corporate Technology, you draw upon your advanced knowledge to identify...

  • Cloud Engineer

    5 days ago


    Jersey City, New Jersey, United States Fidelity TalentSource LLC Full time

    Job Description. About the Role: We are seeking a highly skilled Cloud Engineer to join our team at Fidelity TalentSource LLC. As a Cloud Engineer, you will play a critical role in designing, building, and maintaining our cloud infrastructure, ensuring it is secure, scalable, and highly available. Key Responsibilities: Design and implement cloud infrastructure...


  • Jersey City, New Jersey, United States MSys Inc Full time

    Position Overview: We are seeking a highly skilled and experienced Senior Monitoring Engineer to join our team at MSys Inc. This role is crucial for overseeing our monitoring tools and ensuring optimal performance across our cloud-based environments. Key Responsibilities: Stay updated on emerging technologies, industry trends, and innovative technical...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    {"Job Title": "Site Reliability Engineer (AWS)", "Location": "Jersey City, New Jersey", "Duration": "12 months", "Job Description": "We are seeking a skilled AWS professional to join our team at Syntricate Technologies. As a Site Reliability Engineer (AWS), you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. Key...


  • Jersey City, New Jersey, United States BCForward Full time

    Job Description. Job Title: Senior DevOps Engineer. Location: Remote. Job Type: Full-time. Pay Range: $60 - $65 per hour. Job Summary: BCForward is seeking a highly skilled Senior DevOps Engineer to join our team. As a Senior DevOps Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring high availability,...


  • Jersey City, New Jersey, United States ST2 ManTech Advanced Systems Intl Full time

    Job Summary: We are seeking a highly skilled Senior Platform Engineer to join our team at ST2 ManTech Advanced Systems Intl. as a Cloud Security Specialist. This is a remote position that requires a strong background in cloud security, platform engineering, and DevSecOps. Key Responsibilities: Design and implement secure cloud-based Platform-as-a-Service...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description: We have an opportunity to impact your career and provide an adventure where you can push the limits of what's possible. As a Lead Software Engineer at JPMorgan Chase within Cloud Foundational Services, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure,...

  • AWS Cloud Engineer

    2 days ago


    Jersey City, New Jersey, United States Danta Technologies Full time

    Job Title: AWS Cloud Engineer. We are seeking a highly skilled AWS Cloud Engineer to join our team at Danta Technologies. As a key member of our Infrastructure Automation Team, you will play a crucial role in designing, developing, and deploying cloud-based applications and infrastructure. Key Responsibilities: Design and implement secure, scalable, and highly...


  • Jersey City, New Jersey, United States NAM Info Inc Full time

    Job Opportunity. Job Title: Senior Java Developer with MongoDB and Cloud Expert. Location: Remote. Job Type: Long Term Contract. Job Description: We are seeking a highly skilled Senior Java Developer with expertise in MongoDB and cloud technologies to join our team at NAM Info Inc. The ideal candidate will have 5+ years of experience in software engineering, with a...