Principal Site Reliability Engineer
2 weeks ago
About the Role:
As a member of the TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support numerous mission-critical workloads.
Key Responsibilities:
• Develop and refine your skills in a fun, collaborative, and rapidly changing environment.
• Collaborate across numerous Fidelity teams to drive initiatives from design to implementation.
• Have a direct impact on the emerging strategies of our infrastructure and deployments.
• Work independently with minimal direction to drive and champion the overall design of highly available, secure, scalable microservices-based applications in AWS.
• Provide technical leadership to strong teams of Site Reliability Engineers / Cloud Engineers.
• Configure and deploy resilient infrastructure in multiple regions and multiple availability zones.
• Work multi-functionally with other organizations and collaborate with our risk, product, and engineering team leaders.
• Promote a DevOps mentality, providing mentorship and establishing development standard methodologies for AWS infrastructure-as-code.
• Champion automation tools to improve software delivery and reduce risk.
• Develop and maintain logging, monitoring, and alerting capabilities using tools like Datadog and Splunk.
Requirements:
• 5+ years of hands-on experience with AWS in a production environment.
• Experience building and deploying Docker images including Docker Compose.
• Production experience running Kubernetes workloads ideally on AWS EKS.
• Experience managing and maintaining Kubernetes Clusters on AWS EKS.
• Experience with Confluent or Kafka.
• Experience creating and deploying Helm charts & libraries.
• Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries.
• Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud.
• Proficiency with UNIX operating systems and shell scripting.
• Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO.
• Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog and Splunk.
• Ability to communicate at all levels with a track record of strong written and verbal communications.
• Ability to see problems as opportunities to automate.
• Ability to work independently with minimal direction.
• Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones.
• Experience with the agile software development lifecycle and Kanban preferred.
• Experience with CDN Providers e.g., Akamai preferred.
About Fidelity Investments:
Fidelity Investments is a privately held company with a mission to strengthen the financial well-being of our clients. We help people invest and plan for their future. We assist companies and non-profit organizations in delivering benefits to their employees. And we provide institutions and independent advisors with investment and technology solutions to help invest their own clients' money.
Why Join Us:
At Fidelity, you'll find endless opportunities to build a meaningful career that positively impacts peoples' lives, including yours. You can take advantage of flexible benefits that support you through every stage of your career, empowering you to thrive at work and at home. Honored with a Glassdoor Employees' Choice Award, we have been recognized by our employees as a top 10 Best Place to Work in 2024. And you don't need a finance background to succeed at Fidelity-we offer a range of opportunities for learning so you can build the career you've always imagined.
-
Principal Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Fidelity Investments Full timeJob Title: Principal Site Reliability EngineerAt Fidelity Investments, we're seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a key member of our team, you'll work closely with our engineering partners to drive initiatives from design to implementation, ensuring the reliability and scalability of our...
-
Principal Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Fidelity TalentSource LLC Full timeJob Title: Principal Site Reliability EngineerJob Summary:Fidelity Digital Assets is seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team. As a key member of our team, you will be responsible for designing, implementing, and maintaining highly available, secure, and scalable cloud infrastructure on AWS. You will...
-
Principal Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Fidelity TalentSource LLC Full timeJob Summary:We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Fidelity Digital Assets. As a key member of our TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.The Role:As a Principal Site Reliability Engineer, you will be responsible...
-
Principal Site Reliability Engineer
7 days ago
Jersey City, New Jersey, United States Fidelity Investments Full timeJob Title: Principal Site Reliability EngineerThe Role:As a member of the TechOps SRE team at Fidelity Investments, you will work closely with our engineering partners to enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes environments are best-in-class and central to our enterprise-grade infrastructure...
-
Principal Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Fidelity Investments Full timeJob Overview:About the RoleFidelity Investments is seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a member of this team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are...
-
Principal Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Fidelity Investments Full timeThe RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a key member of our team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our...
-
Principal Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Fidelity Investments Full timeThe RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a member of this team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our...
-
Principal Site Reliability Engineer
1 week ago
Jersey City, New Jersey, United States Fidelity TalentSource LLC Full timeJob Description:The RoleAs a member of the TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments...
-
Principal Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Fidelity TalentSource LLC Full timeJob Description:As a member of the TechOps SRE team, you will work closely with our engineering partners to enable and drive initiatives from design to implementation.Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...
-
Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States CyberTec Full timeSite Reliability EngineerCyberTec is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructureDevelop and maintain monitoring and...
-
Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...
-
Site Reliability Engineer
1 week ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your...
-
Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States City National Bank Full timeSite Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that improve...
-
Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Goldman Sachs Full timeAbout This RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our post-execution processing platforms, which handle trade processing, internal firm/firm trades, and client delivery across physical and synthetic...
-
Site Reliability Engineer
1 week ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeAbout the RoleWe are seeking a talented Site Reliability Engineer to join our SRE Platforms team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Our team is responsible for designing and...
-
AWS Site Reliability Engineer
1 week ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly our AWS environment.The ideal candidate will have strong experience with AWS, with a focus on SRE principles...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Goldman Sachs Group Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at The Goldman Sachs Group. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Key ResponsibilitiesDevelop and...
-
Site Reliability Engineer
3 weeks ago
Jersey City, New Jersey, United States Hispanic Technology Executive Council Full timeJob DescriptionAt Hispanic Technology Executive Council, we are committed to delivering exceptional results through the power of technology. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services.Key ResponsibilitiesPartner with engineering and technology teams to improve reliability and...
-
Site Reliability Engineer
2 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly on AWS. Your strong AWS experience and 2-3 years of recent experience will be invaluable in this role.The ideal...