Principal Site Reliability Engineer

3 weeks ago


Jersey City, United States Fidelity Investments Full time
Job Description:

The Role

As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support numerous mission-critical workloads. In this exciting role, you'll have the opportunity to further develop and refine your skills, collaborate across numerous Fidelity teams, and continue to grow in a fun, collaborative, and rapidly changing environment. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business.

The Skills and Expertise You Bring
  • Several years of hands-on experience with AWS in a production environment
  • Experience building and deploying Docker images including Docker Compose
  • Production experience running Kubernetes workloads ideally on AWS EKS
  • Experience managing and maintaining Kubernetes Clusters on AWS EKS
  • Experience creating and deploying Helm charts & libraries
  • Production experience with infrastructure-as-code (IaC), Terraform preferred
  • Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries
  • Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud
  • Proficiency with UNIX operating systems and shell scripting
  • Programming experience, e.g., Python preferred
  • Experience with distributed version control systems, Git preferred
  • Experience with the agile software development lifecycle and Kanban preferred
  • Experience with CDN Providers e.g., Akamai preferred
  • Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
  • Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog and Splunk
  • Ability to communicate at all levels with track record of strong written and verbal communications
  • See problems as opportunities to automate
  • Ability to work independently with minimal direction
  • Drive and champion the overall design of highly available, secure, scalable microservices-based applications in AWS
  • Track record of providing technical leadership to strong teams of Site Reliability Engineers / Cloud Engineers
  • Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones
  • Work multi-functionally with other organizations and collaborate with our risk, product and engineering team leaders
  • Leading the initiative to craft and deploy our applications to the cloud
  • Promoting a DevOps mentality, providing mentorship and establishing development standard methodologies for AWS infrastructure-as-code
  • Championing automation tools to improve software delivery and reduce risk
The Team

Fidelity Digital Assets, a Fidelity Investments Company, is developing a full-service enterprise-grade platform for storing, trading, and servicing digital assets, such as Bitcoin and Ethereum.

Fidelity Digital Assets embraces an entrepreneurial culture and startup mindset while serving as one of the most innovative business units within Fidelity Investments. Our global, diverse team of hundreds of forward-thinking professionals lead with agility and creativity to build solutions that bridge the gap between traditional institutional investors and their exposure to digital assets. The firm's tenure and experience across multiple business lines present our employees with unprecedented access to knowledge, technology, and resources that help our team reshape the future of finance.

Within Fidelity Digital Assets, Technical Operations team is central to our initiative of moving to the cloud. The team uses AWS services to secure our network and scale our applications to ensure their up-time and reliability. Team members are hands-on Site Reliability Engineers who promote a DevOps approach, with a focus on infrastructure-as-code, security and automation.

#cryptojobs

The base salary range for this position is $85,000-$179,000 per year.
Placement in the range will vary based on job responsibilities and scope, geographic location, candidate's relevant experience, and other factors.

Base salary is only part of the total compensation package. Depending on the position and eligibility requirements, the offer package may also include bonus or other variable compensation.

We offer a wide range of to meet your evolving needs and help you live your best life at work and at home. These benefits include comprehensive health care coverage and emotional well-being support, market-leading retirement, generous paid time off and parental leave, charitable giving employee match program, and educational assistance including student loan repayment, tuition reimbursement, and learning resources to develop your career. Note, the application window closes when the position is filled or unposted.

Certifications:

Company Overview

Fidelity Investments is a privately held company with a mission to strengthen the financial well-being of our clients. We help people invest and plan for their future. We assist companies and non-profit organizations in delivering benefits to their employees. And we provide institutions and independent advisors with investment and technology solutions to help invest their own clients' money.

Join Us

At Fidelity, you'll find endless opportunities to build a meaningful career that positively impacts peoples' lives, including yours. You can take advantage of flexible benefits that support you through every stage of your career, empowering you to thrive at work and at home. Honored with a , we have been recognized by our employees as a top 10 Best Place to Work in 2024. And you don't need a finance background to succeed at Fidelity-we offer a range of opportunities for learning so you can build the career you've always imagined.

blends the best of working offsite with maximizing time together in person to meet associate and business needs. Currently, most hybrid roles require associates to work onsite all business days of one assigned week per four-week period (beginning in September 2024, the requirement will be two full assigned weeks).

At Fidelity, we value honesty, integrity, and the safety of our associates and customers within a heavily regulated industry. Certain roles may require candidates to go through a preliminary credit check during the screening process. Candidates who are presented with a Fidelity offer will need to go through a background investigation, , and may be asked to provide additional documentation as requested. This investigation includes but is not limited to a criminal, civil litigations and regulatory review, employment, education, and credit review (role dependent). These investigations will account for 7 years or more of history, depending on the role. Where permitted by federal or state law, Fidelity will also conduct a pre-employment drug screen, which will review for the following substances: Amphetamines, THC (marijuana), cocaine, opiates, phencyclidine.

We invite you to Find Your Fidelity at .

Fidelity Investments is an equal opportunity employer. We believe that the most effective way to attract, develop and retain a diverse workforce is to build an enduring culture of inclusion and belonging.

Fidelity will reasonably accommodate applicants with disabilities who need adjustments to participate in the application or interview process. To initiate a request for an accommodation, contact the HR Accommodation Team by sending an email to .

We welcome those with experience in jobs such as Software Developer, Computer Technician, and Computer User Support Specialist and others in the Computers and Technology to apply. Principal Site Reliability Engineer

  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.As a Principal Site Reliability Engineer at JP Morgan Chase within the Corporate Technology, you draw upon your advanced knowledge to identify...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Lead Site Reliability Engineer at JPMorgan Chase within the Community & Consumer Banking - Infrastructure & Production Management Team, you hold a leadership role...


  • Kansas City, United States Bayer Full time

    At Bayer we’re visionaries, driven to solve the world’s toughest challenges and striving for a world where 'Health for all Hunger for none’ is no longer a dream, but a real possibility. We’re doing it with energy, curiosity and sheer dedication, always learning from unique perspectives of those around us, expanding our thinking, growing our...


  • Kansas City, United States Bayer Full time

    At Bayer we're visionaries, driven to solve the world's toughest challenges and striving for a world where 'Health for all Hunger for none' is no longer a dream, but a real possibility. We're doing it with energy, curiosity and sheer dedication, always learning from unique perspectives of those around us, expanding our thinking, growing our capabilities and...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the RoleAt The Goldman Sachs Group, we're seeking a highly skilled Site Reliability Engineering Specialist to join our Platforms team. As a key member of our global engineering team, you'll be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform...


  • Jersey City, New Jersey, United States JPMorganChase Full time

    Job Description Guide and shape the future of technology at a globally recognized firm, driven by pride in ownership.As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Corporate Technology, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team's...


  • Jersey City, New Jersey, United States Devexperts Full time

    Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Kansas City, United States Bayer Full time

    At Bayer we’re visionaries, driven to solve the world’s toughest challenges and striving for a world where 'Health for all Hunger for none’ is no longer a dream, but a real possibility. We’re doing it with energy, curiosity and sheer dedication, always learning from unique perspectives of those around us, expanding our thinking, growing our...


  • Jersey City, United States Bank of America Full time

    Job Description: At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for...


  • Jersey City, United States Veterans Sourcing Group LLC Full time

    Site Reliability Engineer (AWS) (SRE) Jersey City, NJ - onsite 3 days/week 12 month minimum contract w/ possible full-time conversion Roles And Responsibilities Design, code, test, and deliver software to automate manual operational work. Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents. Engage...


  • Jersey City, United States Hispanic Technology Executive Council Full time

    At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates...


  • Oklahoma City, Oklahoma, United States Thegradcafe Full time

    Position OverviewThis is a full-time position for a Senior Site Reliability Engineer with a software development organization specializing in manufacturing and mechanical engineering. You will have the chance to be part of a distributed team dedicated to enhancing manufacturing processes and reducing production costs for tangible products. This role offers a...


  • Oklahoma City, United States Ford Motor Company Full time

    Site Reliability Engineering at Ford Motor Company plays a critical role in maintaining and improving the reliability, scalability, and performance of our services. You will work closely with our development teams to build and maintain large-scale, distributed systems and ensure our products meet our high standards for availability and user...


  • Oklahoma City, Oklahoma, United States Ford Motor Company Full time

    Site Reliability Engineering at Ford Motor Company plays a critical role in maintaining and improving the reliability, scalability, and performance of our services. You will work closely with our development teams to build and maintain large-scale, distributed systems and ensure our products meet our high standards for availability and user...


  • Oklahoma City, United States PAYCOM PAYROLL LLC Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...


  • Oklahoma City, United States PAYCOM PAYROLL LLC Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. RESPONSIBILITIES Develop software...


  • Oklahoma City, United States Paycom Payroll Llc Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...


  • Redwood City, United States 1872 Consulting Full time

    Site Reliability Engineer - 100% Remote Role Summary: Site Reliability Engineers (SREs) are responsible for working with different developer teams to keep our systems running smoothly. They are a blend of pragmatic operators and software craftspeople that apply excellent problem-solving and communication skills to develop or configure tools that will...


  • Oklahoma City, United States Paycom Online Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. RESPONSIBILITIES Develop...


  • Oklahoma City, United States Paycom Full time

    Job DetailsLevel Experienced Job Location Oklahoma City Office - Oklahoma City, OK Position Type Full Time Education Level Bachelor's Degree Travel Percentage None Job Category Development Description Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites,...