Sr Site Reliability Engineer

2 days ago


San Francisco, United States Federal Reserve Bank of San Francisco Full time

Company: Federal Reserve Bank of San Francisco

Job Description:

While the SF Fed is a Reserve Bank, we're not what you might expect. We're unreserved here. That means we seek new and diverse perspectives. We spark conversations and encourage debate. We build opportunity. We pursue careers that are true to ourselves. We are looking for people who want to help our country reach its full economic potential.

When you join the SF Fed, you join a team of people working together to foster an inclusive economy that works for everyone. From data-driven insights to cloud transformation, the information technology team moves the SF Fed forward. We use innovative technologies and Agile methods to positively impact every American across the communities we serve.

Come and be a part of the dynamic National Integration Services (NIS) team. If you have a passion for building the Bank of tomorrow on the best technology available, then this role is for you. NIS is responsible for a portfolio of both established and emerging application integration products architected to support our mission-critical systems.

We are looking for a Senior Site Reliability Engineer to join our team. To be successful in this role, you must be an organized self-starter with stellar leadership and communication skills who can work independently building relationships and driving technical solutions from inception to completion.

Responsibilities:

  • Maintains and improves existing build and deployment processes across all products.
  • Collaborates with Integration engineers to create automation best practices.
  • Designs and deploys new application components and infrastructures.
  • Implements and maintains a continuous integration environment.
  • Supports and troubleshoots product and infrastructure issues.
  • Writes configuration scripts for automation tools and automates recurring tasks.
  • Actively monitors and administers cloud-hosted applications and builds integrations.
  • Participates in engineering design and deployment planning.
  • Defines and documents continuous integration/continuous deployment best practices.
  • Solves difficult problems with scripting languages across multiple environments.
  • Implements and maintains security in accordance with Bank security policies.
  • Drives improvement opportunities in infrastructure, tooling, and workflows using a continuous feedback loop.
  • Ensures uptime and reliability of cloud-based infrastructure and systems by monitoring system performance and maintaining high availability of cloud-based assets.
  • Participates in incident response and troubleshooting by conducting root cause analysis and implementing solutions to prevent recurrence.
  • Provides on-call production support.

Qualifications:

  • Bachelor's degree in Computer Science, Information Systems, Computer Engineering, Systems Analysis, or a related field, or equivalent work experience.
  • Typically requires 5+ years of relevant technical or business experience with 3 years of experience in managing complex systems using software.
  • General familiarity with automated deployments.
  • This position requires strong experience with IBM ACE / MQ.
  • Experience in common scripting languages (shell, Python, Perl, YAML, etc.).
  • Proven ability to write clear and concise technical documents including design documents, specifications, and technology roadmaps.
  • Requires recent demonstrable experience with GitLab pipeline coding. Capable of stitching together Terraform, AWS CLI commands, and Bash scripting.
  • Ability to write and develop Terraform modules for different infrastructure setup, installation, and configuration.
  • Recent experience with AWS CLI including S3, EC2, ASG, ELB, SFTP, CloudFormation, CloudTrail, CloudWatch, Lambda, etc.
  • AWS Certification and/or experience utilizing AWS services.
  • Working experience in Ansible, GitLab, Terraform, CloudWatch, Dynatrace, Grafana or equivalent is a must.
  • Integration experience with microservices is a plus.
  • Must be a U.S. Citizen or a Green Card holder with the intent to become a U.S. Citizen.

Base Salary Range, Sr. Site Reliability Engineer: Min: $113,600 - Mid: $147,600 - Max: $181,600 (Location: San Francisco)

Final salary and offer will be determined by the applicant's background, experience, skills, internal equity, and alignment with market data.

We offer a wonderful benefits package including: Medical, Dental, Vision, Pre-tax Flexible Spending Account, Backup Child Care Program, Pre-Tax Day Care Flexible Spending Account, Paid Family Care Leave, Vacation Days, Sick Days, Paid Holidays, Pet Insurance, Matching 401(k), and Retirement/Pension.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. The SF Fed is an Equal Opportunity Employer.

Job Category: Information Technology

Work Shift: First (United States of America)

The Federal Reserve Banks believe that diversity and inclusion among our employees is critical to our success as an organization, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

  • Sr. SRE

    4 weeks ago


    san francisco, United States Walter Bacon, LLC Full time

    Sr. SRE (Site Reliability Engineer), Contract or Contract-to-hire. IDEAL CANDIDATE: 10+ years of SRE experience. Supporting very high-traffic, mission-critical fintech. Hybrid: work in San Francisco on Tuesdays and Fridays. Customer-facing skills; you might interact with some clients. AWS, Splunk, APM tools, monitoring tools, automation, scripting, Python, Bash.


  • San Francisco, United States Focal Systems Full time

    Location: San Francisco - hybrid (1-2 days per week). Salary: $165-175k + stock. Company Description: Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar...


  • San Francisco, United States WEX Full time

    The WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...


  • San Francisco, California, United States Outdefine Full time

    About the Job: We are seeking a highly skilled Site Reliability Engineer to join our team at Outdefine. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our ecommerce platform. Key Responsibilities: Design and implement scalable and highly available cloud infrastructure using Kubernetes...


  • San Francisco, California, United States Roman Health Pharmacy LLC Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Xero. As a key member of our Reliability Enablement team, you will play a critical role in ensuring the reliability and performance of our systems. Key Responsibilities: Investigate operational surprises and support teams in post-incident activities. Conduct in-depth...


  • San Francisco, United States Focal Systems Full time

    Location: San Francisco - hybrid (1-2 days per week). Salary: $170-190k + stock. Company Description: Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail...


  • San Francisco, California, United States Swish Analytics Full time

    Site Reliability Engineer at Swish Analytics. Swish Analytics is a sports analytics and betting startup that's revolutionizing the industry with cutting-edge predictive data products. We're on a mission to make oddsmaking a challenge rooted in engineering, mathematics, and sports betting expertise, not intuition. We're looking for a team-oriented...


  • San Francisco, United States Ellation, Inc. Full time

    Who We Are: We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • San Francisco, California, United States WEX Full time

    Job Summary: The WEX Site Reliability Engineering team is seeking a highly motivated and quick-learning individual to join our team as a Site Reliability Engineer Level 1. As a key member of our team, you will be responsible for ensuring the reliability, performance, and security of our systems. Key Responsibilities: Actively participate in training and...


  • San Francisco, United States iRhythm Technologies, Inc. Full time

    Requisition Request: Job Title: Site Reliability Engineer. Department: IT-800. Hiring Manager: Jai Singh. Salary Range: P2 (2 years min with Bachelor's or 0 years with Master's). The salary range is 93K to 110K. Location: San Francisco, CA. Replacing? Kyle Sagers. Budgeted/Non-Budgeted: Budgeted for 1Q 2020. About iRhythm: iRhythm is a leading digital healthcare company...


  • San Francisco, United States New York Technology Partners Full time

    Must-haves, in order of preference: Typical Java/J2EE experience between 6 and 10 years. Application Production Support (SRE - Site Reliability Engineering) with 3+ years, preferably in e-commerce domain. Hands-on experience in any of the UI frameworks (AngularJS, VueJS, etc.) - 1+ years.


  • San Francisco, California, United States Arbitrum Inc Full time

    Reliability Engineer: At Arbitrum Inc, we're on a mission to bring blockchain to a billion people. Our developer platform is designed to make building on the blockchain easy, and we're looking for a skilled Reliability Engineer to join our Infrastructure team. As a Reliability Engineer, you'll collaborate with our engineering team to design, deploy, and...


  • San Francisco, California, United States Aitopics Full time

    About the Role: We are seeking a highly skilled Staff Site Reliability Engineer to join our Data Engineering team. As a key member of our team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to make...


  • San Francisco, California, United States Hinge Health Full time

    About the Role: Hinge Health is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our platform, including automation, logging, monitoring, and alerting. You will thrive in a collaborative environment, have excellent communication skills, and be...


  • San Francisco, California, United States Tampa Gardens Senior Living Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Team. As a key member of our team, you will be responsible for deploying, managing, optimizing, and upgrading the systems that run Sight Machine software. You will work closely with our Development Engineering team to ensure the stability,...


  • San Francisco, California, United States Zilliz Full time

    Job Title: Cloud Platform Staff Site Reliability Engineer. We are seeking a highly skilled Cloud Platform Staff Site Reliability Engineer to join our team at Zilliz. As a key member of our SRE team, you will be responsible for ensuring the reliability, availability, and performance of our distributed database systems. Key Responsibilities: Design and build tools...


  • San Francisco, United States Tbwa ChiatDay Inc Full time

    Location: San Francisco - hybrid (1-2 days per week). Salary: $165-175k + stock. Company Description: Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail...