Lead Site Reliability Engineer
5 days ago
Company: Federal Reserve Bank of San Francisco
We are the Federal Reserve Bank of San Francisco, public servants with a mission to advance the nation's monetary, financial, and payment systems to build a stronger economy for all Americans. We are a community-engaged bank and are committed to understanding and serving the vibrant, expansive communities of the Twelfth District. That means we seek and appreciate new perspectives. We respect people for what they do and for who they are. We build opportunities to learn and grow. When you join the SF Fed, you become part of a diverse team united in its purpose to promote an economy that works for everyone.
The Federal Reserve Bank of San Francisco is looking for a Lead Site Reliability Engineer (DevOps Expert) to join the Enterprise Architecture and Integrated DevOps Team. We empower the Federal Reserve business technology landscape by guiding the development and management of solutions with cloud-centric strategic platforms. This is an exciting opportunity to help design, develop, deploy, and support an automation framework that enables IaaS capabilities across AWS Cloud. The role requires collaborating closely with cross-functional teams to translate infrastructure architectures into automated, scalable, cloud-native integrated DevSecOps solutions. You will standardize deployment and data models to support rapid scaling with multi-tenancy and self-service functionality across our cloud services.
We empower our people to balance their life and work responsibilities. That's why we offer a flexible hybrid work model that allows you to collaborate with office colleagues on some days, and work from home on others.
Responsibilities:
- Design, implement, and maintain scalable, highly available, and secure infrastructure on AWS. Manage and optimize AWS services, including EC2, EKS, and other related services.
- Work with internal collaborators and customers on planning, delivery, and service management.
- Co-own ongoing ITIL processes, and implement and drive continuous improvement initiatives.
- Build and maintain reliable, scalable systems and CI/CD tooling, and automate cloud-based, highly available, high-performing applications.
- Design and deploy robust cloud infrastructure and container solutions, focusing on reliability, scalability, and performance.
- Create or use an automation framework to streamline IaaS provisioning and configuration across the cloud environment, enabling efficient scaling and operational consistency.
- Implement/leverage observability, monitoring, and SRE principles (e.g., error budgets, proactive incident management) to enhance system reliability and performance.
- Apply FinOps practices to monitor and optimize cloud resource usage, ensuring cost-effective operation across all environments.
- Guide engineering teams, fostering standard processes in cloud engineering, SRE, and automation.
- Adopt security best practices within cloud infrastructure using secure design patterns, and ensure alignment with industry standards and regulatory requirements.
- Proactively identify potential vulnerabilities and lead initiatives to ensure systems are prepared for rapid recovery, minimizing impact from disruptions.
- Contribute to the strategy for continuous monitoring and performance tuning of cloud systems to enhance efficiency and reliability. Use data-driven insights to identify optimization opportunities, address performance bottlenecks, and ensure cloud resources meet evolving business demands.
- Design and manage microservices architecture for high performance and scalability.
- Monitor and maintain system performance using cloud monitoring tools.
- Collaborate with development teams to integrate applications into the CI/CD pipeline.
- Automate configuration management and deployment processes.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related technical field.
- Typically requires 7+ years of experience with AWS services, container orchestration, infrastructure as code, and continuous integration/continuous deployment (CI/CD) processes.
- Experience with microservices architecture and integration of programming languages into CI/CD pipelines.
- Experience developing, customizing, and scaling cloud monitoring tools.
- Technical/functional expertise in tooling for ITIL, Agile, Project Management and SDLC.
- Experience supporting infrastructure for large multi-services applications.
- Familiarity with fault injection tooling (e.g., AWS Fault Injection Simulator, Gremlin, Chaos Toolkit, Chaos Monkey).
- Familiarity with best practices in chaos engineering process and implementation (chaos gamedays, business-critical KPIs, etc.).
- Excellent problem-solving skills and ability to work in a fast-paced environment.
- Good interpersonal skills and ability to collaborate effectively with multi-functional teams.
- AWS Certified DevOps Engineer or similar certification.
- Must be a U.S. citizen or a Green Card holder with intent to become a U.S. citizen.
Base Salary Range Lead Site Reliability Engineer: Min: $138,900 - Mid: $180,400 - Max: $221,900 (Location: San Francisco)
Final salary and offer will be determined by the applicant's background, experience, skills, internal equity, and alignment with geographic and other market data.
We offer a wonderful benefits package including: Medical, Dental, Vision, Pre-tax Flexible Spending Account, Backup Childcare Program, Pre-Tax Day Care Flexible Spending Account, Paid Family Care Leave, Vacation Days, Sick Days, Paid Holidays, Pet Insurance, Matching 401(k), and Retirement/Pension.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment.
The SF Fed is an Equal Opportunity Employer.
The Federal Reserve Banks believe that diversity and inclusion among our employees is critical to our success as an organization, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool. The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.