Senior Reliability Engineer

2 weeks ago


San Francisco, California, United States Federal Reserve Bank (NY) Full time
Position Overview:
As a Senior Reliability Engineer, you will be integral to the Data & Analytics Services (DAS) Team, collaborating with application development, quality assurance, and DevOps teams, as well as our National IT departments. Your primary focus will be on architecting, constructing, and overseeing the systems that support our application suite, both on-premises and in the cloud.

Key Responsibilities:
- Collaborate with development and DevOps teams to establish and execute Continuous Integration and Continuous Delivery (CI/CD) pipelines.
- Partner with National IT to develop and implement necessary tooling for Continuous Delivery.
- Ensure that our systems and tools are scalable and adaptable to evolving requirements.
- Engage in design and planning sessions to account for changes in tooling or services.
- Drive process enhancements and mentor others on best practices in Platform Engineering and DevOps.
- Lead the design of logical models and implement solutions to meet business needs.
- Take ownership of the technical aspects of software development for assigned applications.
- Work on multiple projects as a technical team member or lead, focusing on user story analysis, design, development, testing, and automation tools.
- Ensure consistency across all data solutions in collaboration with developers.
- Communicate effectively with technical and business product managers, as well as third parties, regarding solution design.
- Participate actively in an Agile development environment, attending daily standups and sprint planning activities.
- Provide operational support for applications and utilities, assisting in the deployment of new modules, upgrades, and fixes.
- Conduct user training workshops and lead by example in adopting new software and technology.

Qualifications:
- Bachelor’s degree in Computer Science, Information Systems, Computer Engineering, Systems Analysis, or a related field, or equivalent work experience.
- A minimum of 7 years of experience in building and supporting enterprise-level systems as a platform engineer or similar role in a production environment.
- At least 3 years of relevant technical experience with AWS and Big Data technologies.
- Proven expertise in designing, developing, and maintaining end-to-end data solutions using modern data lake and enterprise data warehouse technologies.
- Strong communication skills, with the ability to produce clear and concise technical documentation.
- Hands-on experience with tools and services essential for on-premises and cloud DevOps best practices.

Compensation and Benefits:
- Competitive salary range based on experience and qualifications.
- Comprehensive benefits package including medical, dental, vision, retirement plans, and more.
- Commitment to diversity and inclusion, ensuring a welcoming environment for all employees.

  • San Francisco, California, United States Chelsoft Solutions Co Full time

    Job Overview: We are seeking a Senior Site Reliability Engineer to join our dynamic team at Chelsoft Solutions Co. This position is designed for a skilled SRE professional who thrives in a hybrid work environment. Key Responsibilities: Implement and maintain reliable systems and infrastructure. Collaborate with cross-functional teams to enhance system...


  • San Francisco, California, United States Pager Full time

    PagerDuty empowers teams of all kinds to drive business forward through our Operations Cloud. We're seeking a Senior Site Reliability Engineer to join our SRE-Platform team. As a key contributor, you'll build, maintain, and scale our Kubernetes platform, accelerating developer productivity, improving reliability, and helping PagerDuty scale for the...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    About AutoRABIT Holding Inc.: AutoRABIT Holding Inc. is a leading provider of a Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. Our solutions enable developers to automate their daily tasks, increasing productivity and release velocity while meeting stringent security, compliance, and privacy...


  • San Francisco, California, United States AutoRABIT Holding, Inc. Full time

    About AutoRABIT Holding, Inc.: AutoRABIT Holding, Inc. is a leading provider of a Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. Our solutions enable developers to automate their daily tasks, making them more productive and increasing the release velocity of their development team, while meeting...


  • San Francisco, California, United States Outdefine Full time

    About the Job: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Outdefine. As a key member of our Infrastructure team, you will be responsible for ensuring the reliability and scalability of our blockchain-based systems. Key Responsibilities: Run internal Chainlink and Blockchain nodes to ensure seamless connectivity and data...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job Overview: About AutoRABIT: AutoRABIT is a rapidly expanding SaaS provider and a prominent leader in the Salesforce DevSecOps platform tailored for regulated sectors such as finance, insurance, and healthcare. Our solutions empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, California, United States RevenueCat Full time

    About RevenueCat: We are a leading provider of mobile subscription infrastructure, handling over $3 billion in in-app purchases annually across thousands of apps. Our mission is to build a standard for mobile subscription infrastructure, and we're looking for a Senior Site Reliability Engineer to help us achieve this goal. About the Role: We're seeking a highly...


  • San Diego, California, United States Mentis Systems Full time

    Job Overview: We are currently seeking a Senior Reliability Systems Engineer at Mentis Systems. This role is pivotal in overseeing the development of new products and ensuring their reliability throughout the lifecycle. Position Details: Role: Senior Reliability Systems Engineer. Duration: 12+ months Contract. Location: Hybrid/San Diego CA. Key Responsibilities: The...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job Overview: About AutoRABIT: AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our offerings empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering...


  • San Francisco, California, United States Operant AI Full time

    Job Overview: Senior Site Reliability Engineer. As the inaugural SRE within our organization, we are looking for an individual to establish Operant's SRE strategy and operations aimed at ensuring the resilience and security of our platforms and services. If you are enthusiastic about the prospect of being an early engineer at a startup ready to revolutionize...


  • San Francisco, California, United States Centene Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Centene. As a key member of our technology organization, you will play a critical role in ensuring the reliability, performance, and security of our platform infrastructure. Key Responsibilities: Lead Projects and Initiatives: Help lead projects focused on...


  • San Diego, California, United States Mentis Systems Full time

    Job Overview: We are currently seeking a Senior Reliability Systems Engineer at Mentis Systems. This role is crucial for driving our New Product Development initiatives. Position Details: The Senior Reliability Systems Engineer will utilize their extensive technical expertise and leadership capabilities to guide Systems Engineering through various stages of the...


  • San Francisco, California, United States MasterCard Full time

    Our Mission: We strive to connect and empower a comprehensive digital economy that serves everyone, everywhere by ensuring transactions are secure, straightforward, intelligent, and accessible. Through the use of secure data and networks, along with strategic partnerships and a passionate workforce, our innovations and solutions assist individuals, financial...


  • San Francisco, California, United States Abnormal Security Full time

    Job Overview: At Abnormal Security, we empower organizations of all sizes to combat cyber threats through our innovative cloud solutions. As we strive to enhance our offerings in highly regulated environments, we are seeking a dedicated **Site Reliability Engineer II** to play a crucial role in ensuring the scalability, reliability, and availability of our...


  • San Francisco, California, United States Circle Full time

    About Circle: Circle is a leading financial technology company that is revolutionizing the way value is transferred globally. Our innovative infrastructure enables businesses, institutions, and developers to harness the power of blockchain technology and capitalize on the emerging internet of money. Job Summary: We are seeking a highly skilled Senior Site...


  • San Francisco, California, United States Block Full time

    Company Description: It all started with an idea at Block in 2013. Initially built to take the pain out of peer-to-peer payments, Cash App has gone from a simple product with a single purpose to a dynamic ecosystem, developing unique financial products, including Afterpay/Clearpay, to provide a better way to send, spend, invest, borrow and save to our 47...


  • San Francisco, California, United States Square Inc. Full time

    Job Description: **About the Role** We are seeking a highly skilled and experienced Senior Engineering Manager to lead our Mobile Performance, Reliability, and Observability (MPRO) team at Square Inc. As a key member of our engineering organization, you will be responsible for delivering highly performant and reliable experiences to our customers. **Key...


  • San Diego, California, United States Mentis Systems Full time

    Job Overview: We are currently seeking a Senior Reliability Systems Engineer to contribute to a pivotal project at Mentis Systems. This role is essential for ensuring the robustness and reliability of our innovative products. Key Responsibilities: The selected candidate will be responsible for: Utilizing extensive technical knowledge to guide systems engineering...


  • San Francisco, California, United States Test Dev Tools Full time

    Job Description: Astranis is a leading provider of innovative satellite technology, aiming to bridge the digital divide by connecting the world's underserved communities. As a Senior Reliability Test Engineer at Test Dev Tools, you will play a critical role in ensuring the highest level of quality and reliability in our satellite hardware. Key...


  • San Francisco, California, United States Cognizant Full time

    Senior Site Reliability Engineer and R2 Solutions Architect (Remote) Cognizant is seeking an experienced Senior Site Reliability Engineer and R2 Solutions Architect with expertise in Python Performance Validation and Dynatrace to oversee critical projects. Your contributions will significantly enhance the efficiency and effectiveness of our solutions,...