Principal Site Reliability Engineer, Platform
1 week ago
Department: Platform

Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses on building a scalable and secure foundations platform that enables Engineering to deploy, validate, and operate their services in production, improves service resiliency, increases organizational efficiency by reducing operational toil, and increases system efficiency through architectural evolution.

The Site Reliability Engineering team engages directly with our other engineering teams to onboard them onto our platform systems, review and recommend design and architectural decisions, and guide them in implementing the tooling from the larger Platform organization that lets systems scale and react to changing conditions, with continuous improvement loops.

The Role: Principal Site Reliability Engineer

You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling and by working cross-functionally across Gemini’s engineering teams to influence and shape our development practices and culture.

Responsibilities:
Provide primary operational support and engineering for various Gemini services
Improve reliability, quality, and time-to-market across all Gemini services and offerings
Guide engineering teams onto the various supported services provided by Platform
Run ongoing performance evaluations and improvements for Gemini systems
Provide architecture recommendations and engagement as part of the SDLC
Create “Production-ready Scorecards” to evaluate the health of systems pre-launch
Implement and teach monitoring, alerting, and automated resolution best practices
Define SLIs and SLOs with Engineering teams
Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments, etc.
Design, build, and maintain operational tooling and automation that streamline processes and enhance system reliability

Qualifications:
10+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
Good knowledge of various cloud technology providers like AWS, GCP, or Azure
Expert in an infrastructure-as-code environment (Terraform), developing automated solutions to solve support and operational issues
Experience as a Technical Leader within a team, helping evaluate and make tech decisions for the team
Expert working with containerization such as Nomad, EKS (k8s), Docker, etc.
Expert working with Configuration Management such as Ansible, Chef, Puppet
Proficient at writing scripts or CLI tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
Expert analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions

It Pays to Work Here

The compensation & benefits package for this role includes:
Competitive starting salary
A discretionary annual bonus
Long-term incentive in the form of a new hire equity grant
Comprehensive health plans
401K with company matching
Paid Parental Leave
Flexible time off

Salary Range: The base salary range for this role is between $198,000 and $247,000 in the State of New York, the State of California, and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.
-
Staff Site Reliability Engineer, Platform
7 days ago
(usa), United States GEMINI Full time
Department: Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses on building a scalable and secure foundations platform, enabling Engineering to deploy, validate, and...
-
Site Reliability Engineer
2 weeks ago
USA, United States TwinStream Full time $120,000 - $140,000 per year
Who are we: In 2019, our founders were working as engineers solving complex cross-domain problems within government organisations. TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely...
-
Manager, Site Reliability Engineering
2 weeks ago
(usa), United States GEMINI Full time
Department s to ensure smooth integration of applications and systems. Define and enforce Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure system reliability and uptime. Monitor system performance, troubleshoot issues, and ensure timely incident response, root cause analysis, and problem resolution. Implement effective...
-
Site Reliability Engineer
2 weeks ago
USA, United States Baseten Full time $200,000 - $250,000 per year
ABOUT BASETEN
Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With...
-
Senior Site Reliability Engineer, Onchain
2 weeks ago
(usa), United States GEMINI Full time
Department: Onchain
The Role: Senior Site Reliability Engineer
The infrastructure team at Gemini creates and manages software tools and platforms, automates the creation and support of this infrastructure, helps integrate complex processes, and supports secure data access. Security of customers’ digital assets and personal information held with Gemini is...
-
Site Reliability Engineer with 2K
4 days ago
USA, United States eTek IT Services Full time
Job Description
Position: Site Reliability Engineer
Location: Remote
Duration: 1 year
Required Qualification:
6+ years of demonstrated influence across one or more teams for large-scale projects that drive impact and improvement across the organization
6+ years of developing tools for automation of processes or augmenting off-the-shelf tool functionality
6+...
-
Site Reliability Engineer
2 days ago
USA, VA, McLean ( Greensboro Dr, Hamilton), United States Booz Allen Hamilton Full time
Site Reliability Engineer
The Opportunity: Everyone is trying to "harness the cloud," but not everyone knows how. As a DevOps engineer, you're eager to develop, manage, and secure a container platform that meets your client's needs and takes advantage of cloud capabilities. We need you to help us develop container management software to solve some of our...
-
Principal Software Engineer
1 week ago
USA, United States Red Cell Partners Full time $200,000 - $250,000 per year
About Us
Red Cell Partners is an incubation firm building and investing in rapidly scalable technology-led companies that are bringing revolutionary advancements to market in three distinct practice areas: healthcare, cyber, and national security. United by a shared sense of duty and deep belief in the power of innovation, Red Cell is developing powerful...
-
Principal Reliability Engineer
6 days ago
MA: Innovation Dr Tewks Bdg North Street Building , Tewksbury, MA, USA, United States RTX Full time $101,000 - $203,000
Date Posted:
Country: United States of America
Location: MA134: Innovation Dr Tewks Bdg North Street Building 400, Tewksbury, MA, 01876 USA
Position Role Type: Onsite
U.S. Citizen, U.S. Person, or Immigration Status Requirements: Active and transferable U.S. government issued security clearance is required prior to start date. U.S. citizenship is required, as...
-
Principal Software Engineer, Funding
1 week ago
(usa), United States GEMINI Full time
Department: Funding
The Role: Principal Software Engineer
This role reports to the Engineering head of Funding. This is a strategic and influential position responsible for driving engineering excellence, helping shape technical strategy, and providing technical leadership across the funding products - both Crypto and Fiat. This individual serves as the...