Principal Site Reliability Engineer

2 weeks ago

San Francisco, United States Apollo Solutions Full time

Principal Site Reliability Engineer

Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry.

The Principal Site Reliability Engineer will be working closely with the other engineers to ensure fast, secure and reliable features can be delivered as well as building their infrastructure to feature massive scalability.

Responsibilities of the Principal Site Reliability Engineer.

Lead technical direction across the SRE function.
Set technical direction in the organizations cloud infrastructure (AWS and GCP).
Design and build systems with high availability and scalability as a core component.
Be a driver of the DevOps culture across the engineering organization.
Mentor junior members of the team.

Requirements of the Principal Cloud Infrastructure Engineer:

5+ years' experience working in a DevOps/SRE/Cloud Infrastructure role.
Experience programming in one or more languages
Experience with Cloud technologies (AWS, GCP, Azure)
Experience with Kubernetes
Experience with one or more of the following tools: Terraform, Cloudformation

What we offer:

Full medical, dental and vision insurance.
Unlimited PTO
Mental health support
Retirement plans

If you are interested, please apply now

Principal Site Reliability Engineer

3 days ago

San Francisco, CA, United States Apollo Solutions Full time

Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...
Site Reliability Engineer

3 days ago

San Francisco, CA, United States Apollo Solutions Full time

Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...
Site Reliability Engineer

5 days ago

San Francisco, United States Patreon Full time

Patreon is the best place for creators to build exclusive content and community for their fans. We enable creators (podcasters, writers, musicians, illustrators, etc) to connect with their fans directly and make money from their creative work. Creators can sell one-off items from their own shops or offer recurring monthly memberships with exclusive access to...
Site Reliability Engineer

5 days ago

San Francisco, United States Pelago Full time

Role Overview: At Pelago, we run a serverless architecture on AWS, with infrastructure managed using Terraform. Our system has been built to deliver our virtual clinic for Substance Use Management, and we are looking for a talented Site Reliability Engineer to join the engineering team supporting Pelago.As a HIPAA compliant, HITRUST certified organization it...
Site Reliability Engineer

5 days ago

San Francisco, United States Instabase Full time

At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...
Site Reliability Engineer

7 days ago

San Francisco, United States Instabase Full time

At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...
Site Reliability Engineer

7 days ago

San Francisco, United States Talkdesk Full time

At Talkdesk, we are courageous innovators focused on helping organizations around the world create better customer experiences. Our AI-powered cloud contact center solutions optimize our customers’ most critical customer service processes. We are recognized as a Contact Center as a Service (CCaaS) leader by influential research organizations including...
Site Reliability Engineering

3 days ago

San Francisco, CA, United States Forhyre Full time

Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape. To be successful in this role You'll have the opportunity to design and implement major...
Site Reliability Engineer

1 week ago

San Francisco, United States Resource Informatics Group Full time

Job Title: Site Reliability Engineer Work Location : San Francisco, CA (Hybrid after showing successful engagement) Duration: 18+ months Most important skills: 10 years of Oracle database administration experience on large production environment Database hands on skills especially around database and system troubleshooting and administration GoldenGate...
Site Reliability Engineer

7 days ago

San Francisco, United States Talkdesk Full time

At Talkdesk, we are courageous innovators focused on helping organizations around the world create better customer experiences. Our AI-powered cloud contact center solutions optimize our customers’ most critical customer service processes. We are recognized as a Contact Center as a Service (CCaaS) leader by influential research organizations including...
Site Reliability Engineer

7 days ago

San Francisco, United States DAOmatch Full time

Aptos is a people-first blockchain on a mission to help billions of people achieve universal and fair access to decentralized assets in a safe and scalable way.Founded by some of the original creators and maintainers that researched, designed, and built the Diem blockchain to serve this purpose, we have dedicated several years toward this mission. We believe...
Site Reliability Engineer

2 weeks ago

San Francisco, United States Resource Informatics Group Full time

Job Title: Site Reliability Engineer Work Location: San Francisco, CA (Hybrid after showing successful engagement) Duration: 18+ months Most important skills:10 years of Oracle database administration experience on large production environment Database hands on skills especially around database and system troubleshooting and administration GoldenGate setup,...
Site Reliability Engineer

3 days ago

San Francisco, CA, United States Pelago Full time

Role Overview: At Pelago, we run a serverless architecture on AWS, with infrastructure managed using Terraform. Our system has been built to deliver our virtual clinic for Substance Use Management, and we are looking for a talented Site Reliability Engineer to join the engineering team supporting Pelago.As a HIPAA compliant, HITRUST certified organization...
Site Reliability Engineer

1 week ago

San Francisco, United States Cypress Human Capital Management, LLC Full time

Site Reliability Engineer (Grafana) Responsibilities Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana. Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing discovery to feed data into Grafana Mimir. Establish initial alerting by creating alert...
Site Reliability Engineer

6 days ago

San Francisco, United States Swish Analytics Full time

Swish Analytics is a sports analytics, betting and fantasy startup building the next generation of predictive sports analytics data products. We believe that oddsmaking is a challenge rooted in engineering, mathematics, and sports betting expertise; not intuition. We're looking for team-oriented individuals with an authentic passion for accurate and...
Site Reliability Engineer

2 weeks ago

San Francisco, United States Cypress HCM Full time

Job DescriptionJob DescriptionSite Reliability Engineer (Grafana)Responsibilities:Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana.Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing discovery to feed data into Grafana Mimir.Establish initial...
Engineering Manager, Site Reliability

6 days ago

San Francisco, United States Webflow Full time

At Webflow, our mission is to bring development superpowers to everyone. Webflow is the leading visual development platform for building powerful websites without writing code. By combining modern web development technologies into one platform, Webflow enables people to build websites visually, saving engineering time, while clean code seamlessly generates...
Sr. Site Reliability Engineer

4 weeks ago

San Francisco, United States hims & hers Full time

About the Role: We are seeking a Site Reliability Engineer to help build a reliable web experience for our users. We believe that moving fast is our competitive advantage, and enables us to better serve our users. We also know that the faster we move, the more likely we are to break things. You Will: Design and implement SRE practices ensuring availability,...
Site Reliability Engineer

1 week ago

San Diego, United States ObjectWin Technology Full time

Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly...
Infrastructure and Site Reliability Engineer

3 weeks ago

San Francisco, California, United States Observable Full time

Observable is seeking a full-time infrastructure and site reliability engineer to help improve, administrate, and grow Observable systems as we scale to meet our customer's needs.What you will doPerform site reliability and ops work for Observable production and staging environments. (Manage servers Tweak WAF rules Optimize SQL queries And more)Design and...

Americas

Europe

Asia / Oceania

Africa

Principal Site Reliability Engineer