Site Reliability Engineer
1 week ago
The ideal candidate should have experience in designing highly scalable infrastructure, building systems, and performing testing, monitoring, and maintenance.Our backend stack includes Python, PostgreSQL, DynamoDB, Redis, and Kubernetes - built alongside dedicated in-house AI and search interfaces.
Responsibilities
- Design and implement highly available, high-performance, and scalable systems.
- Maintain and optimize key-value and relational databases.
- Scale and load balance web server backends to meet rapidly changing needs.
- Participate in on-call rotation. Monitor systems and applications, proactively identifying and resolving reliability, scalability, or performance issues.
- Develop monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
- Collaborate with product engineering and security engineering teams to develop scalable automations.
- Strong experience with cloud infrastructure built on AWS.
- Proficient in database management and caching strategies.
- Excellent problem-solving and troubleshooting skills, with the ability to analyze, debug, and resolve complex technical issues.
- Experience working with containers (Docker, Kubernetes) and orchestration tools.
- Excellent communication and collaboration skills.
- Experience with Python and Terraform.
- 4+ years of SRE experience.
The cash compensation range for this role is $200,000 to $240,000.
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.
To support our rapid expansion, we've raised significant funding from some of the most respected investors in technology. In January 2024, we raised $73.6 million in a Series B round led by IVP, with participation from NVIDIA, Jeff Bezos' investment fund, NEA, Databricks, and other prominent firms. We followed that up with a $62.7 million Series B1 round in April 2024 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.
Equity: In addition to the base salary, equity is part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.
-
Site Reliability Engineer
1 week ago
San Francisco, United States Apollo Solutions Full timeSite Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...
-
Site Reliability Engineer
3 weeks ago
San Francisco, United States WEX Full timeThe WEX Site Reliability Engineering (SRE) team is seeking an entry-level Site Reliability Engineer Level 1 who is passionate about learning and growing in the field of software development and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits...
-
Site Reliability Engineer
3 weeks ago
San Francisco, United States Ellation, Inc. Full timeWho We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...
-
Site Reliability Engineer
3 weeks ago
San Francisco, United States Ellation, Inc. Full timeWho We AreWe‘re a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...
-
Site Reliability Engineer
1 week ago
San Francisco, United States Unreal Gigs Full timeAre you passionate about building and maintaining resilient systems that ensure high availability and performance? Do you excel at automating processes, troubleshooting complex issues, and creating systems that scale smoothly? If you're ready to take on the challenge of ensuring reliable, efficient, and secure system operations, our client has the perfect...
-
Site Reliability Engineer
1 month ago
san francisco, United States New York Technology Partners Full timeMust Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
-
Site Reliability Engineer
1 month ago
San Francisco, United States New York Technology Partners Full timeMust Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
-
Site Reliability Engineer
1 week ago
San Francisco, United States New York Technology Partners Full timeMust Have's in the order of preference.Typical Java/J2EE experience between 6 and 10 yearsApplication Production Support(SRE - Site Reliability Engineering) with 3+ years - Preferably in e-commerce domainHands-on experience in any of the UI Frameworks(AngularJS, VueJS etc) - 1+ years
-
Site Reliability Engineer
1 week ago
San Francisco, California, United States WEX Inc Full timeThe WEX Site Reliability Engineering team is looking for a motivated Site Reliability Engineer to join our Benefits Reliability organization. As a member of our team, you will be responsible for ensuring the reliability, performance, and security of our systems.Key Responsibilities:Learning and Development: Participate in training and mentorship programs to...
-
Site Reliability Engineer
4 weeks ago
San Francisco, United States Focal Systems Full timeLocation: San Francisco - hybrid (1-2 days per week)Salary: $165-175k + stock Company Description Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar...
-
Senior Site Reliability Engineer
3 weeks ago
San Francisco, United States Focal Systems Full timeLocation: San Francisco - hybrid (1-2 days per week)Salary: $170-190k + stockCompany DescriptionFocal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. We are a Deep Learning first company. Our mission is to automate and optimize brick and mortar retail...
-
Site Reliability Engineer
2 weeks ago
San Jose, United States EVONA Full timeSite Reliability Engineer (SRE)Location: San Francisco Bay AreaRole Overview:We are seeking a highly skilled Site Reliability Engineer (SRE) to join a dynamic team at a rapidly growing technology company. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical systems, while implementing automation...
-
Senior Staff Site Reliability Engineer
4 weeks ago
San Francisco, United States WEX, Inc. Full timeAbout the RoleThe WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal...
-
Senior Staff Site Reliability Engineer
3 weeks ago
San Francisco, United States WEX Full timeAbout the Role The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal...
-
Lead DevOps/Site Reliability Engineer
1 day ago
San Francisco, United States Indotronix International Corporation Full timePay Rate:- W2 Rate $ 61.75 Looking in PST time zone, preferred to be local to SF and willing to go into office occasionally, but okay with Remote (needs to hive high work ethic!) Lead DevOps/Site Reliability Enginee Looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers,...
-
Lead DevOps/Site Reliability Engineer
1 week ago
San Francisco, United States Indotronix International Corporation Full timePay Rate:- W2 Rate $ 61.75 Looking in PST time zone, preferred to be local to SF and willing to go into office occasionally, but okay with Remote (needs to hive high work ethic!) Lead DevOps/Site Reliability Enginee Looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers,...
-
Site Reliability Engineer II
19 hours ago
San Francisco, CA, United States Earnest Current Job Openings Full timeThe Site Reliability Engineer II position will report to the Lead Cloud Engineer. As an SRE II Engineer, you will: Set up and maintain comprehensive monitoring, create and refine playbooks, build dashboards, and adopt industry-standard practices to enhance the reliability and resilience of our site and systems. Develop and manage IaC to ensure reliable,...
-
Sr Site Reliability Engineer
2 weeks ago
San Francisco, United States Federal Reserve Bank of San Francisco Full timeCompany: Federal Reserve Bank of San FranciscoJob Description:While the SF Fed is a Reserve Bank, we're not what you might expect. We're unreserved here. That means we seek new and diverse perspectives. We spark conversations and encourage debate. We build opportunity. We pursue careers that are true to ourselves. We are looking for people who want to help...
-
Site Reliability Engineer, Security
4 weeks ago
San Francisco, United States Okta Full timeOktaOkta's Workforce and Customer Identity Clouds enable secure access, authentication, and automation—putting identity at the heart of business security and growth.Get to know OktaOkta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure...
-
Staff Site Reliability Engineer
4 weeks ago
San Francisco, United States Ellation, Inc. Full timeWho We Are We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection...