Site Reliability Engineer
1 month ago
Site Reliability Engineer - 100% Remote
Role Summary:
Site Reliability Engineers (SREs) work with different developer teams to keep our systems running smoothly. They are a blend of pragmatic operators and software craftspeople who apply excellent problem-solving and communication skills to develop or configure tools that automate, monitor, and alert on the reliability of internal systems.
What you will be doing:
- Join the on-call rotation to respond to LeadIQ availability incidents and support developers with customer incidents.
- Use your on-call shift to prevent incidents from happening. Step in, either actively or in support of the engineers, when they do happen.
- Run our infrastructure with AWS, Terraform, and Kubernetes (EKS).
- Think about systems - edge cases, failure modes, behaviors, specific implementations.
- Set up monitoring that alerts on symptoms rather than on outages.
- Document every action, so your findings turn into repeatable actions, and then into automation.
- Improve the deployment process to make it as boring as possible.
- Design, build, and maintain core infrastructure pieces that allow LeadIQ to scale to support hundreds of thousands of concurrent users.
- Debug production issues across services and levels of the stack.
- Plan the growth of LeadIQ infrastructure.
- Support engineering teams in defining and building SLIs and SLOs.
The Requirements:
- 4+ years working with Terraform and AWS
- 2+ years working with:
  - GitLab (or similar) as CI tool
  - Datadog (or similar) as alerting tool
  - Kubernetes
- Know your way around Linux and the Unix Shell.
- Programming skills in Node.js and/or Go
Nice to Haves:
- Experience with the tech stack: Nginx, Docker, Kubernetes, Terraform, Terragrunt, AWS, GitLab, Helm, ArgoCD, Datadog, or similar technologies
- AWS, Terraform, Kubernetes certifications
-
Redwood City, United States C3 AI Full time. We are looking for an Associate Site Reliability Engineer/Site Reliability Engineer to join our team at our HQ in Redwood City, CA. Responsibilities: Maximize system uptime and availability, ensuring functional and performance SLAs. Establish end-to-end monitoring and alerting on all critical aspects. Solve complex problems for critical services and build...
-
Redwood City, CA, United States C3 AI Full time. We are looking for an Associate Site Reliability Engineer / Site Reliability Engineer to join our team at our HQ in Redwood City, CA. Responsibilities: Maximize system uptime and availability, ensuring functional and performance SLAs. Establish end-to-end monitoring and alerting on all critical aspects. Solve complex problems for critical services...
-
Site Reliability Engineer
4 weeks ago
Jersey City, United States Syntricate Technologies Full time. Job Title: Site Reliability Engineer (AWS) (SRE). Location: Jersey City, NJ (3 days WFO, 2 days WFH). Duration: 6+ Months. Position Responsibilities: Site Reliability Engineer (AWS) (SRE). Work Location: Jersey City, New Jersey. Only nearby candidates will be considered (3 days WFO, 2 days WFH). 1 Zoom / tech interview and 1 onsite interview with...
-
Site Reliability Engineering Manager
2 weeks ago
Foster City, United States Zoox Full time. Zoox is looking for a Site Reliability Engineering Manager who will be responsible for leading and growing Zoox's Core Site Reliability Engineering team, ensuring the reliability, scalability, and performance of our critical infrastructure, cloud platform, and core services that power company-wide software engineering operations. Zoox is a robotics company...
-
Site Reliability Engineer
7 months ago
Foster City, United States Zoox Full time. Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...
-
Redwood City, California, United States Zilliz Full time. About Zilliz: Zilliz is a pioneering company that specializes in developing next-generation vector database technologies to empower organizations in creating AI applications. As a fast-growing startup, we are dedicated to simplifying data management for AI and making vector databases accessible to every organization. Job Description: We are seeking an experienced...
-
Principal Site Reliability Engineer
1 month ago
Jersey City, United States Fidelity Investments Full time. Job Description: The Role: As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments...
-
Sr Site Reliability Engineer
4 weeks ago
Salt Lake City, United States Federal Reserve Bank of San Francisco Full time. Job Description: While the SF Fed is a Reserve Bank, we're not what you might expect. We're unreserved here. That means we seek new and diverse perspectives. We spark conversations and encourage debate. We build opportunity. We pursue careers that are true to ourselves. We are looking for people who want to help...
-
Senior Site Reliability Leader
5 days ago
Kansas City, Missouri, United States Granicus Full time. About the Role: We are seeking an exceptional Senior Site Reliability Leader to join our team at Granicus. As a key member of our organization, you will play a vital role in shaping the future of our company. With a focus on cloud infrastructure engineering, you will be responsible for designing and implementing scalable solutions that meet the needs of our...
-
Senior Director
4 weeks ago
Foster City, United States Visa Full time. Company Description: Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...
-
Oracle: Principal Site Reliability Engineer
4 weeks ago
Jersey City, NJ, United States Fidelity Investments Full time. As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time helping enable the expansion of our business. ...
-
Aumni - Site Reliability Engineer III - MLOPS
7 months ago
Salt Lake City, United States JPMorgan Chase & Co. Full time. There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets / Aumni (a JPMorgan Chase company), you will solve...
-
Electrical Reliability Engineer
4 weeks ago
Jersey City, United States Ben Aris Full time. About the job: Electrical Reliability Engineer (Hybrid-Remote - Any Location - USA) - Regional Electrical Reliability Engineer. About this Role and About You: The Electrical Reliability Engineer establishes and maintains standards of Best Practice for manufacturing site maintenance functions across North American operations. Best Practice standards reflect...
-
Civil Engineer
3 weeks ago
Redwood City, California, United States BKF Engineers Full time. Job Overview: BKF Engineers is seeking a highly skilled Civil Engineer to join our team in Redwood City or San Francisco. As a key member of our site development team, you will have the opportunity to work on a variety of projects and contribute to the success of our company. Compensation: The estimated salary for this position is $93,375 per year, paid biweekly....
-
Senior Site Reliability Engineer
3 weeks ago
Foster City, CA, United States Zoox Full time. Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...
-
Senior Reliability Engineer
2 weeks ago
Calvert City, United States Wacker Chemical Corporation Full time. Date: Dec 9, 2024. Location: Calvert City. Company: Wacker Chemical Corporation ...
-
Facilities Reliability Engineer
4 weeks ago
Evans City, Pennsylvania, United States Ascensus Specialties Full time. Ascensus Specialties is seeking a highly skilled Facilities Reliability Engineer to join our team. The estimated salary for this position is $120,000 - $160,000 per year, based on industry standards and the company's location. About the Role: The Facilities Reliability Engineer will be responsible for ensuring the sustained performance of infrastructure through...
-
Staff/Senior Staff Site Reliability Engineer
3 weeks ago
Foster City, CA, United States Zoox Full time. Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from...
-
Redwood City, California, United States BKF Engineers Full time. Job Overview: BKF Engineers is seeking a skilled Design Engineer to join our team in Redwood City. This exciting opportunity involves working on various projects, including residential, commercial, and mixed-use developments. Estimated Salary Range: The estimated annual salary for this position is $80,000 - $101,500, depending on skills, experience, education,...
-
Senior Civil Engineer
4 weeks ago
Redwood City, California, United States BKF Engineers Full time. Opportunity Overview: We are seeking a highly skilled Senior Civil Engineer to join our team as a Site Development Specialist. This role will involve working on various site development projects, including residential, commercial, and mixed-use developments.