We have other current jobs related to this field that you can find below

Sr. Site Reliability Engineer

2 months ago

Santa Clara, United States TCWGlobal Full time

Sr. SRE EngineerW2 Contract to Possible HireHybrid, Santa Clara, CA$75-90/hr + PTO, Paid Holidays, Benefits We are looking for a seasoned SRE to join our multifaceted and fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains...
Sr. Site Reliability Engineer

2 months ago

Santa Clara, United States TCWGlobal Full time

Sr. SRE EngineerW2 Contract to Possible HireHybrid, Santa Clara, CA$75-90/hr + PTO, Paid Holidays, Benefits We are looking for a seasoned SRE to join our multifaceted and fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops and maintains...
Site Reliability Engineer

1 week ago

Santa Clara, United States Veear Full time

Position: Site Reliability Engineer Location: Remote role Duration: 12+ Months Contract with possible extension Job Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all...
Sr Principal Site Reliability Engineer

1 week ago

Santa Clara, United States Palo Alto Networks Full time

Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking...
Site Reliability Engineer

2 weeks ago

Santa Clara, United States VeeAR Projects Inc. Full time

Position: Site Reliability EngineerLocation: Remote roleDuration: 12+ Months Contract with possible extensionJob Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all facets...
Site Reliability Engineer

2 weeks ago

Santa Clara, United States VeeAR Projects Inc. Full time

Position: Site Reliability EngineerLocation: Remote roleDuration: 12+ Months Contract with possible extensionJob Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all facets...
Site Reliability Engineer

1 month ago

Santa Clara, United States NVIDIA Full time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...
Cloud Site Reliability Engineer

1 month ago

Santa Clara, United States Centrify Corporation Full time

Our software runs on public clouds with 99.9% or better uptime and is mission critical for our customers. Our cloud operations team is where the rubber meets the road and needs innovative Site Reliability Engineers. Join a professional team of smart and hard-working professionals building enterprise-class cloud-based services in the rapidly growing market of...
Sr Site Reliability Engineer

1 week ago

Santa Clara, United States Palo Alto Networks Full time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done,...
Principal Site Reliability Engineer

3 months ago

Santa Clara, United States Kofi Group Full time

To Apply for this Job Click HerePrincipal Site Reliability EngineerSan Francisco Bay Area, CAWe are partnering with a late-stage Cloud Security company that is looking for a Principal Level SRE The ideal candidate will have:Strong sense of architecture and design for fault tolerance, scale-out approaches, and stability Deep experience in building tools...
Site Reliability Engineering Manager

3 days ago

Santa Clara, California, United States Promote Project Full time

About Promote Project: Promote Project is a leader in innovative technology solutions, dedicated to pushing the boundaries of what is possible in the realm of artificial intelligence and cloud computing. Our commitment to excellence is reflected in our talented workforce and our pursuit of groundbreaking advancements.Position Overview: We are seeking a...
Site Reliability Engineering Manager

3 days ago

Santa Clara, California, United States Promote Project Full time

About the Company: Promote Project is at the forefront of innovation, leveraging cutting-edge technology to redefine the landscape of AI and computing. Our mission is to harness the power of advanced computing to create transformative solutions that impact various industries.Position Overview: We are seeking a Manager of Site Reliability Engineering to...
Senior Site Reliability Engineer

4 days ago

Santa Clara, California, United States ServiceNow Full time

Company OverviewAt ServiceNow, we harness technology to create a better world for everyone, driven by our talented workforce. We prioritize speed and innovation to meet the demands of our customers and communities.Joining ServiceNow means becoming part of a dynamic team of innovators who possess a relentless curiosity and a commitment to creativity.We...
Senior Site Reliability Engineer

4 days ago

Santa Clara, California, United States ServiceNow Full time

Company OverviewAt ServiceNow, we harness technology to enhance global operations, and our dedicated workforce makes it all possible. We operate swiftly because the world demands it, innovating uniquely for our clients and communities.By becoming part of ServiceNow, you join a dynamic team of innovators who possess a relentless curiosity and a passion for...
Principal Site Reliability Engineer

2 days ago

Santa Clara, United States Palo Alto Networks Full time

Principal Site Reliability Engineer (SASE) Full-time Job Country: United States of America To comply with U.S. federal government requirements, U.S. citizenship is required for this position. Our Mission At Palo Alto Networks, everything starts and ends with our mission: being the cybersecurity partner of choice, protecting our digital way of life. Our...
Site Reliability Engineering Manager

3 days ago

Santa Clara, California, United States Promote Project Full time

About the Company: Promote Project is at the forefront of innovation, focusing on redefining technology and enhancing the capabilities of AI. We are dedicated to creating groundbreaking solutions that push the boundaries of what is possible in computing.Position Overview: We are seeking a Manager for Site Reliability Engineering to spearhead our cloud...
Principal Site Reliability Engineer SASE

3 days ago

Santa Clara, United States Palo Alto Networks Full time

Job Description Your Career The Global Customer Operation Team is responsible for building products that protect data, workloads, and infrastructure for some of the largest enterprise customers in the world. We help our customers in their journey to the public cloud by ensuring they have the best in class protection. The public cloud market has been...
Senior Site Reliability Engineer

2 months ago

Santa Clara, United States Nvidia Full time

Senior Site Reliability Engineer - StoragelocationsUS, CA, Santa Claratime typeFull timejob requisition idJR1979072NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and...
Senior Site Reliability Engineer

1 month ago

Santa Clara, California, United States Nvidia Full time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables unique creativity and discovery, and powers what were...
Principal Site Reliability Engineer

1 week ago

Santa Clara, United States Palo Alto Networks Full time

Company DescriptionTo comply with U.S. federal government requirements, U.S. citizenship is required for this positionOur Mission At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before....

Sr. Site Reliability Engineer

2 months ago

Santa Barbara, United States AppFolio Full time

We are hiring a Senior Site Reliability Engineer to run and evolve AppFolio Investment Manager's ecosystem of services. This is an ideal opportunity for someone with a desire to help own/maintain as well as 'teach to fish' fully 'shifted left' development teams and a passion for building reliable yet simple systems. This position, as with all members of Investment Manager R&D, may require on-call responsibilities. Your Impact

You'll be a key member of the team that provides reliable, scalable infrastructure for key components of AppFolio Investment Manager. You'll help build the future of reliable, critical services, and support the rapid and sustainable growth of new features. You'll lead the effort to build, deploy, and help maintain the cloud infrastructure that powers AppFolio Investment Manager, as well as collaborate with R&D engineering teams. You'll help build product-specific infrastructure as well as help improve the reliability and quality of their services. Along with your team, you'll ensure all aspects of the IM product have a plan to address any shortcomings for exception reporting, capacity planning, monitoring and alerting, backups, runbooks, configuration management, DDoS protection, infrastructure as code, and disaster recovery. You'll collaborate with engineering teams, helping them to improve the reliability and quality of their services and infrastructure. You'll be the 'domain expert' of all Investment Manager infrastructure, while leveraging the wider shared knowledge and assistance of the overall SRE and Infrastructure Engineering group. Qualifications

Proven ability to diagnose and monitor performance and reliability issues across the stack: relational databases, web servers, networking, OS, containers, load balancers, etc. You'll chase down performance problems and uncover the root causes of system failures. Strong coding background: you've written code to perform critical tasks or in production. The exact language doesn't matter, though we give bonus points for Go, Ruby, or Python. You are able to create and maintain container images. Mastery experience with some areas of our tech like Ruby on Rails, Kubernetes, MySQL, Linux, container orchestration, Networking, etc. You have strong communication skills and enjoy working on a team that values openness, integrity, ownership, and attention to detail Must-Haves

5+ years of hands-on experience running production, highly available, distributed, and cloud-based services, preferably in a SaaS environment. Experience developing Service Level Indicators and Service Level Objectives for the above systems. Experience with Docker/Container technologies. Experience with Amazon Web Services (commonly EKS, RDS Aurora, Lambda, S3, EBS, Route53, DynamoDB, and VPCs) Expertise with Infrastructure as Code (Terraform, CloudFormation, Pulumi, etc.) Familiarity with Kubernetes or other container orchestration tooling. Bachelor's degree and at least 5 years of industry experience

#J-18808-Ljbffr

Americas

Europe

Asia / Oceania

Africa

We have other current jobs related to this field that you can find below

Sr. Site Reliability Engineer