Lead Site Reliability Engineer

1 month ago


San Jose, United States VDart Inc Full time
Job DescriptionJob Description

Job Title: Lead Site Reliability Engineer

Location: San Jose, CA (2 Days Hybrid)

Duration: / Term: 6+ months

Job Description:

Experience Desired: 14+ Years.

Responsibilities:

Please look for 14 years hands on Coding/scripting (Ansible) , Python , Cloud Computing

About the Role

We seek a highly skilled and dynamic Site Reliability Engineer Consultant In this role you will

Maintain and improve the reliability, performance, and availability of software systems.

Act as a bridge between traditional IT operations and software development, bringing a software engineering approach to system administration.

Job Responsibilities

Creating and supporting automation scripts (shell/ansible/python) for infrastructure deployments, validations and monitoring to improve operational tasks

Scheduling monitoring scripts using cron and airlfow

Monitoring using tools including Dynatrace, Apica, Grafana etc

Database handling

Build CICD pipelines

Incident handling and problem management

Mandatory Skills

Experience in Ansible/ Python

Monitoring Tools Dynatrace/Apica/Grafana

Required Education Bachelor's degree in computer science or a related field.

Required Experience

14 plus years of IT Infrastructure experience

Extensive experience working with linux flavors like rhel/centos os, shells, filesystems and utilities

Experience in programming languages like Python, ansible

Knowledge of distributed computing and experience working with container orchestration frameworks including on-prem and rancher kubernetes and good knowledge on kubernetes objects

Experience working with Storage, ONTAP is preferable: volume, aggregates, back ups, DR planning

Experience scheduling monitoring scripts using cron and airlfow

Experience with monitoring tools including Dynatrace, Apica, Grafana etc

Database knowledge including sql and nosql dbs

Experience building CICD pipelines (preferred)

Cloud platform knowledge (specifically AWS) is required

Key Skills:

SRE, AWS, Python, Monitoring Tools Dynatrace/Apica/Grafana



  • San Jose, California, United States Zscaler Full time

    About ZscalerAt Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as a leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...


  • San Jose, California, United States Zscaler Full time

    About ZscalerAt Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as the leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...


  • San Jose, California, United States Zscaler Full time

    About UsZscaler has developed the world's largest cloud security platform, continually innovating and expanding our services. With a robust portfolio of over 100 patents and ambitious plans for global growth, our team has established itself as a leader in cloud security, serving more than 15 million users across 185 countries. We are looking for talented...


  • San Jose, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe’s Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Jose, California, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe's Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Jose, United States Trianz Full time

    Job Description Role: Site Reliability Engineer Employment Type: Contract – Only VISA FREE Work location: Sanjose, CA Work mode: Onsite- 2 days in a week / 3 days Remote About the Role We seek a highly skilled and dynamic Site Reliability Engineer – Consultant. In this role you will: Maintain and improve the reliability, performance, and availability of...


  • San Jose, California, United States Hireio, Inc. Full time

    Exciting Opportunity: Data Infrastructure Site Reliability Engineering (SRE) TeamJoin Hireio, Inc., a premier platform for short-form mobile video hosting services. As a trailblazer in technology, our SRE team integrates software development with infrastructure management to architect, construct, and oversee extensive, highly distributed systems. We operate...


  • San Jose, United States F5 Full time

    F 5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F 5 Distributed Cloud Product. Due to the nature of work this role requires US Citizenship. Primary Responsibil Reliability Engineer, Liability, Engineer, Reliability, Reliability, Technology, Support


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security platform provider, offering a comprehensive suite of solutions to protect businesses from cyber threats. Our team of experts has built a robust platform that enables organizations to harness the power of the cloud while ensuring the security and integrity of their data.Job SummaryWe are seeking an experienced...


  • San Jose, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Jose, California, United States VDart Inc Full time

    Job OverviewPosition: Lead Site Reliability EngineerLocation: San Jose, CA (Hybrid Work Model)Contract Duration: 6+ monthsExperience Required: 14+ YearsRole Summary:We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this pivotal role, you will be responsible for:Key Responsibilities:Enhancing the reliability,...


  • San Jose, California, United States VDart Inc Full time

    Job OverviewPosition: Lead Site Reliability EngineerLocation: San Jose, CA (Hybrid Work Model)Contract Duration: 6+ monthsExperience Required: 14+ YearsRole Summary:We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this capacity, you will be responsible for:Key Responsibilities:Enhancing the reliability,...


  • San Jose, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Jose, California, United States Hireio, Inc. Full time

    About Hireio, Inc.Hireio, Inc. stands at the forefront of the mobile video landscape, recognized as a premier platform for short-form video content. As a leading Unicorn startup, we have achieved remarkable milestones, including over 1.3 billion mobile downloads in the United States and 2 billion globally. With a robust user base of 1.5 billion monthly...


  • San Jose, California, United States Western Digital Full time

    Job OverviewCompany Overview:At Western Digital, we are dedicated to enhancing the way you store and manage data, whether it’s in your pocket, home, car, or the cloud. Our Advanced Reliability Engineering (ARE) team is committed to pioneering reliability assurance methodologies that set industry standards and encompass the entire product lifecycle for our...


  • San Diego, California, United States Dexcom Full time

    About Dexcom:Founded in 1999, Dexcom, Inc. (NASDAQ: DXCM) is a pioneer in the development and marketing of Continuous Glucose Monitoring (CGM) systems designed for use by individuals with diabetes and healthcare professionals. As a leader in the transformation of diabetes management, Dexcom is committed to providing innovative CGM technology that empowers...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our platform empowers developers to streamline their workflows, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, United States Saxon Global Full time

    Lead DevOps/Site Reliability Enginee Looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers, Kubernetes automation Mostly focused on the automation, current pain points around deployments reliability around their data engineering processes. SRE who can go beyond the memory, what...


  • San Jose, United States TCWGlobal Full time

    Site Reliability Engineer (Kubernetes)*US citizenship or Greencard holder- W2 ContractSan Jose, CA 95134 ( LOCAL CANDIDATES ONLY- MUST BE LIVING IN SAN JOSE, CA)$80-110hr (Weekly pay + benefits)6 month contract (Excellent potential for extension)Full-time: M-F 8am-5pm (Onsite 2 days a week)***Please note: This role is only accepting candidates that currently...


  • San Jose, United States TCWGlobal Full time

    Job DescriptionJob DescriptionSite Reliability Engineer (Kubernetes)*US citizenship or Greencard holder- W2 ContractSan Jose, CA 95134 ( LOCAL CANDIDATES ONLY- MUST BE LIVING IN SAN JOSE, CA)$80-110hr (Weekly pay + benefits)6 month contract (Excellent potential for extension)Full-time: M-F 8am-5pm (Onsite 2 days a week)***Please note:This role is only...