Senior Site Reliability Engineer

3 weeks ago


Southfield, United States NetApp Full time

Title: Senior Site Reliability Engineer

Location:

Bangalore, Karnataka, IN, 560071

Requisition ID: 126263

About NetApp

We’re forward-thinking technology people with heart. We make our own rules, drive our own opportunities, and try to approach every challenge with fresh eyes. Of course, we can’t do it alone. We know when to ask for help, collaborate with others, and partner with smart people. We embrace diversity and openness because it’s in our DNA. We push limits and reward great ideas. What is your great idea?

"At NetApp, we fully embrace and advance a diverse, inclusive global workforce with a culture of belonging that leverages the backgrounds and perspectives of all employees, customers, partners, and communities to foster a higher performing organization." -George Kurian, CEO

Job Summary

As a Cloud Infrastructure/Site Reliability Engineer, you will be operating at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of cloud services - from design through deployment, operation, and refinement. You will be responsible for maintaining these services by measuring and monitoring their availability, latency, and overall system health. 
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings, which are implemented on a microservices, container-based architecture (Kubernetes).
In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with both NetApp and cloud service provider teams, including those from Google, located across the globe in regions such as RTP, Reykjavík, Bangalore, Sunnyvale, Redmond, and more.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.

Job Requirements

• Incident Response and Troubleshooting: Address and perform root cause analysis (RCA) of complex live production incidents and cross-platform issues involving OS, Networking, and Database in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
• Monitoring, Analysis, and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, Elasticsearch, Grafana, and SolarWinds. Develop strategies to enhance system and application performance, availability, and reliability. In addition, maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
• Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
• Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
• Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
• Issue Tracking and Resolution: Use Atlassian Jira, Google Buganizer, and Google IRM to track and resolve issues based on their priority.
• Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
• Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and 3 support for NetApp's Cloud Data Services solutions.
• Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.
• Proficiency in Linux/Unix and CoreOS.
• Demonstrated experience in scripting and infrastructure automation using tools such as Ansible, Python, Go or Ruby.
• Deep working knowledge of Containers, Kubernetes, and Serverless computing implementation.
• Experience with DevOps development methodologies.
• Familiarity with distributed systems design patterns using tools such as Kubernetes.
• Experience with cloud platforms such as AWS, Azure, or Google Cloud.

Education

8 to 12 years of experience is required.

A Bachelor of Science degree in Computer Science, a master’s degree, or equivalent experience is required.

Did you know…
Statistics show women apply to jobs only when they’re 100% qualified. But no one is 100% qualified. We encourage you to shift the trend and apply anyway. We look forward to hearing from you.

Why NetApp?

In a world full of generalists, NetApp is a specialist. No one knows how to elevate the world’s biggest clouds like NetApp. We are data-driven and empowered to innovate. Trust, integrity, and teamwork all combine to make a difference for our customers, partners, and communities. 

We expect a healthy work-life balance. Our volunteer time off program is best in class, offering employees 40 hours of paid time off per year to volunteer with their favorite organizations. We provide comprehensive medical, dental, wellness, and vision plans for you and your family. We offer educational assistance, legal services, and access to discounts. We also offer financial savings programs to help you plan for your future.

If you run toward knowledge and problem-solving, join us. 


Job Segment: Cloud, Linux, Unix, Software Engineer, Computer Science, Technology, Engineering

