Current jobs related to Senior Site Reliability Engineer - Irvine - NetApp


  • Irvine, California, United States Weedmaps Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-native applications. Key Responsibilities: Collaborate with cross-functional teams to design, implement, and...


  • Irvine, California, United States Weedmaps Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Weedmaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based services. Key Responsibilities: Leverage your expertise in cloud-native technologies to design, implement,...


  • Irvine, California, United States TP-Link North America, Inc. Full time

    About the Role: We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at TP-Link North America, Inc. This individual will play a crucial role in ensuring the security, reliability, scalability, and operational excellence of our cloud platform. Key Responsibilities: Serve as a technical SME for implementing and operating...


  • Irvine, California, United States Fortive Corporation Full time

    Job Summary: The Senior Reliability Engineer will be responsible for ensuring the reliability of Terminal Sterilization and High Level Disinfection equipment designed by ASP. This position will oversee the execution of protocols, compilation of data, and preparation of final test reports. The successful candidate will also be responsible for performing root...


  • Irvine, California, United States TP-Link Systems Inc. Full time

    Job Description: TP-Link Systems Inc. is seeking a highly skilled Senior Cloud Reliability Engineer to join our team in Irvine, California. As a key member of our R&D Center, you will play a crucial role in ensuring the security, reliability, scalability, and operational excellence of our cloud platform. Responsibilities: Implement and operate Microservices on...

  • Senior Engineer

    3 weeks ago


    Irvine, California, United States The ERM International Group Limited Full time

    Senior Engineer - Contaminated Site Management: We are seeking a highly skilled Senior Engineer to join our Contaminated Site Management technical team in Irvine, California. As a Senior Engineer, you will provide project management and technical assistance on site investigation, remediation, and environmental construction projects for clients locally and...


  • Irvine, California, United States Apple Full time

    Role Overview: Apple is seeking a highly skilled Live Media Encoding Site Reliability Engineer to join our team. As a key member of our media production team, you will be responsible for the technical operation and management of live HW/SW media encoding infrastructure present in event-site mobile data centers, flypacks, and operations centers. Your expertise...


  • Irvine, California, United States Apple Full time

    Job Title: Live Media Encoding Site Reliability Engineer. At Apple, we're looking for a highly skilled Live Media Encoding Site Reliability Engineer to join our team. As a key member of our media production team, you will be responsible for the technical operation and management of live HW/SW media encoding infrastructure present in event-site mobile data...


  • Irvine, California, United States Apple Full time

    Job Title: Live Media Encoding Site Reliability Engineer. We are seeking a highly skilled Live Media Encoding Site Reliability Engineer to join our team at Apple. As a key member of our media encoding infrastructure team, you will be responsible for the technical operation and management of live HW/SW media encoding infrastructure present in event-site mobile...


  • Irvine, California, United States Apple Full time

    Job Summary: We are seeking a highly skilled Live Media Encoding Site Reliability Engineer to join our team at Apple. As a key member of our technical team, you will be responsible for the technical operation and management of live HW/SW media encoding infrastructure present in event-site mobile data centers, flypacks, and operations centers. Key...

  • Reliability Engineer

    4 weeks ago


    Irvine, California, United States Fortive Corporation Full time

    Job Summary: We are seeking a highly skilled Reliability Engineer to join our team at Fortive Corporation. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our capital equipment. Key Responsibilities: Design and develop automated and semi-automated test systems for capital equipment. Develop and...

  • Reliability Engineer

    4 weeks ago


    Irvine, California, United States Fortive Corporation Full time

    Job Summary: Fortive Corporation is seeking a highly skilled Sr. Reliability Engineer to join our team. As a key member of our reliability engineering team, you will be responsible for ensuring the reliability and performance of our capital equipment. Key Responsibilities: Design and develop automated and semi-automated design characterization tests and...

  • Reliability Engineer

    4 weeks ago


    Irvine, California, United States Fortive Corporation Full time

    Job Summary: The Sr. Reliability Engineer will be responsible for ensuring the reliability of Terminal Sterilization and High Level Disinfection equipment designed by ASP. This position will execute protocols, compile data, and create final test reports. The Sr. Reliability Engineer will also perform root cause analysis on issues observed during protocols or...


  • Irvine, California, United States Rivian Full time

    About Rivian: Rivian is a pioneering electric vehicle manufacturer that's revolutionizing the automotive industry. Our mission is to create a world where adventure and sustainability go hand-in-hand. We're a company that's constantly pushing the boundaries of what's possible, and we're looking for talented individuals to join our team. Role Summary: Rivian's...


  • Irvine, California, United States SysMind Tech Full time

    Job Title: Reliability Test Engineer. At SysMind Tech, we are seeking a highly skilled Reliability Test Engineer to join our team. As a key member of our engineering team, you will be responsible for designing and executing test setups, analyzing results, and troubleshooting electromechanical systems. Responsibilities: Design and develop test fixtures and...


  • Irvine, California, United States First Tek Full time

    Job Summary: The RMS Engineer provides technical expertise and leadership in an engineering discipline, product, or systems field. This role requires research and execution of complex engineering assignments that result in the introduction of new technologies, products, or processes. Key Responsibilities: Conduct investigations to assess and optimize designs for...


  • Irvine, California, United States Masimo Full time

    Job Description: At Masimo, we are seeking a highly skilled Product Assurance and Reliability Engineer II to join our team. This multifaceted role requires a strong understanding of basic science and engineering principles, as well as experience in product assurance and reliability engineering. Key Responsibilities: Contribute to new medical microelectronics...


  • Irvine, United States DSJ Global Full time

    A global Consumer Products client of mine is hiring for a Senior Packaging Engineer based out of their site in Orange County, CA! The Senior Packaging Engineer will be responsible for: creating packaging SOPs; generating engineering change orders and packaging-related requirements; leading packaging development and improvement projects; working cross-functionally with...


  • Irvine, California, United States Masimo Full time

    Job Description: The Product Assurance and Reliability Engineer position at Masimo is a multifaceted role that requires a good understanding of basic science as well as engineering. An individual in this role will be part of the Product Assurance department working to investigate, test, and provide expertise in the effort to improve the quality and reliability...


  • Irvine, California, United States Rivian Full time

    About Rivian: Rivian is a pioneering company that's revolutionizing the automotive industry with its innovative electric adventure vehicles. Our mission is to keep the world adventurous forever, and we're committed to making a positive impact on the environment. Role Summary: We're seeking a highly skilled Thermal Field Reliability Engineer to join our team. As a...

Senior Site Reliability Engineer

2 months ago


Irvine, United States NetApp Full time

Title: Senior Site Reliability Engineer

Location:

Bangalore, Karnataka, IN, 560071

Requisition ID: 126263

Job Summary

As a Cloud Infrastructure/Site Reliability Engineer, you will operate at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of cloud services - from design through deployment, operation, and refinement. You will maintain these services by measuring and monitoring their availability, latency, and overall system health. 
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings, implemented on a microservices, container-based architecture (Kubernetes).
In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with both NetApp and cloud service provider teams, including those from Google, located in regions across the globe.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.

Job Requirements

Incident Response and Troubleshooting: Address and perform root cause analysis (RCA) of complex live production incidents and cross-platform issues involving OS, networking, and databases in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
Monitoring, Analysis, and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, Elasticsearch, Grafana, and SolarWinds. Develop strategies to enhance system and application performance, availability, and reliability. In addition, maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure. Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
Issue Tracking and Resolution: Use Atlassian Jira, Google Buganizer, and Google IRM to track and resolve issues based on their priority.
Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and tier 3 support for NetApp's Cloud Data Services solutions.
Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.
Proficiency in Linux/Unix and CoreOS.
Demonstrated experience in scripting and infrastructure automation using tools such as Ansible, Python, Go, or Ruby.
Deep working knowledge of containers, Kubernetes, and serverless computing implementation.

Education

8 to 12 years of experience is required. A Bachelor of Science degree in Computer Science, a master's degree, or equivalent experience is required.


Job Segment: Cloud, Software Engineer, Linux, Unix, Computer Science, Technology, Engineering