Reliability Manager

3 weeks ago


Santa Clara, United States Nvidia Full time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities which are hard to solve, which only we can pursue, and which matter to the world. This is our life's work: to amplify human inventiveness and intelligence. Make the choice to join us today. We are seeking an outstanding candidate for Product Reliability Manager who will lead and manage NVIDIA product reliability team to ensure world-class quality products and services that meet or exceed customer expectations.

What you'll be doing:

* Manage a team to drive qualifications activities to success for all new products, supplies' changes. Lead the team in planning DOE and performing analyses to improve the quality of products and effectiveness of the reliability stresses. Resolve reliability issues and if needed perform risk analyses related to products' reliability and quality problem.


* Provide periodically reports for NVIDIA products' qualification progress.


* Work with a cross function team of NPI, PE, TE, Foundry, Advanced Technology, Silicon validation, DFT … to gather needed information of packaging technology, CMOS technology nodes. Employ these data to plan for qualification matrix. if applicable, draft Design of Experiments to improve the quality and/or reliability of NVIDIA products.


* Work with Customer Quality teams to provide requested information to customers. For example, FIT and MTTF w.r.t customers' mission profiles.


* Participate in assessment of reliability test requirements required to release non-conforming materials.


* Have committed to proactively seeking continuous improvement of the Reliability team infrastructures.


* Provide the leadership in guiding the teams in planning, oversee and direct complex and multi-dimensional projects.


* Provide oversight to technical staff. As the lead of the team, you need to provide the strategy to the team how to complete complex projects successfully. Recommend project improvements.


* To improve the performance of the team, you need to present the mission of the whole team annually.


* Manage the resources or seeking additional resources if needed to increase the productive performance of the whole team. Monitor the budget, resources, and all project progress.



What we need to see:

* Have sound knowledge of CMOS technology, especially Fin-FET.


* Have hands-on experience in debugging circuit issues, related to reliability aspects.


* Have good knowledge of 2.5D/3D packaging technology. COWOS specifically.


* Must have hands-on experience with lab equipment such as TC/HAST/Reflow chambers, and ovens used in HTOL such as HBPs.


* Great teammate with good communication, process oriented and problem-solving skills.


* Familiar with reliability statistics, reliability models, and industrial standards (JEDEC, AEQC, Mil, IPC).


* PhD or MS in Electrical Engineering, Physics, or a related major (or equivalent experience).


* 8+ years of overall experience in semiconductor reliability and 3+ years of managing a team.


* Being Fluently in English Speaking and Writing is a must.



With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world's most desirable employers. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you. Come build the future with us

The base salary range is 164,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


  • Reliability Manager

    4 days ago


    Santa Clara, United States NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...


  • Santa Clara, United States Palo Alto Networks Full time

    Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for...


  • Santa Clara, United States Palo Alto Networks Inc. Full time

    Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done,...


  • Santa Clara, United States Palo Alto Networks Full time

    Job DescriptionJob DescriptionCompany DescriptionOur MissionAt Palo Alto Networks® everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting...


  • Santa Clara, United States Natron Energy Full time

    We are seeking a highly experienced and dynamic individual to join our team as the Senior Director of Quality and Reliability. In this role, you will play a pivotal leadership role in championing a culture of quality excellence and driving the design and testing processes to ensure product reliability. You will lead a team of talented professionals and...


  • Santa Clara, United States Natron Energy Full time

    We are seeking a highly experienced and dynamic individual to join our team as the Senior Director of Quality and Reliability. In this role, you will play a pivotal leadership role in championing a culture of quality excellence and driving the design and testing processes to ensure product reliability. You will lead a team of talented professionals and...


  • Santa Clara, United States Centrify Corporation Full time

    Our software runs on public clouds with 99.9% or better uptime and is mission critical for our customers. Our cloud operations team is where the rubber meets the road and needs innovative Site Reliability Engineers. Join a professional team of smart and hard-working professionals building enterprise-class cloud-based services in the rapidly growing market of...


  • Santa Clara, United States OSI Engineering, Inc. Full time

    Responsibilities:Coordinate reliability test plans and prepare REL test results under oversight of individual product REL DRIsPartner with in-region REL DRIs to monitor REL test status and drive tests to completionParticipate in meetings with and provide regular updates to Cupertino cross- functional teamsCoordinate technical deep-dives and failure analysis...


  • Santa Clara, United States Kofi Group Full time

    To Apply for this Job Click HerePrincipal Site Reliability EngineerSan Francisco Bay Area, CAWe are partnering with a late-stage Cloud Security company that is looking for a Principal Level SRE The ideal candidate will have:Strong sense of architecture and design for fault tolerance, scale-out approaches, and stability Deep experience in building tools...


  • Santa Clara, United States Pure Storage Full time

    Company Overview: BE PART OF BUILDING THE FUTURE. What do NASA and emerging space companies have in common with COVID vaccine R&D teams or with Roblox and the Metaverse? The answer is dataall fast moving, fast growing industries rely on data for a competitive edge in their industries. And the most advanced companies are realizing the full data advantage by...


  • Santa Clara, United States Cryptoware Technologies Inc Full time

    Job DescriptionJob Description Responsibility • Lead the effort of global expansion of Huobi globe spanning infrastructure. • Work with engineering teams to make sure new features and changes are deployed quickly and safely. • Constantly improve our system performance and reliability through better tools, process and monitoring system. • Staffing an...


  • Santa Clara, United States NVIDIA Full time

    Site Reliability Engineering (SRE) is an engineering discipline that involves designing, building, and maintaining large-scale production systems with high efficiency and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. SRE professionals are highly specialized and...


  • Santa Clara, United States Cryptoware Technologies Inc Full time

    Job DescriptionJob DescriptionResponsibility•       Lead the effort of global expansion of Huobi globe spanning infrastructure.•       Work with engineering teams to make sure new features and changes are deployed quickly and safely.•       Constantly improve our system performance and reliability through better tools, process and...


  • Santa Clara, United States Palo Alto Networks Full time

    Sr Principal Site Reliability Engineer (Advanced Threat Protection) Palo Alto Networks Implement Zero Trust, Secure your Network, Cloud workloads, Hybrid Workforce, Leverage Threat Intelligence & Security Consulting. Cybersecurity Services & Education for CISO’s, Head of Infrastructure, Network Security Engineers, Cloud... View company page At Palo Alto...


  • Santa Maria, United States Catalyst Recruiting, Inc Full time

    Maintenance Manager Company is a multi-faceted, complex speciality materials manufacturing and distribution company needing a maintenance manager for multi-plant operations in California. Lead the continuous improvement and management of the maintenance program, CMMS, etc KEY: Experienced in RCM and/or TPM in minerals, inorganic chemicals, materials...


  • Santa Clara, United States Palo Alto Networks Full time

    Company Description Due to FedRAMP High requirements this person must be a US Citizen or Green Card Holder. Our Mission At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a...


  • Santa Maria, United States Catalyst Recruiting Inc. Full time

    Job DescriptionJob DescriptionMaintenance ManagerCompany is a multi-faceted, complex speciality materials manufacturing and distribution company needing a maintenance manager for multi-plant operations in California. Lead the continuous improvement and management of the maintenance program, CMMS, etcKEY:Experienced in RCM and/or TPM in minerals, inorganic...


  • Santa Clara, United States Palo Alto Networks Full time

    Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for...


  • Santa Clara, United States Software Technology Inc Full time

    Job Description Job Description Position : Service Reliability Engineer / Sr. Devops Engineer Location : Santa Clara, CA Duration : 1 Year + OK with any visa No OPT please Local consultants only Customer will not provide letter for H1B candidates. Please check with the candidate and employers before submitting the resume. Face to face is mandatory so please...


  • Santa Clara, United States Palo Alto Networks Full time

    Job DescriptionJob DescriptionCompany DescriptionOur MissionAt Palo Alto Networks® everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting...