Current jobs related to Senior System Reliability Engineer - Santa Clara - NVIDIA

Senior Reliability Engineer

1 week ago

Santa Clara, California, United States NVIDIA Full time

Senior Reliability Engineer: NVIDIA is seeking a highly skilled Senior Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for planning and implementing the qualifications of new NVIDIA products, including IC chips in AI, Mobile, Automotive, Deep Learning, Graphic Processor, and System on Chip sectors. Key...
Senior System Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

Reliability Engineer: NVIDIA is a leader in the field of artificial intelligence and high-performance computing. We are seeking a highly skilled Reliability Engineer to join our team. The successful candidate will be responsible for providing expertise in hardware reliability engineering for electronics and server systems. This will involve establishing and...
Senior Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

Job Title: Senior Reliability Engineer. NVIDIA is a leader in the field of computer graphics, PC gaming, and accelerated computing. We are seeking a highly skilled Senior Reliability Engineer to join our team. Job Summary: We are looking for a talented individual with expertise in HTOL stress testing, JEDEC standards, and thermal management techniques. The...
Senior Cloud Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

Job Title: Senior Cloud Reliability Engineer. We are seeking a highly motivated Senior Cloud Reliability Engineer to join our Embedded organization. This team is responsible for automating, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications such as Metropolis, ACE, and Riva hosted in the cloud. The Senior Cloud Reliability...
Senior Cloud Reliability Engineer

3 weeks ago

Santa Clara, California, United States NVIDIA Full time

Job Title: Senior Site Reliability Engineer. NVIDIA is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing, implementing, and supporting operational and reliability aspects of large scale Kubernetes clusters. Key Responsibilities: Design and implement...
Senior Product Reliability Engineer

4 weeks ago

Santa Clara, California, United States Anello Photonics Full time

About Anello Photonics: Anello Photonics is a pioneering technology company based in Santa Clara, California. We have developed cutting-edge integrated photonic system-on-chip technology for next-generation navigation. Our SIPHOG gyroscope is based on patented photonic integrated circuit technology, offering higher performance, smaller size and weight, and...
Senior Product Reliability Engineer

4 weeks ago

Santa Clara, California, United States Anello Photonics Full time

About Anello Photonics: Anello Photonics is a leading-edge technology company based in Santa Clara, CA. We have developed integrated photonic system-on-chip technology for next-generation navigation. Our SIPHOG™ gyroscope is based on our patented photonic integrated circuit technology. This innovative technology enables a product that is higher performance,...
Senior Product Reliability Engineer

3 days ago

Santa Clara, California, United States Anello Photonics Full time

About Anello Photonics: Anello Photonics is a leading-edge technology company based in Santa Clara, CA. The company has developed integrated photonic system-on-chip technology for next-generation navigation. ANELLO's SIPHOG™ gyroscope is based on its patented photonic integrated circuit technology. The result is a product that is higher performance, much...
Senior Cloud Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

About the Role: NVIDIA is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing, implementing, and supporting operational and reliability aspects of our large-scale Observability & Telemetry collection platform. You will engage in the entire lifecycle of...
Senior Site Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

About NVIDIA: NVIDIA is a leader in the field of artificial intelligence, machine learning, and datacenter acceleration. Our company has a rich history of innovation, with a legacy that dates back to the invention of the GPU in 1999. This groundbreaking technology sparked the growth of the PC gaming market, redefined modern computer graphics, and...
Senior Cloud Reliability Engineer

7 days ago

Santa Clara, California, United States NVIDIA Full time

At NVIDIA, we're seeking a highly skilled Senior Cloud Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you'll be responsible for designing, building, and maintaining large-scale production systems with high efficiency and availability. This is a highly specialized discipline that demands knowledge across...
Senior Staff Site Reliability Engineer

1 week ago

Santa Clara, California, United States Palo Alto Networks Full time

Job Overview: Palo Alto Networks is seeking a highly skilled Cloud Infrastructure Engineer to join our CDL/SLS team. As a Senior Staff Site Reliability Engineer, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Our team is at the forefront of innovation, constantly pushing the boundaries of what is...
Senior Cloud Reliability Engineer

2 weeks ago

Santa Clara, California, United States NVIDIA Full time

Job Description: NVIDIA is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing, implementing, and supporting operational and reliability aspects of large scale Kubernetes clusters. Key Responsibilities: Design and implement operational and reliability aspects of large...
Senior Reliability Engineer

1 week ago

Santa Clara, California, United States NVIDIA Full time

Reliability Engineer Job Description: NVIDIA is a leader in the field of computer graphics, PC gaming, and accelerated computing. We are seeking a highly skilled Reliability Engineer to join our team. Key Responsibilities: Develop, debug, and manage test programs for the HTOL oven. Review and design HTOL board schematics for various ovens. Diagnose signal...
Senior Systems Engineer

1 week ago

Santa Clara, California, United States Apollo Professional Solutions Full time

Job Summary: Apollo Professional Solutions is seeking a highly skilled Senior Systems Engineer to join our team. As a key member of our engineering team, you will be responsible for improving the reliability and robustness of our next-generation sequencing and sample prep platforms. Key Responsibilities: Conduct failure analysis and root cause investigations to...
Director of Reliability Engineering

2 weeks ago

Santa Clara, California, United States Ushur Full time

About Ushur: Ushur is a leading provider of Customer Experience Automation solutions, empowering enterprises to deliver delightful customer and employee experiences. Our cutting-edge technologies, including Conversational AI, Machine Learning, and Intelligent Process Automation, enable Fortune 100 companies to automate their customer engagement. The Role: We are...
Senior Staff Site Reliability Engineer

4 weeks ago

Santa Clara, California, United States Palo Alto Networks Full time

About the Role: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. Key Responsibilities: Develop expertise in new technologies and contribute to the success of SRE and...
Senior Staff Site Reliability Engineer

2 weeks ago

Santa Clara, California, United States Palo Alto Networks Full time

About the Role: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Contribute to the success of SRE and DevOps teams. Develop expertise in new...
Senior Staff Site Reliability Engineer

1 week ago

Santa Clara, California, United States Palo Alto Networks Full time

About Us: Palo Alto Networks is a leader in the cybersecurity industry, dedicated to protecting the digital way of life. Our mission is to be the cybersecurity partner of choice, and we're looking for innovators who share our passion for shaping the future of cybersecurity. We're a company built on disruption, and we're looking for individuals who are...
Senior Staff Site Reliability Engineer

1 week ago

Santa Clara, California, United States Palo Alto Networks Full time

Job Description: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team. As a key member of our infrastructure team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Develop expertise in new technologies and contribute to the...

Senior System Reliability Engineer

2 months ago

Santa Clara, United States NVIDIA Full time

Locations: US, CA, Santa Clara
Time Type: Full time
Posted on: 2 Days Ago
Job Requisition ID: JR1980220

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing, with the GPU acting as the brains of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and build our teams with the most thoughtful people in the world. Join us at the forefront of technological advancement.

GPU Servers are one of the fastest-growing segments for NVIDIA and the Artificial Intelligence industry. As computational power increases with every GPU generation, developing efficient and reliable systems is imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, which covers NVIDIA's diverse system product range, specifically Graphics and High-Performance Computing printed circuit boards and Data Center Servers.

What you'll be doing:

- Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from Concept to End-of-Life phase.
- Establish, deliver, and maintain product reliability standards and metrics for NVIDIA's new system technologies, using existing tools and processes or developing new ones as required.
- Participate in product and engineering design reviews, assess the reliability budget of products/designs, and inspire changes that enhance product reliability.
- Interface and interact with all pertinent engineering groups, suppliers, and partners, ensuring the desired reliability is achieved using Design for Reliability (DfR) methods including FMEA and DoE approaches.
- Define and implement Reliability Plans & Specifications.
- Provide reliability predictions, along with test plans and methods to assess and drive product reliability to the desired levels.
- Perform and lead appropriate testing with associated failure analysis and recommendations for improving designs and manufacturing.
- Develop and present methods of correlating reliability test results with actual field performance.

What we need to see:

- BS (or equivalent experience) in Engineering, Material Science, Physics, or a related field; MS or PhD preferred.
- 5+ years in a hardware validation/reliability environment related to PCIe peripherals, graphics cards, and servers.
- Understanding of power supply, memory, high-speed I/O, PCI Express, Ethernet, and I2C.
- Hands-on experience in theoretical and practical Reliability concepts as they relate to high-tech electronic enterprise and consumer products.
- A strong command and understanding of statistical concepts/models/analysis and how they relate to product reliability & life analysis.
- Good verbal and writing skills as well as the ability to communicate at a high level.
- Self-motivating, independent, and committed to getting things done.
- Good project management skills and ability to balance multiple simultaneous projects during development and production stages.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you. Come build the future with us.

The base salary range is 96,000 USD - 166,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About Us: NVIDIA is a Learning Machine. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

