Resiliency and Safety Architect

4 weeks ago


Santa Clara, California, United States NVIDIA Full time

We are seeking a Resiliency and Safety Architect to support the development of GPU and Tegra SoC hardware and software resiliency features. As a key member of our team, you will collaborate with hardware and software teams to architect new resiliency and safety features and guide future development.

Key Responsibilities:

  • Collaborate with hardware and software teams to architect new resiliency and safety features and guide future development.
  • Optimize hardware and software features to improve system robustness, performance, and security.
  • Model and analyze RAS metrics like Failures in Time and Availability; and Safety metrics like Diagnostic Coverage and PMHF.
  • Run simulations to analyze Architectural Vulnerability Factor and Liveness of on-die memory.
  • Participate in testing new and existing resiliency and safety hardware and software features.
  • Develop diagnostics software components for Resiliency and Safety to run on NVIDIA GPUs.
  • Work on compliance of products with functional safety standards (ISO 26262 and ASPICE (Automotive SPICE)).

Requirements:

  • Master's or PhD degree in Computer Science, Computer Engineering, Electrical Engineering or closely related degree or equivalent experience.
  • Familiarity with computer system architecture, microprocessors, and microcontroller fundamentals (caches, buses, direct memory access, etc.).
  • Basic knowledge of some aspects of GPU/SoC architecture - Clocks, Resets, Interrupts, Memory Controller, Multimedia accelerator pipelines.
  • Proficiency in C/C++.
  • Scripting and automation with Python or similar.
  • Understanding of the software development life cycle, from requirements to testing closure and maintenance.
  • Strong debugging and analytical skills.

NVIDIA is a leader in the field of AI computing, and we are committed to fostering a diverse work environment. We are an equal opportunity employer and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.



  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the field of artificial intelligence and high-performance computing. We are seeking a talented individual to join our Accelerated and Resilient Compute Systems team as a Resiliency and Safety Architect.This role will involve collaborating with hardware and software teams to design and develop new resiliency and safety features for our...


  • Santa Monica, California, United States Resiliency Full time

    Resiliency Operating Room Leadership Manager Job DescriptionThe Nursing Management Operating Room Director is responsible for the overall management of the operating room, including the coordination of staff, scheduling of surgeries, and ensuring the safety and quality of patient care. Key Responsibilities:Lead the nursing team in the operating room,...


  • Santa Clara, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Cloud Solutions Architect to join our team at Amazon. As a Cloud Solutions Architect, you will be responsible for designing and implementing cloud-based solutions for our customers.Key Responsibilities:Collaborate with customer executives and architects to accelerate their business outcomes and recommend cloud...


  • Santa Clara, California, United States GyanSys Full time

    Job Title: Microsoft Cloud Solutions ArchitectAbout the Role:GyanSys is seeking a skilled Microsoft Cloud Solutions Architect to join our team. As a key member of our cloud solutions team, you will be responsible for designing and implementing scalable, secure, and resilient cloud architectures using Microsoft Azure and other cloud services.Key...

  • FPGA Architect

    4 weeks ago


    Santa Clara, California, United States Applied Materials Full time

    Role: FPGA Architect - (E5)Location: OnsiteAt Applied Materials, we're committed to pushing the boundaries of science and engineering to make possible the next generations of technology.We're seeking a highly skilled FPGA Architect to join our team. As a key member of our Advanced Packaging group, you will be responsible for designing and developing...


  • Santa Clara, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Cloud Computing Expert to join our team as a Senior Solutions Architect for EC2 and Networking. In this role, you will work closely with customers to design and implement scalable, flexible, and resilient technical architectures that address their complex challenges.As a Cloud Computing Expert, you will have the...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a highly skilled Principal Platform Software Architect to lead the development of next-generation data center server product platforms. As a key member of our team, you will be responsible for designing and implementing scalable, secure, and high-performance firmware solutions for our data center products.Key Responsibilities:Design and...


  • Santa Clara, California, United States AmazonWebServices Full time

    About the RoleWe are seeking a highly skilled Senior Cloud Solutions Architect to join our team at Amazon Web Services (AWS). As a key member of our Solutions Architect team, you will be responsible for designing and building scalable, flexible, and resilient cloud architectures and solutions for our customers.Key Responsibilities:Design and develop cloud...

  • Senior DFX Architect

    4 weeks ago


    Santa Clara, California, United States Nvidia Full time

    Job DescriptionNVIDIA is a leader in developing cutting-edge processor and system architectures that accelerate machine learning, automotive, and high-performance computing platforms. Our work enables groundbreaking innovations, outstanding creativity, and discovery, and powers what were once science fiction inventions, from artificial intelligence to...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High Performance Computing, and Visualization. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.We are the GPU Communications Libraries and...


  • Santa Clara, California, United States Nvidia Full time

    We are seeking a highly skilled Senior Architect to lead our Data Systems team in developing cutting-edge, high-performance, and power-efficient System on Chip (SoC) processors for various markets, including 3D graphics, deep learning, HPC, and automotive.This role offers the opportunity to drive innovation and have a real impact in a dynamic,...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the field of high-performance computing, and we are seeking a skilled Senior Software Engineer to join our team.The ideal candidate will have a strong background in software development, with experience in designing and implementing reliable distributed systems. They will also have a solid understanding of scalability challenges and...


  • Santa Monica, California, United States Resiliency Full time

    Job SummaryThe RN Manager - Operating Room is a key leadership position responsible for the overall management of the operating room. This includes coordinating staff, scheduling surgeries, and ensuring the safety and quality of patient care.Key ResponsibilitiesManage the day-to-day operations of the operating room, including staffing, scheduling, and...


  • Rancho Santa Fe, California, United States Leidos Full time

    Job Title:Unified Communications Domain ArchitectJob Summary:Leidos is seeking a highly skilled Unified Communications Domain Architect to join our team. As a key member of our Service Management Integration and Transport (SMIT) Contract, you will be responsible for developing and implementing unified communications solutions that meet the needs of the Navy...

  • Project Manager

    1 month ago


    Santa Clara, California, United States Jobot Full time

    Established General Contractor Seeks Experienced Project ManagerAt Jobot, we're expanding our project management team to support our growing commercial interiors business in the Silicon Valley. As a seasoned Project Manager, you'll play a critical role in delivering high-quality projects on time and within budget.Key Responsibilities:Maintain accurate...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a trailblazer in computer graphics, PC gaming, and accelerated computing. We're now tapping into the unlimited potential of AI to define the next era of computing.An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world.As a Sr. System SW Engineer, you'll be immersed in a diverse,...

  • Executive Chef

    4 weeks ago


    Santa Clara, California, United States Guckenheimer Full time

    About GuckenheimerGuckenheimer is a global organization committed to making the international community more resilient and just for all people. We encourage diversity and inclusion in their broadest terms, including ethnicity, race, age, gender, gender identity, disability, sexual orientation, religious beliefs, language, culture, and educational...


  • Santa Clara, California, United States XPENG Motors Full time

    We are seeking an experienced Senior C++ Software Engineer to join our Self Driving Architecture Team at XPENG Motors. As a key member of our team, you will have the opportunity to impact all teams across autonomy, authoring libraries, improving system performance, and mentoring junior developers.This is a great opportunity for experienced C++ developers to...


  • Santa Clara, California, United States Applied Materials, Inc. Full time

    Role: As a Field Service Engineer at Applied Materials, you will be responsible for installing, maintaining, and upgrading cutting-edge customer equipment. You will work in our state-of-the-art facility and at customer sites around the globe, where you are the face of Applied Materials and an integral part of a vibrant and diverse...


  • Santa Clara, California, United States Applied Materials Full time

    Job SummaryAs a Senior Manager, Security Operations, you will be responsible for managing the development, oversight, and execution of security operations programs and initiatives for Applied Materials at the site level. This role requires a unique combination of security/safety expertise coupled with deep program management and strategic initiatives...