Resiliency and Safety Architect

2 weeks ago


Santa Clara, United States NVIDIA Full time

Resiliency and Safety Architect - New College Grad 2025 Apply locations US, CA, Santa Clara time type Full time posted on Posted 2 Days Ago job requisition id JR1985520 NVIDIA is a learning machine that constantly evolves by seeking exciting opportunities that matter to the world, and that only we can solve. We attract the world’s best people, so we can achieve our highest aim: building a company that lets us do our life’s work, at the highest level of our craft. We are now seeking a Resiliency and Safety Architect to support the development of GPU (graphical processing units) and Tegra SoC hardware and software resiliency features. In this role, you will be a key member of a team of innovators, challenging the status quo and pushing beyond boundaries. You will have the opportunity to impact the industry's leading GPUs and SOCs powering product lines ranging from consumer graphics to self-driving cars and the growing field of artificial intelligence. What you'll be doing: Collaborate with the hardware and software teams to architect new resiliency and safety features and guide future development. Optimize hardware and software features to improve system robustness, performance, and security. Model and analyze RAS metrics like Failures in Time and Availability; and Safety metrics like Diagnostic Coverage and PMHF. Run simulations to analyze Architectural Vulnerability Factor and Liveness of on-die memory. Participate in testing new and existing resiliency and safety hardware and software features. Develop diagnostics software components for Resiliency and Safety to run on NVIDIA GPUs. Work on compliance of products with functional safety standards (ISO 26262 and ASPICE (Automotive SPICE)). This includes defining requirements, architecture, and design with end-to-end traceability, performing safety analyses - FMEA/DFA/FTA and ensuring compliance of software to MISRA and Cert-C standards. What we need to see: Master’s or PhD degree in Computer Science, Computer Engineering, Electrical Engineering or closely related degree or equivalent experience. Familiarity with computer system architecture, microprocessors, and microcontroller fundamentals (caches, buses, direct memory access, etc.). Basic knowledge of some aspects of GPU/SoC architecture - Clocks, Resets, Interrupts, Memory Controller, Multimedia accelerator pipelines. Proficiency in C/C++. Scripting and automation with Python or similar. Understanding of the software development life cycle, from requirements to testing closure and maintenance. Strong debugging and analytical skills. Be self-driven and results oriented. Ways to stand out from the crowd: Familiarity with general HW concepts, Verilog RTL coding and simulations/debug. Familiarity with GPU Architectures, Machine Learning/Deep Learning concepts. CUDA Programming. Experience in embedded software development. Experience with resiliency and functional safety. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. Do you love the challenge of crafting the highest-performance silicon possible? If so, we want to hear from you Come, join our Accelerated and Resilient Compute Systems team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field. The base salary range is 120,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr



  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a dynamic organization that continuously adapts by pursuing impactful opportunities that only we can address. We attract top talent to achieve our ultimate goal: to create a workplace that allows us to excel in our craft. We are currently looking for a Safety and Resiliency Architect to contribute to the development of GPU (Graphics Processing...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a dynamic organization that continually seeks meaningful opportunities to address global challenges that only we can tackle. We attract top talent to achieve our mission: to create an environment where we can excel in our respective fields. We are currently looking for a Resiliency and Safety Architect to contribute to the advancement of GPU...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleNVIDIA is a leading innovator in the field of artificial intelligence, computer graphics, and high-performance computing. We are seeking a highly skilled Resiliency and Safety Expert to join our team and contribute to the development of cutting-edge GPU and Tegra SoC hardware and software resiliency features.Key ResponsibilitiesCollaborate with...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleNVIDIA is seeking a highly skilled Senior Software Architect to lead the development of AI software resilience for our most powerful AI supercomputers.Key ResponsibilitiesDevelop and implement critical resilience features to support frontier model training at scale, ensuring robust and reliable AI systems.Serve as a trusted authority on AI...


  • Santa Clara, California, United States Resiliency Full time

    Job Description:We're seeking a highly skilled Executive Director of Operations to join our team at Resiliency. As a key stakeholder, you will play a crucial role in ensuring customer satisfaction, leading company guidelines and processes, and driving company profitability.The successful candidate will:Work closely with the Distribution Manager, Warehouse,...

  • Sales Engineer

    3 months ago


    Santa Clara, United States Empower Associates Full time

    Job DescriptionJob Description Position: Solutions Architect/Sales Engineer 6+ contractHybrid Remote 1-3 days onsite office in Santa Clara, CA This role will be supporting Amazon Web Services’ key client, Intel.  Required experience: AWS and Semiconductor experience required (Familiarity with Intel preferred) Familiarity with Intel products and...

  • Sales Engineer

    3 weeks ago


    Santa Clara, United States Empower Associates Full time $90 - $120

    Job DescriptionJob Description Position: Solutions Architect/Sales Engineer  Semi-conductor Manufacturing 6+  month contractHybrid Remote 1-3 days onsite Santa Clara, CA office This role will be supporting Amazon Web Services’ key client, Intel.  Required experience:MUST have previous experience in Semiconductor Manufacturing with specific experience...


  • Santa Clara, California, United States Intervision Systems Technologies Inc Full time

    Are you looking for a challenging role as an Application Development Architect?As a leading managed service provider (MSP), InterVision assists IT leaders in solving the most crucial challenges they face by solving for the right technology, deployed on the right premises, and managed through the right model to fit their unique demands and meet their...


  • Santa Clara, United States InterVision Systems Full time

    Job DescriptionJob DescriptionAre you looking for a challenging role as an Application Development Architect?As a leading managed service provider (MSP), InterVision assists IT leaders in solving the most crucial challenges they face by solving for the right technology, deployed on the right premises, and managed through the right model to fit their unique...


  • Santa Clara, United States InterVision Systems Full time

    Job DescriptionJob DescriptionAre you looking for a challenging role as an Application Development Architect?As a leading managed service provider (MSP), InterVision assists IT leaders in solving the most crucial challenges they face by solving for the right technology, deployed on the right premises, and managed through the right model to fit their unique...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryNVIDIA is seeking a highly skilled Senior Cloud Engineer to join its Infrastructure, Planning and Processes organization. As a Senior Cloud Engineer, you will be part of a fast-paced team that develops and maintains NVIDIA's internal cloud provisioning product for GPUs and Tegra systems.Key ResponsibilitiesDesign and implement scalable, resilient...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Lead Software Architect with a strong background in developing highly scalable and resilient enterprise applications to join our innovative team. Our focus is on enhancing a sophisticated platform designed to automate the diagnosis and repair of GPU and CPU clusters across various environments, including public clouds, private clouds, and...


  • Santa Clara, California, United States Resiliency LLC Full time

    Job Description:We're seeking a highly skilled Executive Director of Operations to join our team at Resiliency LLC. As a key stakeholder, you will be responsible for ensuring customer satisfaction, leading company guidelines and processes, and driving company profitability.The successful candidate will:Collaborate closely with the Distribution Manager,...


  • Santa Clara, United States MindSource Full time

    Mindsource is seeking a Principal Endoscope Architect for one of our Direct Clients based in Silicon Valley . If interested, please drop your resume to akhil@mindsource.com Title: Principal Endoscope ArchitectRemote Fulltime Need people that can build complex catheter systems.A Day In The Life Of Our Principal Endoscope Architect:Leads the technical...


  • Santa Clara, United States MindSource Full time

    Mindsource is seeking a Principal Endoscope Architect for one of our Direct Clients based in Silicon Valley . If interested, please drop your resume to akhil@mindsource.com Title: Principal Endoscope ArchitectRemote Fulltime Need people that can build complex catheter systems.A Day In The Life Of Our Principal Endoscope Architect:Leads the technical...


  • Santa Clara, California, United States Amazon Full time

    Are you driven by the challenge of designing innovative cloud solutions for cutting-edge "Internet of Things" (IoT) clients? The Amazon Web Services (AWS) Solutions Architect team collaborates with customers to create and implement some of the most scalable, adaptable, and robust cloud architectures. AWS Solutions Architects work closely with AWS Sales and...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleNVIDIA is a leader in the field of artificial intelligence and high-performance computing. We are seeking a highly skilled Principal Platform Software Architect to join our team.Key ResponsibilitiesDesign and develop next-generation data center server product platform architecture, bringing up and driving solutions to production.Work closely...


  • Santa Clara, California, United States Jobot Full time

    Fire Safety Project Coordinator - Greater San Jose AreaAbout Us:At Jobot, we are dedicated to delivering top-notch electrical services while adhering to the most stringent project timelines demanded by our clients. Our commitment to excellence ensures that we maintain a high volume of repeat business, as we strive to keep our customers for life.Why Join Our...


  • Santa Monica, California, United States Resiliency Full time

    Position Overview:As a vital member of our healthcare team at Resiliency, you will be tasked with conducting patient imaging procedures under the guidance of a nuclear medicine physician. Your role will encompass calculating appropriate radioisotope dosages for patient administration, managing advanced radioscopic machinery, and adhering to stringent...


  • Santa Clara, California, United States Jobot Full time

    Fire Safety Project CoordinatorThis Jobot Job is hosted by: David RochaAbout Us:At Jobot, we are dedicated to delivering exceptional electrical services while adhering to the most stringent project timelines. Our commitment to quality and customer satisfaction has established us as a leader in the industry, fostering long-term relationships with our...