Senior Infrastructure Engineer

3 weeks ago


Santa Clara, California, United States Sustainable Talent Full time
Job Overview

Sustainable Talent is seeking a highly skilled Senior Infrastructure Engineer to support the NVIDIA Cloud Infrastructure Team. As a key member of our team, you will be responsible for supporting infrastructure team operations, cloud infrastructure system enrollments, deployments, and troubleshooting.

Key Responsibilities:

  • Support the Infrastructure Team Operations queue, working within our Infrastructure and Cloud Operations environments.
  • Support data center (DC) and cloud-based infrastructure system enrollments, deployments, troubleshooting, and operations.
  • Assist in roll-out and deployment of new development features aimed at supporting the latest NVIDIA hardware and technologies.
  • Work closely with world-class engineers, architects, technical product managers, and application developers to put the best strategies in place for a product launch.
  • Define and implement full-scale solutions for product onboarding into our hosted and private cloud environments.
  • Solve sophisticated problems involving multi-site deployments of NVIDIA products.
  • Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center teams, external vendors, and other partners to successfully deliver a reliable and robust platform from concept to prototype to deployments.
  • Directly contribute to the overall quality of deployments and improve time to market for next-generation products.

Requirements:

  • Bachelor's or Master's Degree in Computer Science or Software Engineering, or equivalent experience.
  • 5+ years of relevant experience.
  • 3+ years of Linux and scripting experience.
  • Solid background in supporting private cloud and hardware (HW) operations.
  • A track record of quickly understanding new technologies outside of your domain expertise and deploying systems in sophisticated configurations from hardware through multiple layers of software in a fast-paced environment.
  • Strong technical skills and understanding of embedded systems, orchestration & automation systems, data centers and cloud architecture, as well as excellent communication and planning skills.
  • Strong problem-solving ability and experience in product engineering, failure analysis and debugging, and HW or test design.
  • Detailed understanding of HW system configurations, including BIOS, Compute, Storage and Networking.

Preferred Qualifications:

  • Experience in large-scale QA environments for product bring-ups.
  • Operations support experience managing bug queues and support tickets.
  • Background with supporting GPUs, embedded device development, driver development and CUDA applications.
  • Experience with converged and hyper-converged hardware and servers.
  • Background with Bash and Python.
  • Familiarity with Jenkins, Ansible and REST APIs.
  • Strong background in Windows and Linux administration.

Sustainable Talent is an equal employment opportunity and affirmative action employer.



  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Title: Senior Cloud Infrastructure Engineer. Palo Alto Networks is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team. As a Senior Cloud Infrastructure Engineer, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. Key Responsibilities: Design and implement scalable cloud...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Cloud Infrastructure Engineer. NVIDIA is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our Infrastructure, Planning and Process (IPP) team. As a key member of our global organization, you will be responsible for designing, building, and maintaining our cloud infrastructure to support the development and deployment of...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Title: Senior Cloud Infrastructure Engineer. Palo Alto Networks is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating scalable and secure cloud infrastructure. About the Role: We are looking for a talented engineer with...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Site Reliability Engineer. NVIDIA is seeking a highly skilled Senior Site Reliability Engineer to join our Infrastructure, Planning and Process (IPP) team. As a key member of our global organization, you will play a critical role in designing and implementing scalable, reliable, and efficient cloud infrastructure solutions. Our cloud services...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team at Palo Alto Networks. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating scalable and secure cloud infrastructure to support our mission-critical applications. Key Responsibilities: Design and...


  • Santa Clara, California, United States Pan Asia Resources Full time

    Job Title: Senior Systems Infrastructure Engineer. We are seeking a highly skilled Senior Systems Infrastructure Engineer to join our team at Pan Asia Resources. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure on AWS. Key Responsibilities: Design and implement scalable and...


  • Santa Clara, California, United States NVIDIA Full time

    Transformative Infrastructure Performance Engineer. NVIDIA is at the forefront of technological innovation, driving efficiency and optimizing the performance of our infrastructure both on-prem and cloud. We are seeking a highly skilled Senior Staff Infrastructure Performance Engineer to join our dynamic team. Key Responsibilities: Lead initiatives to transform...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Staff Infrastructure Performance Engineer. NVIDIA is a leader in the technology industry, and we are seeking a highly skilled Senior Staff Infrastructure Performance Engineer to join our dynamic team. As a key member of our IT organization, you will play a critical role in driving efficiency and optimizing the performance of our...


  • Santa Clara, California, United States NVIDIA Full time

    Transform IT Compute Platform Architecture. NVIDIA is at the forefront of technological innovation, driving efficiency and optimizing the performance of our infrastructure both on-prem and cloud. We are seeking a highly skilled Senior Staff Infrastructure Performance Engineer to join our dynamic team. Key Responsibilities: Lead initiatives to transform IT...


  • Santa Clara, California, United States Trillium Staffing Full time

    Senior SRE Engineer. Trillium Staffing is seeking a seasoned Senior SRE Engineer to join its fast-paced Infrastructure, Planning and Processes organization in Santa Clara, CA. As a key member of the team, you will be responsible for developing and maintaining sophisticated internal cloud provisioning products for GPUs and Tegra systems. Key...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team at Palo Alto Networks. As a key member of our team, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. Key Responsibilities: Contribute to the success of SRE and DevOps teams. Develop expertise in new...


  • Santa Clara, California, United States NVIDIA Full time

    About the Role: NVIDIA is seeking a highly skilled Senior SRE Engineer to join its Infrastructure, Planning and Processes organization. As a key member of the team, you will be responsible for designing and implementing scalable, resilient cloud infrastructure platforms using Kubernetes and other technologies. Key Responsibilities: Design and implement Kubernetes...


  • Santa Clara, California, United States NVIDIA Full time

    Transform IT Compute Platform Architecture. NVIDIA is seeking a highly skilled Senior Staff Infrastructure Performance Engineer to join our dynamic team. As a key member of our IT organization, you will be responsible for leading initiatives to transform our IT Compute platform architecture to build new service offerings across On-Prem & Cloud. Key...


  • Santa Clara, California, United States Oracle Full time

    Job Title: Senior Network Engineer in Cloud Infrastructure. At Oracle, we're building the future of cloud computing for enterprises. As a Senior Network Engineer in Cloud Infrastructure, you'll be part of a diverse team of creators and inventors who drive innovation and excellence. Responsibilities: Design, deploy, and operate a large-scale global Oracle cloud...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team at Palo Alto Networks. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Contribute to the success of SRE and DevOps teams. Develop expertise in new...


  • Santa Clara, California, United States ServiceNow Full time

    Overview: The ServiceNow SRE team is a group of highly technical engineers who are tasked with maintaining and developing the reliability, scalability, and performance of the ServiceNow cloud infrastructure. Our SREs are empowered to drive technical resolutions across the technology stack from hardware through to application and all stops in between. They are...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Staff AI Infrastructure SRE. Xpeng Motors is a leading smart electric vehicle company that designs, develops, and manufactures cutting-edge EVs with advanced Internet, AI, and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. About the...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: Palo Alto Networks is seeking a highly skilled Senior Staff DevOps Engineer to join our Cloud Infrastructure team. As a key member of our team, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure to support our mission-critical applications. Key Responsibilities: Design and implement scalable,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our Cortex Data Lake team. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Contribute to the success of our SRE and DevOps teams by developing...