Current jobs related to Cloud Infrastructure Specialist - Santa Clara, California - NVIDIA


  • Santa Clara, California, United States TEKsystems c/o Allegis Group Full time

    About the Role: We are seeking a skilled Cloud Infrastructure Specialist to join our team at TEKsystems c/o Allegis Group. As a key member of our infrastructure team, you will be responsible for deploying and maintaining our client's DGX Cloud infrastructure services running on top of bare metal. Key Responsibilities: Deploy and run cloud infrastructure services...


  • Santa Clara, California, United States NVIDIA Full time

    Cloud Infrastructure Optimization Specialist. NVIDIA is a leader in computer graphics, PC gaming, and accelerated computing. We're now leveraging AI to define the next era of computing. As a Cloud Infrastructure Optimization Specialist, you'll work with internal teams to optimize cloud resources, reducing costs and improving performance. Key Responsibilities:...


  • Santa Clara, California, United States Diverse Lynx Full time

    Job Description: At Diverse Lynx LLC, we are seeking a skilled Cloud Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure. Key Responsibilities: Design and implement cloud infrastructure solutions using AWS, Azure, or Google Cloud...


  • Santa Clara, California, United States eTeam Full time

    Job Title: Cloud Infrastructure Architect. We are seeking a highly skilled Cloud Infrastructure Architect to join our team at eTeam. As a Cloud Infrastructure Architect, you will be responsible for designing and implementing scalable, secure, and efficient cloud infrastructure solutions for our clients. Key Responsibilities: Design and implement cloud...


  • Santa Clara, California, United States eTeam Full time

    Job Title: Cloud Infrastructure Architect. We are seeking a highly skilled Cloud Infrastructure Architect to join our team at eTeam. As a key member of our team, you will be responsible for designing and implementing scalable, secure, and efficient cloud infrastructure solutions on Google Cloud Platform (GCP). Key Responsibilities: Design and implement cloud...


  • Santa Clara, California, United States eTeam Full time

    Job Title: Cloud Infrastructure Architect. We are seeking a highly skilled Cloud Infrastructure Architect to join our team at eTeam. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and secure cloud infrastructure on Google Cloud Platform. Key Responsibilities: Design and implement cloud...


  • Santa Clara, California, United States IT Management Corp. dba 101 VOICE Full time

    Job Title: IT Infrastructure Specialist. IT Management Corp. dba 101 VOICE is seeking an experienced IT Infrastructure Specialist to join our team. As an IT Infrastructure Specialist, you will play a pivotal role in designing, implementing, and managing our network infrastructure and systems, with a strong focus on cloud platforms and virtualization...


  • Santa Clara, California, United States eTeam Full time

    Job Description. Job Title: Cloud Infrastructure Architect. Location: Remote (with occasional travel). Job Type: Full-time. About eTeam: eTeam is a leading provider of cloud-based solutions, dedicated to delivering innovative and secure infrastructure to our clients. Job Summary: We are seeking an experienced Cloud Infrastructure Architect to join our team. The ideal...


  • Santa Clara, California, United States Oracle Full time

    Job Title: Senior Cloud Infrastructure Developer. We are seeking a highly skilled Senior Cloud Infrastructure Developer to join our Oracle Cloud Infrastructure (OCI) Platform Integration (PINT) team. As a key member of our team, you will be responsible for designing, implementing, and maintaining cloud infrastructure solutions that meet the needs of our...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Cloud Infrastructure Engineer to join our team at Palo Alto Networks. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating our cloud infrastructure to ensure high availability, scalability, and security. Key Responsibilities: Design and implement cloud...


  • Santa Clara, California, United States Astera Labs Full time

    Astera Labs Job Description: Astera Labs is a global leader in purpose-built connectivity solutions that unlock the full potential of AI and cloud infrastructure. Our Intelligent Connectivity Platform integrates PCIe, CXL, and Ethernet semiconductor-based solutions and the COSMOS software suite of system management and optimization tools to deliver a...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team at Palo Alto Networks. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating scalable and secure cloud infrastructure to support our mission-critical applications. Key Responsibilities: Design and...


  • Santa Clara, California, United States Astera Labs Full time

    Astera Labs: Transforming Data-Driven Applications. Astera Labs is a global leader in purpose-built connectivity solutions that unlock the full potential of AI and cloud infrastructure. Our Intelligent Connectivity Platform integrates PCIe, CXL, and Ethernet semiconductor-based solutions and the COSMOS software suite of system management and optimization tools...


  • Santa Clara, California, United States Oracle Full time

    Job Summary: We are seeking a highly skilled Senior Cloud Infrastructure Developer to join our team at Oracle. As a key member of our team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure. This is a unique opportunity to work with cutting-edge technology and be part of a dynamic team that is shaping the future of...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Title: Senior Cloud Infrastructure Engineer. Palo Alto Networks is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team. As a Senior Cloud Infrastructure Engineer, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. Key Responsibilities: Design and implement scalable cloud...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Our Mission: Palo Alto Networks is committed to protecting the digital way of life by providing innovative cybersecurity solutions. We believe in the power of collaboration and value in-person interactions, fostering a culture of innovation and creativity. Job Description: We are seeking a highly skilled Senior Staff DevOps Engineer to join our CDL/SLS team. As a...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Site Reliability Engineer. NVIDIA is seeking a highly skilled Senior Site Reliability Engineer to join our Infrastructure, Planning and Process (IPP) team. As a key member of our global organization, you will play a critical role in designing and implementing scalable, reliable, and efficient cloud infrastructure solutions. Our cloud services...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Palo Alto Networks is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our CDL/SLS team. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using Terraform,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team. As a key member of our infrastructure platform team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Our infrastructure platform stack includes Terraform, Kubernetes, GitLab...

Cloud Infrastructure Specialist

2 months ago


Santa Clara, California, United States NVIDIA Full time

Job Summary

NVIDIA is seeking a highly skilled Senior Cloud Engineer to join its Infrastructure, Planning and Processes organization. As a Senior Cloud Engineer, you will be part of a fast-paced team that develops and maintains NVIDIA's internal cloud provisioning product for GPUs and Tegra systems.

Key Responsibilities

  • Design and implement scalable, resilient cloud platforms in public and private clouds.
  • Administer and configure Kubernetes clusters, including cluster segmentation, internal/external networking, and monitoring.
  • Develop and improve CI/CD pipelines for container build and deployment.
  • Craft and develop tools for automating workflows and infrastructure management.
  • Develop, improve, and maintain infrastructure codebase.
  • Craft and implement critical metrics using various analytics methods and dashboards.
  • Participate in prototyping, crafting, and developing cloud infrastructure for NVIDIA.
  • Use AI techniques to extract useful signals about machines and jobs from generated data.

Requirements

  • Extensive experience building scalable, resilient platforms in public and private clouds.
  • High proficiency in administering and configuring Kubernetes.
  • Proficient with CI/CD tools such as Jenkins, GitLab CI, GitHub Actions, and ArgoCD.
  • Experience with data analytics/visualization tools like Kibana, Grafana, Splunk, etc.
  • Strong Ansible skills. Experience with other configuration tools like Chef and Puppet is also beneficial.
  • Proficient using source code management and binary repository systems like GitLab, GitHub, Artifactory, Perforce, etc.
  • Knowledge of monitoring systems such as Zabbix, Alertmanager, PagerDuty, and/or similar systems.
  • Well-versed in Prometheus, writing custom exporters, and PromQL.
  • 8+ years of proven experience.
  • Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience.

Preferred Qualifications

  • Experience managing NVIDIA hardware like GPUs and Tegras.
  • Background with GitLab CI.
  • Experience with building and deploying containers.
  • Solid understanding of containerization and microservices architecture.
  • Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS), and Certified Kubernetes Application Developer (CKAD) certifications.

About NVIDIA

NVIDIA is a technology leader in artificial intelligence, graphics processing units (GPUs), and high-performance computing. We are committed to fostering a diverse work environment and are proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.