Senior DevOps Engineer, Kubernetes

3 weeks ago


Santa Clara, California, United States NVIDIA Full time

NVIDIA is seeking a skilled Senior DevOps Engineer to join our team. As a key member of our organization, you will play a crucial role in designing and building Linux-based management agents, CLI tools, and end-to-end integration solutions that combine GPUs with the rest of the data center software management ecosystem.

You will work closely with the broader NVIDIA team to ensure the reliable and secure delivery of our data-center monitoring products. This includes maintaining and improving CI/CD pipelines on Jenkins, GitLab, and GitHub, as well as contributing to the development and release infrastructure of the team.

To succeed in this role, you must have a strong Linux background, familiarity with state-of-the-art CI/CD, Docker, Shell/Python scripting, a proven work ethic, and strong attention to detail. You will be expected to jump in quickly and provide valuable contributions from day one.

Key Responsibilities:

  • Create and maintain Helm charts for custom software deployment.
  • Create and maintain development environments that use technologies such as k3d, kind, Tilt, and helmfile.
  • Utilize and implement best practices for software delivery in Kubernetes environments.
  • Create and maintain CI/CD pipelines on Jenkins, GitLab, and/or GitHub.
  • Improve and maintain integrations with static-analysis tools such as Coverity to ensure the quality of our products.
  • Interface with internal NVIDIA tooling to enable the signing and publishing of our products.
  • Configure CI/CD runners and integrations with version control systems.
  • Create and manage Infrastructure-as-Code configurations with tools like Terraform or Ansible to provision and manage infrastructure.

Requirements:

  • BS or higher in Computer Science or equivalent experience.
  • 5+ years of meaningful industry experience with a strong DevOps background.
  • Experience maintaining and debugging CI/CD pipelines on Jenkins/GitLab/GitHub.
  • Experience with containerized environments (Docker, cri-o, podman).
  • Business-level English. Outstanding written and verbal interpersonal skills.
  • Strong motivation and commitment to learn new skills.
  • Ability to execute all aspects of the software development lifecycle.
  • Ability to manage time in a fast-paced, heavily multitasked environment.
  • Experience with container orchestration platforms like Kubernetes, including availability and scaling solutions.

NVIDIA is an Equal Opportunity Employer:

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.



  • Santa Clara, California, United States NVIDIA Full time

    Job Description: NVIDIA is seeking a highly skilled DevOps Engineer to join our team. As a DevOps Engineer, you will play a critical role in designing and building Linux-based management agents, CLI tools, and end-to-end integration solutions that combine GPUs with the rest of the data center software management ecosystem. You will work closely with the broader...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled and experienced DevOps Expert for Robotics to join our dynamic NVIDIA team. The ideal candidate will have a strong background in managing and optimizing software development and deployment processes, with expertise in Monorepo, Bazel, Git, Linux, Jenkins, Docker, Kubernetes, and Python. This role will involve leading the DevOps...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled and experienced DevOps Engineer to join our dynamic NVIDIA Robotics team. The ideal candidate will have a strong background in managing and optimizing software development and deployment processes, with expertise in Monorepo, Bazel, Git, Linux, Jenkins, Docker, Kubernetes, and Python. You will be working on many open-source and...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior DevOps Engineer to join our Data and Application Services team at NVIDIA. The ideal candidate will have a strong background in cloud infrastructure, automation, and Kubernetes. As a Senior DevOps Engineer, you will be responsible for designing, implementing, and maintaining our multi-tenant Kubernetes platform. You will...


  • Santa Clara, California, United States United Software Group Full time

    Job Title: Senior DevOps Engineer in Santa Clara, CA. Job Description: We are seeking a highly skilled Senior DevOps Engineer to join our team at United Software Group. The successful candidate will be responsible for setting the strategy for how we build, deploy, monitor, and operate our applications. Responsibilities: Set direction and strategy for the DevOps...

  • DevOps Engineer

    4 weeks ago


    Santa Clara, California, United States Selector Software Full time

    Job Overview: Selector Software is seeking a skilled DevOps Engineer to play a pivotal role in ensuring the reliability, scalability, and performance of our cutting-edge AIOps platform. As a key member of our team, you will be responsible for overseeing the software delivery lifecycle, from infrastructure provisioning and configuration management to monitoring...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Sr Staff Site Reliability Engineer to join our CDL/SLS team at Palo Alto Networks. As a key member of our engineering team, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. As a Sr Staff Site Reliability Engineer, you will contribute to the success of our SRE...


  • Santa Clara, California, United States Oracle Corporation Full time

    About the Role: We are seeking a highly skilled DevOps Engineer to join our team at Oracle Corporation. As a DevOps Engineer, you will play a critical role in the development and deployment of our cloud-based solutions. Key Responsibilities: Design and implement automated deployment and testing processes. Collaborate with cross-functional teams to identify and...


  • Santa Clara, California, United States Roche Holdings Inc. Full time

    About the Role: Roche is seeking a Principal DevOps Engineer to lead the QCS Algorithms deployments. The ideal candidate will have experience in designing and developing build, release, and deploy toolchains for DevOps, as well as setting up and managing parity across development, staging, and production environments in cloud infrastructure. Key...


  • Santa Clara, California, United States Agilent Technologies Full time

    Job Title: Senior DevOps/Build Engineer. Job Summary: Agilent Technologies is seeking a skilled Senior DevOps/Build Engineer to join our Software Engineering team. As a key member of our team, you will be responsible for managing software builds and installations centered on our goal of delivering a unified customer experience by integrating with the OpenLab...

  • Software Engineer

    4 weeks ago


    Santa Clara, California, United States Rootshell Enterprise Technologies Inc. Full time

    Job Description: We are seeking a highly skilled Software Engineer (Java) to join our team at Rootshell Enterprise Technologies Inc. in Santa Clara, CA. As a key member of our team, you will be responsible for designing, developing, and deploying scalable, maintainable, and high-performance microservices using Spring Boot and Java. Key Responsibilities: Design...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Overview: Palo Alto Networks is seeking a highly skilled Cloud Infrastructure Engineer to join our CDL/SLS team. As a Senior Staff Site Reliability Engineer, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Our team is at the forefront of innovation, constantly pushing the boundaries of what is...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Palo Alto Networks is seeking a highly skilled Senior Cloud Infrastructure Engineer to join our CDL/SLS team. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using Terraform,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Palo Alto Networks is seeking a highly skilled Senior Staff Security Engineer to lead our vulnerability management efforts. As a key member of our security team, you will be responsible for securing our expansive, multi-cloud and containerized infrastructure. You will manage the complexities of vulnerability detection and remediation across...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: Palo Alto Networks is seeking a highly skilled Senior DevSecOps Engineer to join our InfoSec team. As a key member of our team, you will be responsible for building, maintaining, and scaling highly critical security production services in a Google Cloud Platform environment. Key Responsibilities: Design and implement secure cloud-based...


  • Santa Clara, California, United States Exostellar, Inc. Full time

    We are seeking a highly skilled Senior Software Quality Assurance Engineer to join our team at Exostellar, Inc. The ideal candidate will have extensive experience in designing and implementing automated test frameworks for cloud-native applications, as well as a strong background in software engineering and testing methodologies. The successful candidate will...

  • DevOps Engineer

    4 weeks ago


    Santa Clara, California, United States ServiceNow Full time

    At ServiceNow, we're transforming the way organizations work by harnessing the power of artificial intelligence and machine learning. As a DevOps Engineer on our Advanced Technology Group, you'll play a key role in building and deploying cloud-based AI/ML solutions that empower our customers to find smarter, faster, and better ways to work. We're looking for...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (NeMo, Megatron Core) team. NVIDIA NeMo is an open-source, scalable, and cloud-native framework built for researchers and developers working on Large Language Models (LLM), Multimodal (MM), and Speech AI. NeMo provides end-to-end model training, including data...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Palo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team. As a key member of our infrastructure team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Key Responsibilities: Develop expertise in new technologies and contribute to the...


  • Santa Rosa, California, United States Cynet Systems Full time

    Job Title: Senior DevOps Engineer. Job Summary: Cynet Systems is seeking a highly skilled Senior DevOps Engineer to design, implement, and test dev ops tools, including build and deploy pipeline tools. The ideal candidate will have 10 years of dev ops experience and 3 years of experience in a cloud environment. Responsibilities: Design, implement, and test dev...