Lead Cloud Solutions Engineer

2 weeks ago


Santa Clara, California, United States NVIDIA Full time

NVIDIA is seeking exceptional software engineers to enhance our enterprise GPU management and monitoring solutions. In this position, you will collaborate with the broader NVIDIA team to architect and develop Linux-based management agents, Kubernetes integrations, and comprehensive integration solutions that merge GPUs with the overall datacenter software management ecosystem. Our focus is on supporting NVIDIA products across HPC, cloud, and enterprise environments on both bare metal and virtualized platforms as the role of GPUs continues to evolve. Your contributions will encompass various facets of GPU system integration, including telemetry and metrics, health assessments, diagnostics, configuration, and system oversight. These tools serve both passive background monitoring and active online management, emphasizing operational transparency and seamless integration within customer environments. Your code will support everything from single-node developer systems to extensive clusters comprising thousands of nodes.

Key Responsibilities:

  • Design and maintain robust, scalable Go applications within a Kubernetes framework.
  • Develop and sustain user-space applications, containers, Go-bindings, and command-line interface tools.
  • Facilitate GPU management integration with cutting-edge open-source ecosystems, including Kubernetes and Docker.
  • Assist internal and external users through bug resolutions, documentation, and feature enhancements.
  • Ensure high-quality products through comprehensive test coverage.

Qualifications:

  • Bachelor's degree or higher in Computer Science or equivalent experience.
  • Over 5 years of relevant industry experience with a strong background in Go and Kubernetes development.
  • Expertise in user space development and debugging within Linux environments.
  • Proficient in business-level English.
  • Experience with APIs and interface design.
  • Excellent written and verbal communication skills.
  • Strong motivation and commitment to acquiring new skills.
  • Proficient in executing all phases of the software development lifecycle.
  • Ability to manage time effectively in a fast-paced, multitasked environment.

Preferred Qualifications:

  • Development experience with Python, Go, C, C++, and/or Rust. Familiarity with Jenkins and GitHub/GitLab CI/CD pipelines. Background in containers, common orchestration frameworks, and logging/telemetry backends.
  • Experience with APIs and interface design. Exposure to GPU programming with CUDA. Experience in developing and maintaining enterprise software. Familiarity with job schedulers like Slurm or K8s-scheduler.
  • Understanding of Kubernetes internals. Experience in developing Kubernetes operators and knowledge of resource allocations.

NVIDIA is widely recognized as one of the most desirable employers in the technology sector. We pride ourselves on having some of the most innovative and dedicated individuals in the industry. If you are creative and self-motivated, we encourage you to reach out.

The base salary range is competitive and will be determined based on your location, experience, and the compensation of employees in similar roles. You will also be eligible for equity and benefits. NVIDIA values diversity in our workforce and is proud to be an equal opportunity employer.



  • Santa Clara, California, United States P17 Solutions Full time

    P17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to lead innovative projects. In this role, you will engage with cutting-edge computing technologies, collaborating with top-tier clients to implement advanced AI solutions.Key ResponsibilitiesStay abreast of the latest advancements...


  • Santa Clara, California, United States P17 Solutions Full time

    Position OverviewP17 Solutions is seeking a highly skilled Solutions Architect with a strong background in Machine Learning and Deep Learning. This role involves deploying advanced ML and DL models both on-premises and in cloud environments. As part of our dedicated architecture team, you will engage with cutting-edge computing technologies, driving...


  • Santa Clara, California, United States P17 Solutions Full time

    P17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to enhance our technical capabilities. This role is pivotal in collaborating with leading technology firms to implement cutting-edge AI solutions both on-premises and in cloud environments.Key ResponsibilitiesStay abreast of...


  • Santa Clara, California, United States P17 Solutions Full time

    OverviewP17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to support our innovative projects. This role involves working with cutting-edge computing technologies and collaborating with leading enterprises to drive advancements in AI.Key ResponsibilitiesStay updated on the latest...


  • Santa Clara, California, United States Summit Healthcare Inc Full time

    We are thrilled to present an opportunity for a Cloud Solutions Architect at Summit Healthcare Inc. We are in search of a dedicated professional with a keen interest in artificial intelligence and machine learning. If you are passionate about engaging in initiatives that redefine the possibilities of cloud-scale AI, we encourage you to explore this role.Key...


  • Santa Clara, California, United States P17 Solutions Full time

    OverviewP17 Solutions is seeking a highly skilled Solutions Architect with a focus on Machine Learning and Deep Learning technologies. This role involves deploying advanced ML and DL models both on-premises and in cloud environments. As part of our Solution Architecture team, you will collaborate with leading technology companies, utilizing cutting-edge...


  • Santa Clara, California, United States Amazon Full time

    Join Our Team as a Lead Software EngineerAre you ready to influence the evolution of computing within the Amazon Web Services cloud? The EC2 Enterprise Workloads division is dedicated to solving complex challenges faced by enterprise clients through innovative cloud solutions. Our team leverages state-of-the-art technologies to create extensive platforms...


  • Santa Clara, California, United States Summit Healthcare Inc Full time

    We are thrilled to present an opportunity for a Cloud Solutions Architect at Summit Healthcare Inc. We are in search of a dedicated professional with a keen interest in artificial intelligence and machine learning. If you are passionate about engaging in initiatives that redefine the possibilities of cloud-scale AI, we encourage you to explore this role.Key...


  • Santa Clara, California, United States Summit Healthcare Inc Full time

    We are thrilled to present an opportunity for a Cloud Solutions Architect at Summit Healthcare Inc. We are in search of a dedicated professional with a keen interest in artificial intelligence and machine learning. If you are passionate about engaging in initiatives that redefine the possibilities of cloud-scale AI, we encourage you to explore this role.Key...


  • Santa Clara, California, United States Oracle Full time

    Job OverviewJoin our team as we develop innovative tools and services for cutting-edge Infrastructure-as-a-Service technologies that function at scale within a distributed multi-tenant cloud ecosystem.Our goal is to deliver exceptional DevOps solutions that empower our clients to efficiently manage their code, builds, and deployments within Oracle Cloud...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job OverviewYour Career JourneyUtilize your expertise in backend Java cloud engineering to contribute to cutting-edge cloud software and web applications. Join us in deploying and scaling the next generation of cloud security, leveraging big data and analytics.We are seeking a Principal Engineer to be part of the team dedicated to developing our latest cloud...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is at the forefront of the AI revolution, and we are seeking a seasoned Cloud Solutions Architect to facilitate the integration of GPU technology and software for our clients. This role involves crafting and implementing Machine Learning (ML), Deep Learning (DL), and data analytics solutions across various Cloud Computing Platforms. As a vital member...


  • Santa Clara, California, United States Couchbase Full time

    About the RoleCouchbase is seeking a highly skilled Cloud Solutions Engineer to join our team. As a Cloud Solutions Engineer, you will play a critical role in supporting the rapidly growing Couchbase user community and driving customer success.Key ResponsibilitiesTechnical Field Expertise: Provide technical field expertise to customers, explaining NoSQL...


  • Santa Ana, California, United States Solugenix Corp Full time

    Position: Lead Cloud Solutions EngineerLocation: RemoteContract Duration: 6-Month ContractCompany Overview:Solugenix Corp is collaborating with a prominent financial services organization in search of a Lead Cloud Solutions Engineer. This role is a contract opportunity that allows for remote work.Essential Qualifications:AWS CertificationProficient in AWS...


  • Santa Clara, California, United States NetScaler Full time

    About the TeamCitrix and TIBCO recently merged to create Cloud Software Group, a leading provider of cloud-based solutions for enterprise customers. Our team is responsible for developing and maintaining the security features of our flagship product, NetScaler.Job DescriptionJob SummaryWe are seeking an experienced Principal Software Engineer to lead the...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job OverviewCompany OverviewPalo Alto Networks is dedicated to safeguarding our digital existence. Our mission is to be the premier cybersecurity partner, ensuring a secure and safe environment for everyone.VisionWe envision a future where each day is more secure than the last. Our foundation is built on innovation and a commitment to redefining the...


  • Santa Ana, California, United States ExpertHiring Full time

    Job OverviewPosition: Senior Cloud EngineerCompany: ExpertHiringJob Type: ContractLocation: RemoteCompensation: Competitive SalaryRole SummaryAs a Senior Cloud Engineer, you will play a pivotal role in the development and implementation of cloud-based infrastructure solutions. Your expertise will be essential in creating repeatable processes and ensuring the...


  • Santa Clara, California, United States Summit Healthcare Inc Full time

    We are thrilled to present an opportunity for a Cloud Solutions Architect at Summit Healthcare Inc. We are in search of a dedicated professional with a keen interest in artificial intelligence and machine learning. If you are passionate about engaging in initiatives that redefine the possibilities of cloud-scale AI, we encourage you to explore this role.Key...


  • Santa Clara, California, United States Summit Healthcare Inc Full time

    We are thrilled to present an opportunity for a Cloud Solutions Architect at Summit Healthcare Inc. We are looking for a dedicated professional with a keen interest in artificial intelligence and machine learning. If you are passionate about engaging in initiatives that redefine the possibilities of cloud-scale AI, we encourage you to explore this role.Key...


  • Santa Clara, California, United States P17 Solutions Full time

    About the RoleWe are seeking a highly skilled Machine Learning Engineer/Solution Architect to join our team at P17 Solutions. As a key member of our organization, you will be responsible for designing and implementing cutting-edge AI and Machine Learning solutions for our clients.Key ResponsibilitiesLead Software/Application Customer Technical Engagements:...