Lead Solutions Architect for AI/ML Storage Systems

2 weeks ago


Santa Clara, California, United States NVIDIA Full time


As a Lead Solutions Architect focusing on AI/ML Storage Systems, you will play a crucial role in our innovative team, contributing to the development, implementation, and management of cutting-edge storage solutions designed specifically for Artificial Intelligence and Machine Learning applications.

This position encompasses a variety of areas, including software engineering, systems architecture, data management, and service delivery.

Your key responsibilities will include ensuring the robustness of storage solutions, optimizing data management processes, and delivering services that enhance the overall stability and performance of production environments.


In your role as a Solutions Architect at NVIDIA, you will be tasked with maintaining the reliability and availability of our GPU cloud services, both internally and externally, in line with our commitments to our users.

Additionally, you will support developers in executing system modifications through careful planning and preparation, with a strong emphasis on capacity, latency, and performance considerations.

This position requires a proactive mindset and a comprehensive set of engineering methodologies aimed at improving production system efficiency and implementing effective optimizations.

A significant focus of our software development initiatives is on automating processes, enhancing performance, and improving the overall efficiency of production systems.

Given the extensive responsibility for understanding the interconnectivity of our systems, you will utilize a wide array of tools and techniques to tackle diverse challenges.

This role offers an engaging and dynamic work environment, prioritizing continuous improvement and ensuring the success of our AI/ML initiatives. The culture at NVIDIA values diversity, intellectual curiosity, problem-solving, and openness, which are essential to our success. Our organization unites individuals from various backgrounds, experiences, and perspectives, encouraging collaboration, innovative thinking, and risk-taking in a supportive environment.

We foster self-direction in pursuing meaningful projects while striving to create an atmosphere that provides the necessary support and mentorship for professional growth.


Key Responsibilities:

Architectural Design:

Collaborate with cross-functional teams to design and deploy storage architectures optimized for AI/ML workloads, ensuring scalability, performance, and reliability.


Technical Proficiency:

Leverage extensive knowledge of storage technologies, including Lustre and Cloud storage, to create and implement solutions tailored to the unique demands of AI/ML training and inference workloads. Stay updated with industry trends and advancements in storage technologies to continually enhance the company's capabilities.


Cloud Integration:
Utilize expertise in GCP, AWS, and Azure to integrate storage solutions that emphasize efficiency, reliability, and cost-effectiveness.

Demonstrate hands-on experience in designing and deploying storage solutions, taking ownership of the entire process and troubleshooting as necessary.

Technical Advisory:
Provide expert guidance and consultation to clients and internal teams regarding storage solutions, aligning with AI/ML best practices.

Performance Enhancement:
Assess and optimize storage systems for AI/ML applications to meet performance benchmarks and improve overall system efficiency.

Seamless Integration:
Ensure that storage solutions are integrated smoothly with AI/ML frameworks, enhancing compatibility and resource utilization.

Team Collaboration:
Work closely with data scientists, engineers, and stakeholders to comprehend AI/ML storage needs and propose customized solutions.

Documentation:
Develop comprehensive technical documentation for AI/ML storage architectures, guidelines, and best practices.

Innovation and Trends:
Stay informed about industry trends and emerging technologies in AI/ML and storage to drive innovation and continuous improvement.

Qualifications:
Demonstrated experience in designing and implementing storage architectures for AI/ML workloads, with a focus on scalability and performance.

Strong technical expertise in storage technologies, AI/ML frameworks, and their integration. Proficiency in programming languages commonly used in AI/ML, such as Python, is advantageous.

Excellent communication and interpersonal skills to effectively convey complex technical concepts to both technical and non-technical audiences.

Proven ability to analyze and resolve complex technical challenges related to AI/ML storage architectures.


A collaborative mindset with the ability to work effectively in cross-functional teams and engage with clients to understand their specific AI/ML storage needs.

Prior hands-on coding experience for storage systems.

Master's degree in Computer Science, Engineering, or a related field or equivalent experience.

3+ years of relevant experience.

Preferred Qualifications:
Certifications in relevant technologies, such as NVIDIA Deep Learning Institute (DLI) certifications. Previous experience with cloud-based AI/ML services and storage solutions.

Familiarity with container orchestration platforms like Kubernetes. Adaptability to different working styles.

NVIDIA is recognized as one of the most desirable employers in the technology sector. We have some of the most innovative and dedicated professionals in the industry working with us. If you are creative and self-driven, we want to hear from you.


NVIDIA's invention of the GPU in 1999 catalyzed the growth of the PC gaming market, transformed modern computer graphics, and revolutionized parallel computing.

More recently, GPU deep learning has ignited the era of modern deep learning — with the GPU serving as the brain of computers, robots, and self-driving vehicles that can perceive and understand their environment.

Today, we are increasingly recognized as "the AI computing company." We are looking to expand our team with the most thoughtful individuals in the world.

The base salary range is competitive and will be determined based on your location, experience, and the compensation of employees in similar roles.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer.

We highly value diversity in our current and future employees and do not discriminate (including in our hiring and promotion practices) based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.



  • Santa Clara, California, United States P17 Solutions Full time

    P17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to lead innovative projects. In this role, you will engage with cutting-edge computing technologies, collaborating with top-tier clients to implement advanced AI solutions.Key ResponsibilitiesStay abreast of the latest advancements...


  • Santa Clara, California, United States P17 Solutions Full time

    OverviewP17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to support our innovative projects. This role involves working with cutting-edge computing technologies and collaborating with leading enterprises to drive advancements in AI.Key ResponsibilitiesStay updated on the latest...


  • Santa Clara, California, United States P17 Solutions Full time

    P17 Solutions is seeking a talented Solutions Architect with a strong background in Machine Learning (ML) and Deep Learning (DL) to enhance our technical capabilities. This role is pivotal in collaborating with leading technology firms to implement cutting-edge AI solutions both on-premises and in cloud environments.Key ResponsibilitiesStay abreast of...


  • Santa Clara, California, United States Amazon Full time

    Are you a technology enthusiast with a passion for innovation? Do you thrive in environments where hands-on experimentation is valued over mere discussion? If you possess a deep understanding of cloud architectures and are quick to adapt to new technologies, we want to hear from you. About the Role: As a Senior AI/ML Solutions Architect within the AWS...


  • Santa Clara, California, United States P17 Solutions Full time

    OverviewP17 Solutions is seeking a highly skilled Solutions Architect with a focus on Machine Learning and Deep Learning technologies. This role involves deploying advanced ML and DL models both on-premises and in cloud environments. As part of our Solution Architecture team, you will collaborate with leading technology companies, utilizing cutting-edge...


  • Santa Clara, California, United States Amazon Full time

    Are you enthusiastic about Generative AI (GenAI)? Do you aspire to shape the future of Go to Market (GTM) strategies at Amazon Web Services (AWS) through generative AI? In this position, you will assist our prominent clients in constructing and deploying GenAI-enabled applications utilizing Amazon Bedrock and SageMaker. You will fine-tune and develop...


  • Santa Clara, California, United States NVIDIA Full time

    We are currently seeking an AI Solutions Architect at NVIDIA, focusing on cloud infrastructure and hyperscale environments. Your primary role will involve spearheading technical engagements for AI/ML software with customers deploying systems at an extensive scale. Collaborating across various teams within NVIDIA and with our clients, you will play a crucial...


  • Santa Clara, California, United States P17 Solutions Full time

    Position OverviewP17 Solutions is seeking a highly skilled Solutions Architect with a strong background in Machine Learning and Deep Learning. This role involves deploying advanced ML and DL models both on-premises and in cloud environments. As part of our dedicated architecture team, you will engage with cutting-edge computing technologies, driving...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is at the forefront of the AI revolution, and we are seeking a seasoned Cloud Solutions Architect to facilitate the integration of GPU technology and software for our clients. This role involves crafting and implementing Machine Learning (ML), Deep Learning (DL), and data analytics solutions across various Cloud Computing Platforms. As a vital member...

  • Solutions Architect

    2 days ago


    Santa Clara, California, United States NVIDIA Corporation Full time

    Solutions Architect - AI and HPC Cloud ExpertNVIDIA Corporation is seeking a highly skilled Solutions Architect to join its Cloud Infrastructure Team. As a key member of the team, you will be responsible for designing and implementing sophisticated cloud solutions that cater to the infrastructure needs of various NVIDIA groups, including Graphics Processors,...


  • Santa Clara, California, United States Eightfold Full time

    About EightfoldEightfold is a leading innovator in the AI-driven HR tech space, pushing the boundaries of how organizations find, manage, and empower their talent. Our cutting-edge AI platform is revolutionizing the industry, and we're seeking exceptional engineers to join our team and drive the next wave of advancements.About the AI/ML TeamOur AI/ML team is...

  • Data Scientist

    16 hours ago


    Santa Clara, California, United States JCW Group Full time

    Job Title: Data Scientist - AI and ML ExpertAbout JCW Group: JCW Group is a leading provider of innovative solutions in the field of data science and artificial intelligence. We partner with top companies to deliver cutting-edge AI and ML solutions that drive business growth and success.Job Summary: We are seeking an experienced Data Scientist with a strong...


  • Santa Clara, California, United States JCW Group Full time

    JCW Group is collaborating with a leading Data Science firm that is addressing intricate challenges through the use of Artificial Intelligence and Machine Learning. Following a significant merger with a prominent technology organization, they are in search of an Azure Solutions Architect specializing in Machine Learning.The successful candidate will be...


  • Santa Clara, California, United States JCW Group Full time

    JCW Group is collaborating with a prominent Data Science firm that is addressing intricate challenges through the application of Artificial Intelligence and Machine Learning. Following a significant merger with a major technology enterprise, they are in search of an Azure Solutions Architect with a specialization in Machine Learning.The successful candidate...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is on the lookout for a skilled Lead AI Systems Engineer to become a vital part of our Autonomous Vehicles division. In this position, you will leverage artificial intelligence to enhance Autonomous Vehicle perception, contributing to the development of our cutting-edge autonomous driving technology. We seek an innovative and inquisitive engineer who...


  • Santa Clara, California, United States E-Solutions Full time

    Position OverviewWe are seeking a talented Gen AI Specialist to join our innovative team at E-Solutions. As a key player in our organization, you will be responsible for developing advanced AI solutions that enhance user experiences and streamline operations.Key ResponsibilitiesPrompt Engineering and AI Chatbot Development: Design and implement sophisticated...


  • Santa Clara, California, United States E-Solutions Full time

    Position OverviewWe are seeking a talented Gen AI Specialist to join our dynamic team at E-Solutions, a global leader in workforce solutions.Role ResponsibilitiesPrompt Engineering and AI Chatbot Development: Design and implement advanced AI chatbot solutions, focusing on quality assurance, translation, and search/summarization capabilities.Foundation Models...


  • Santa Clara, California, United States TechStar Group Full time

    Position: Artificial Intelligence ResearcherLocation: Santa Clara, CADuration: Long TermAs an AI Research Engineer, you will leverage your knowledge in various domains of artificial intelligence, including:- Extraction of critical context from datasets- Sequential Decision Making and Recommendation Systems- Generative AI TechniquesYou will work alongside...


  • Santa Clara, California, United States Eightfold Full time

    About EightfoldEightfold AI is the industry leader in AI-powered talent intelligence and transforming the way organizations manage their talent. Our AI-powered Talent Intelligence Platform helps companies identify, attract, and retain top talent, while also providing employees with the tools they need to grow and succeed in their careers.About The AI/ML...


  • Santa Clarita, California, United States TechnoGen Inc Full time

    Job OverviewTechnoGen Inc is seeking a highly skilled Program Manager with AI/ML Expertise to lead our AI/ML initiatives. As a key member of our team, you will be responsible for identifying and prioritizing AI use cases, leading the triaging and risk assessment process, and improving our AI governance and risk assessment process.Key...