Senior Cloud Infrastructure and DevOps Solutions Architect

4 weeks ago


New York, New York, United States NVIDIA Full time
NVIDIA is a leader in computer graphics, artificial intelligence, and accelerated computing.

We are at the forefront of research and engineering around the greatest advances in technology.

Our history of innovation drives us to solve the world's hardest problems.

We are looking for a Senior Cloud Infrastructure/DevOps Solutions Architect to join our NVIDIA Infrastructure Specialist Team.

Academic and commercial groups around the world are using NVIDIA products to revolutionize deep learning and data analytics, and to power data centers.

We are building many of the largest and fastest AI/HPC systems in the world. We are looking for someone with the ability to work on a dynamic customer-focused team that requires excellent interpersonal skills.

The scope of these efforts includes a combination of networking, system design, and automation, as well as serving as the customer-facing point of contact.

Key Responsibilities:


Design, implement, and maintain large-scale HPC/AI clusters with monitoring, logging, and alerting.

Manage Linux job/workload schedulers and orchestration tools.

Develop and maintain continuous integration and delivery pipelines.


Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and alerting, and to enable self-service consumption of resources.

Deploy monitoring solutions for the servers, network, and storage.

Perform troubleshooting from bare metal to application level.


As a technical resource, develop, re-define, and document standard methodologies to share with internal teams.

Support Research & Development activities and engage in POCs/POVs for future improvements.


Requirements:


BS/MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields with at least 8 years of work or research experience in networking fundamentals, TCP/IP stack, and data center architecture.

Knowledge of HPC and AI solution technologies, from CPUs and GPUs to high-speed interconnects and supporting software.

Direct design, implementation, and management experience with cloud computing platforms (e.g. AWS, Azure, Google Cloud).

Experience with job scheduling workloads and orchestration technologies such as Slurm, Kubernetes, and Singularity.


Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, Wireshark, etc.) and internals, ACLs, OS-level security protection, and common protocols, e.g. TCP, DHCP, DNS, etc.

Experience with multiple storage solutions such as Lustre, GPFS, ZFS, and XFS. Familiarity with newer and emerging storage technologies.

Python programming and bash scripting experience.

Comfortable with automation and configuration management tools including Jenkins, Ansible, Puppet/Chef, etc.


Deep knowledge of networking protocols such as InfiniBand and Ethernet.

Deep understanding of and experience with virtual systems (for example VMware, Hyper-V, KVM, or Citrix).

Strong written, verbal, and listening skills in English are critical.

What Sets You Apart:

Knowledge of CPU and/or GPU architecture.

Knowledge of Kubernetes and container-related microservice technologies.

Experience with GPU-focused hardware/software (DGX, CUDA).

Background with RDMA (InfiniBand or RoCE) fabrics.

NVIDIA is a leader in the technology world and has some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you.

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.

We highly value diversity in our current and future employees and do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.


