Senior Infrastructure Engineer
1 week ago
Join the Sustainable Talent team, supporting NVIDIA as a Senior Infrastructure Engineer supporting the IPP (Infrastructure, Planning and Process) Cloud Infrastructure Team. This is a W-2 full-time 1 year contract based in Santa Clara, CA. We offer competitive pay $80- $100/hr based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture
What You'll Be Doing:
- Collaborate with the Infrastructure Team to manage and optimize operations within our Infrastructure and Cloud environments, with a strong focus on large-scale system configurations and automation.
- Lead the deployment, configuration, and troubleshooting of data center and cloud-based infrastructures, ensuring efficient operations for NVIDIA's latest hardware and technologies.
- Design and implement automated solutions for product onboarding into our hosted and private cloud environments, utilizing robust scripting techniques.
- Work closely with engineers, architects, and product managers to strategize and execute product launches, enhancing deployment processes.
- Tackle complex challenges related to multi-site deployments of NVIDIA products, applying innovative problem-solving skills.
- Partner with multi-functional teams, including system engineering, software engineering, and operations, to deliver reliable and scalable platforms from concept to production.
- Focus on managing systems at scale, writing code for simultaneous configuration of multiple servers, and improving deployment efficiency, including API integrations for automation.
- Bachelor's or Master's Degree in Computer Science, Software Engineering, or a related field, or equivalent practical experience.
- 5+ years of relevant experience, with a strong emphasis on DevOps practices.
- 3+ years of experience with Linux systems and scripting (Bash, Python).
- Solid background in managing large-scale infrastructure operations with an emphasis on automation and configuration management.
- Proven ability to quickly adapt to and implement new technologies, including system-level operations and tools.
- Strong understanding of embedded systems, orchestration, data centers, and cloud architecture, along with excellent communication and planning skills.
- Experience in product engineering, debugging, and hardware configuration, with a focus on system-level operations.
- Experience in large-scale QA environments and product bring-ups.
- Familiarity with operations support, bug tracking, and ticket management.
- Background in supporting GPUs, embedded device development, and CUDA applications.
- Knowledge of converged and hyper-converged infrastructure.
- Experience with configuration management tools (e.g., Puppet, Chef) for hardware setups.
- Strong expertise in system configuration protocols (e.g., IPMI/BMC, Redfish).
- Knowledge of CI/CD tools like Jenkins for automating deployment pipelines.
- Experience working with APIs for system communication and automation.
- Strong hardware knowledge, particularly in configuring hardware components (e.g., BIOS, CPU) in large-scale environments.
- Experience configuring BIOS settings remotely in large hardware deployments.
- Proven experience in configuring systems at scale, focusing on automation and efficiency.
- Familiarity with tools for managing remote server configurations, including BMC/IPMI systems.
- Ideal candidates may have experience from companies like Dell, IBM, or HP, or in organizations that produce servers or operate on-premise cloud solutions.
Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.
-
Santa Clara, CA, United States NVIDIA Full timeWe are seeking a Senior Software Engineer to join a new team building the foundational infrastructure for Robotics Research. This new team will work very closely with NVIDIA’s Generalist Embodied Agent Research (GEAR) group. The near term focus is Project GR00T, NVIDIA’s moonshot initiative at building foundation models and full-stack technology for...
-
Senior Lab Engineer
3 days ago
Santa Clara, CA, United States Apple Full timeRole Number: 200633459-3760 Summary The Device Services (DS) Infrastructure team is seeking a senior operations engineer who thrives in complex operational environments and can drive strategic change across our lab ecosystem. We manage a fleet of thousands of Apple devices used in a centralized, automated test system which is a critical tool used by...
-
Senior Principal Engineer
21 hours ago
Santa Clara, CA, United States Oracle Full timeJob Description Interested in leading cutting edge Infrastructure Research and Development initiatives? Look no further! Oracle Cloud Infrastructure (OCI) is pioneering new initiatives to build a new class of infrastructure and platform services from the ground up. You will lead one of these initiatives, and play a critical technical leadership role in...
-
Senior Principal Engineer
4 days ago
Santa Clara, CA, United States Oracle Full timeJob Description Interested in leading cutting edge Infrastructure Research and Development initiatives? Look no further! Oracle Cloud Infrastructure (OCI) is pioneering new initiatives to build a new class of infrastructure and platform services from the ground up. You will lead one of these initiatives, and play a critical technical leadership role in...
-
Senior Principal Engineer
1 week ago
Santa Clara, CA, United States Oracle Full timeJob Description Interested in leading cutting edge Infrastructure Research and Development initiatives? Look no further! Oracle Cloud Infrastructure (OCI) is pioneering new initiatives to build a new class of infrastructure and platform services from the ground up. You will lead one of these initiatives, and play a critical technical leadership role in...
-
Sr Software Engineer
4 days ago
Santa Clara, CA, United States Oracle Full timeJob DescriptionJob Description Oracle Cloud Infrastructure (OCI) is looking for a Senior Software Engineer - AI Infrastructure to lead the development of scalable, resilient, and secure infrastructure systems that underpin the core of OCI's compute platform. This role sits within the Host Provisioning Services (HoPS) team, which owns the critical...
-
Sr Software Engineer
1 week ago
Santa Clara, CA, United States Oracle Full timeJob DescriptionJob Description Oracle Cloud Infrastructure (OCI) is looking for a Senior Software Engineer - AI Infrastructure to lead the development of scalable, resilient, and secure infrastructure systems that underpin the core of OCI's compute platform. This role sits within the Host Provisioning Services (HoPS) team, which owns the critical...
-
Senior Product Engineer
4 days ago
Santa Clara, CA, United States Tata Consultancy Services Full timeSenior Software Engineer - FullstackLocation: Santa Clara, CA Experience Level: Senior (5-10 years) Team: Product Engineering At Digitate, we are at the forefront of innovation, building next-generation solutions that redefine our industry. We believe in the transformative power of artificial intelligence and are looking for a passionate and experienced...
-
Senior Product Engineer
7 days ago
Santa Clara, CA, United States Tata Consultancy Services Full timeSenior Software Engineer - FullstackLocation: Santa Clara, CA Experience Level: Senior (5-10 years) Team: Product Engineering At Digitate, we are at the forefront of innovation, building next-generation solutions that redefine our industry. We believe in the transformative power of artificial intelligence and are looking for a passionate and experienced...
-
Senior Product Engineer
11 hours ago
Santa Clara, CA, United States Tata Consultancy Services Full timeSenior Software Engineer - FullstackLocation: Santa Clara, CA Experience Level: Senior (5-10 years) Team: Product Engineering At Digitate, we are at the forefront of innovation, building next-generation solutions that redefine our industry. We believe in the transformative power of artificial intelligence and are looking for a passionate and experienced...