Senior Manager

3 weeks ago


Santa Clara, United States NVIDIA Full time

As a Sr Manager in Site Reliability Engineering (SRE), you will lead a team dedicated to the design, construction, and maintenance of expansive production systems, emphasizing high efficiency and availability. This role spans various domains, including software and systems engineering, cloud-scale storage, data management, and services. SRE Senior Managers bring specialized expertise in areas such as systems, networking, storage, coding, database management, capacity planning, continuous delivery and deployment, and proficiency in open-source cloud-enabling technologies like Kubernetes, containers, and virtualization. Your role involves overseeing the implementation of reliable storage solutions, efficient data management, and delivering associated services to uphold the overall stability and performance of production systems.

SRE Manager at NVIDIA ensures the reliability and uptime of both our internal and external GPU cloud services, you align closely with our commitments to users. Simultaneously, you empower developers to enact system changes with meticulous preparation and planning, placing a sharp focus on critical elements such as capacity, latency, and performance. This position embodies a unique attitude and a suite of engineering strategies geared toward amplifying the efficiency of production systems and implementing innovative optimizations. A substantial part of our software development endeavors is dedicated to automating tasks, fine-tuning performance, and elevating the overall efficiency of production systems. With a comprehensive responsibility for understanding the intricate interconnectedness of our systems, you'll us a diverse range of tools and approaches to tackle a wide array of challenges. This role promises a daily dose of engaging and dynamic work, underscored by a commitment to continuous improvement, ensuring the triumphant success of our groundbreaking AI/ML solutions.

What You Will Be Doing:

  • Leadership: Formulating and executing strategic initiatives to enhance the reliability and performance of storage systems, aligning with organizational goals.

  • Team Management: Leading and mentoring a team of Storage SRE professionals, fostering a collaborative and innovative work environment.

  • Cloud Storage Expertise: Supervise the planning, execution, and enhancement of storage solutions, encompassing file, block, and object storage, to cater to the requirements of an expanding cloud infrastructure. Guarantee the efficient utilization of cloud-native storage services offered by platforms like AWS S3 and Azure Blob Storage.

  • System Optimization: Collaborating with multi-functional teams to optimize storage systems, implement best practices, and ensure seamless integration with other technology stacks.

  • Incident Response: Overseeing incident response and resolution for storage-related issues, minimizing downtime, and ensuring a resilient storage environment.

  • Conducting capacity planning exercises and collaborating with team members to forecast and meet storage demands efficiently.

  • Automation and Tooling: Driving automation initiatives to streamline storage operations and developing tools for monitoring, alerting, and performance analysis.

  • Continuous Improvement: Implementing continuous improvement processes to enhance storage systems' overall reliability and efficiency.

What We Need To See:

  • Extensive experience in a senior-level role within Site Reliability Engineering, particularly in managing storage infrastructure.

  • Technical Expertise: In-depth knowledge of storage technologies, file systems, and experience with cloud-based storage solutions. Proficiency in scripting and automation tools is essential.

  • Leadership Skills: Strong leadership and people management skills, with the ability to inspire and guide a team towards achieving common objectives.

  • Problem-Solving Skills: Exceptional analytical and problem-solving skills, with the ability to address complex storage-related issues effectively.

  • Collaboration: Demonstrated ability to collaborate with multi-functional teams and communicate effectively with technical and non-technical collaborators.

  • Prior engineering experience with hands-on coding background in storage systems

  • Master's degree in Computer Science, Information Technology, or a related field or equivalent experience

  • 10+ overall years of relevant experience and 5+ yrs of management experience

Ways to stand out from the crowd:

  • Demonstrated experience in having an SRE mindset, customer-first approach, and focus on customer satisfaction and passion for ensuring customer success.

  • Professional certifications in relevant technologies (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator). Experience with container orchestration platforms and software-defined storage solutions.

  • Proven track record of implementing and managing storage solutions in a large-scale, enterprise environment. Thrive in collaborative environments and enjoy working with various teams. Flexible in adapting to different working styles.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people on the planet working for us. If you're creative and autonomous, we want to hear from you

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world.

The base salary range is 272,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.



  • Santa Clara, United States Rootshell Enterprise Technologies Inc. Full time

    Job DescriptionJob DescriptionHello All,Greetings from Rootshell Inc.Rootshell Enterprise Technologies Inc. is a recognized provider of professional IT Consulting services in the US. We are actively seeking Senior Program Manager/Product Owner for one of our client, Please share your resume with current location & full contact infoRole:Senior Program...


  • Santa Clara, United States Rootshell Inc Full time

    Hello All,Greetings from Rootshell Inc.Rootshell Enterprise Technologies Inc. is a recognized provider of professional IT Consulting services in the US. We are actively seeking Senior Program Manager/Product Owner for one of our client, Please share your resume with current location & full contact infoRole:Senior Program Manager/Product OwnerLocation:Santa...


  • Santa Clara, United States Rootshell Inc Full time

    Hello All,Greetings from Rootshell Inc.Rootshell Enterprise Technologies Inc. is a recognized provider of professional IT Consulting services in the US. We are actively seeking Senior Program Manager/Product Owner for one of our client, Please share your resume with current location & full contact infoRole:Senior Program Manager/Product OwnerLocation:Santa...


  • Santa Clara, United States Chegg US Full time

    Description Senior Product Manager, Search ML/AI enablement Location: Santa Clara, CA Department Summary: At Chegg, the Core Experience Product Team works on the experience across a student’s entire lifecycle, from acquisition to activation, engagement and retention. Job Summary: Chegg is looking for an experienced Senior Product...


  • Santa Clara, United States Palo Alto Networks Full time

    Palo Alto Networks is looking for a talented Senior Systems Engineer, Identity & Access Management who will be responsible for maintainability, build and configuration of user identity & authentication services, single sign on (SSO) and access automa Systems Engineer, Management, Platform Engineer, Systems, Senior, Engineer, Technology


  • Santa Clara, California, United States Geli Full time

    Hanwha Q CELLS Co., Ltd., is one of the world ́s largest and most recognized photovoltaic manufacturers for its high-performance, high-quality solar cells and modules.Hanwha Q CELLS is a flagship company of Hanwha Group, a FORTUNE Global 500 firm and a Top 7 business enterprise in South Korea.Our mission is to provide affordable and smart energy solutions...


  • Santa Clara, United States Ventrum Full time

    Looking for: Senior Manager, Data PlatformJob Type: Full timeLocation: Santa Clara, CA (Hybrid)Position Overview: The Senior Manager, Data Platform will play a pivotal role in driving data-based decision-making culture across the company. You will be responsible for overseeing the development and management of our data platform strategy and implementation,...


  • Santa Clara, United States Ventrum Full time

    Looking for: Senior Manager, Data PlatformJob Type: Full timeLocation: Santa Clara, CA (Hybrid)Position Overview: The Senior Manager, Data Platform will play a pivotal role in driving data-based decision-making culture across the company. You will be responsible for overseeing the development and management of our data platform strategy and implementation,...


  • Santa Clara, United States NVIDIA Full time

    We are looking for a Senior Firmware Architect - Server Manageability!NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next e

  • Senior PR Manager

    1 week ago


    Santa Clara, United States Sustainable Talent Full time

    Job DescriptionJob DescriptionAre you ready to accelerate your career with us at the forefront of the AI revolution? Sustainable Talent is seeking a Senior PR Manager with outstanding news writing, storytelling, and PR pitching skills to support the rapid growth of our client's enterprise business.Sustainable Talent is thrilled to partner with Nvidia, a...

  • Senior Manager

    3 weeks ago


    Santa Clara, California, United States Connor Group Full time

    Connor Group has several offices with hybrid work arrangements. We are also hiring remote professionals for this position.Are you an intellectually curious, deal oriented professional who enjoys coming up with innovative solutions to complex business issues? Connor Group is seeking professionals who want to build off their existing accounting, operational,...

  • Senior Manager

    2 weeks ago


    Santa Clara, United States CoreSite Realty Corp. Full time

    As a leader in CoreSite's Operations team, the Senior Manager - Data Center Operations is responsible for all operational aspects and uptime of the data center. The Senior Manager - Data Center Operations is responsible for providing Field level technical expertise and program management for the maintenance and operation of the electrical, mechanical, fire...


  • Santa Clara, United States Pure Storage Full time

    Company OverviewBE PART OF BUILDING THE FUTURE. What do NASA and emerging space companies have in common with COVID vaccine R&D teams or with Roblox and the Metaverse? The answer is data, - all fast moving, fast growing industries rely on data for a competitive edge in their industries. And the most advanced companies are realizing the full data advantage...


  • Santa Clara, United States Pure Storage Full time

    Company Overview: BE PART OF BUILDING THE FUTURE. What do NASA and emerging space companies have in common with COVID vaccine R&D teams or with Roblox and the Metaverse? The answer is data, -- all fast moving, fast growing industries rely on data for a competitive edge in their industries. And the most advanced companies are realizing the full data advantage...

  • Senior Paralegal

    4 weeks ago


    Santa Clara, United States SMS Staffing Inc. Full time

    Job DescriptionJob DescriptionSMS Staffing Inc is Hiring Immediately for a skilled Senior Paralegal! Job Title: Senior ParalegalJob Location: Sant Clara, CA, U.S.A. 95054Job Type: Contract (possible extension or convert to hire)Pay: Starts at $32.50 an hourShift Structure: ONSITE, 8:30 AM - 5:00 PM, Monday to Friday, 40 hours per weekThe role of the Senior...


  • Santa Clara, United States Telenav Full time

    Do you dream of what cars of the future will look like when you combine them with connectivity, a smartphone, and cloud services? Can you imagine uniting those dreams with a company that has the skills and relationships to make that a reality? If so, Telenav wants you! At Telenav, we believe the car is at the beginning of a massive innovation wave that...


  • Santa Clara, United States Protingent Full time

    Position Title: Senior Technical WriterPosition Description: Protingent Staffing has an exciting contract opportunity for Senior Technical Writer with our client located in Santa Clara, CA.Project Description: As a member of the Services organization, the Senior Technical Writer ensures that development, delivery, and maintenance of technical Field Service...


  • Santa Clara, United States Infoblox Full time

    Description It's an exciting time to be at Infoblox. Named a Top 25 Cyber Security Company by The Software Report and one of Inc. magazine's Best Workplaces for 2020, Infoblox is the leader in cloud-first networking and security services. Our solutions empower organizations to take full advantage of the cloud to deliver network...


  • Santa Clara, California, United States Infoblox Full time

    Description It's an exciting time to be at Infoblox. Named a Top 25 Cyber Security Company by The Software Report and one of Inc. magazine's Best Workplaces for 2020, Infoblox is the leader in cloud-first networking and security services. Our solutions empower organizations to take full advantage of the cloud to deliver network experiences that are...


  • Santa Clara, United States Pure Storage Full time

    Company Overview: BE PART OF BUILDING THE FUTURE. What do NASA and emerging space companies have in common with COVID vaccine R&D teams or with Roblox and the Metaverse? The answer is data, -- all fast moving, fast growing industries rely on data for a competitive edge in their industries. And the most advanced companies are realizing the full data advantage...