SRE and DevOps Engineer
5 days ago
Sustainable Talent is partnering with Nvidia a global leader who's been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a SRE & DevOps Engineer to support our client's Infrastructure, Planning and Processes organization.
This is a W-2 full-time contract based in Santa Clara, CA, Onsite. We offer competitive pay based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture
NVIDIA is looking for a seasoned SRE & DevOps Engineer to join its multifaceted and fast-paced Infrastructure, Planning and Processes organization. The position will be part of a fast-paced crew that develops and maintains sophisticated NVIDIA's internal infrastructure products. The team works with various other business units within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure & systems needs.
As an SRE & DevOps engineer, you'll also be working in conjunction with various teams such as software engineering to deploy these new products and manage our infrastructure, associated processes and systems. Keen attention to detail, problem-solving abilities, and a solid knowledge base are essential.
What you'll be doing:
- Working on systems deployed in NVIDIA's internal infrastructure products and them available and reliable for our end users.
- Monitor system performance and troubleshoot issues related to Nvidia hardware and software stack.
- Providing high quality of user support.
- Monitoring KPIs and making sure that team's SLAs are met.
- Managing and maintaining production Kubernetes clusters and Jenkins pipelines.
- Drive automation of monitoring to gain more insight into applications and system health.
- Experience of maintaining cloud and CI/CD on-prem infrastructure and highly-available production environments.
- Expert level proficiency in CI/CD systems like ArgoCD, Jenkins, Gitlab CI, Github actions etc.
- Background in Databases like SQL (MySQL) and timeseries DBs like Prometheus.
- Experience with data analytics/visualization tools like ELK, Grafana, Splunk etc. and alerting tools like Zabbix, Alertmanager and Pagerduty.
- Proficient with Ansible, Kubernetes, Containers & Virtualization platforms.
- 5+ years of proven experience along with Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience.
- Previous experience with SRE teams managing on-prem infrastructure.
- Experience managing NVIDIA hardware like GPUs and Tegras.
- Thrives in a multi-tasking environment with constantly evolving priorities.
- Prior experience with large scale operations team.
Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.
-
SRE DevOps Consultant
3 days ago
Santa Clarita, CA, United States Capgemini Full timeWe are seeking a highly skilled and motivated Senior SRE DevOps Consultant to join our dynamic team. Job Description: As Senior SRE DevOps Consultant, you will be responsible for overseeing the reliability, scalability, and performance of our infrastructure and applications. You will lead a team of talented engineers, ensuring the seamless integration and...
-
SRE DevOps Consultant
1 week ago
Santa Clarita, CA, United States Capgemini Full timeWe are seeking a highly skilled and motivated Senior SRE DevOps Consultant to join our dynamic team. Job Description: As Senior SRE DevOps Consultant, you will be responsible for overseeing the reliability, scalability, and performance of our infrastructure and applications. You will lead a team of talented engineers, ensuring the seamless integration and...
-
SRE DevOps Consultant
5 days ago
Santa Clarita, CA, United States Capgemini Full timeWe are seeking a highly skilled and motivated Senior SRE DevOps Consultant to join our dynamic team. Job Description: As Senior SRE DevOps Consultant, you will be responsible for overseeing the reliability, scalability, and performance of our infrastructure and applications. You will lead a team of talented engineers, ensuring the seamless integration and...
-
Senior Production Engineer
3 days ago
Santa Clara, CA, United States Cynet Systems Full timeJob Description: Pay Range: $84.97hr - $87.97hr Responsibilities: Support the design, implementation, and sustainment of CI/CD pipelines with embedded with auditable deployment processes. Promote infrastructure-as-code using Terraform, Helm, and Ansible, incorporating HITRUST and GxP controls into reusable modules. rchitect and maintain highly...
-
Senior Production Engineer
7 days ago
Santa Clara, CA, United States Cynet Systems Full timeJob Description: Pay Range: $84.97hr - $87.97hr Responsibilities: Support the design, implementation, and sustainment of CI/CD pipelines with embedded with auditable deployment processes. Promote infrastructure-as-code using Terraform, Helm, and Ansible, incorporating HITRUST and GxP controls into reusable modules. rchitect and maintain highly...
-
Senior Production Engineer
3 days ago
Santa Clara, CA, United States Cynet Systems Full timeJob Description: Pay Range: $84.97hr - $87.97hr Responsibilities: Support the design, implementation, and sustainment of CI/CD pipelines with embedded with auditable deployment processes. Promote infrastructure-as-code using Terraform, Helm, and Ansible, incorporating HITRUST and GxP controls into reusable modules. rchitect and maintain highly...
-
Principal DevOps Engineer Cortex Observability
2 weeks ago
Santa Clara, CA, United States Palo Alto Networks Full timeJob Description NOTE: Due to government environments this team supports, the role requires a US Citizen or Permanent Resident. Your Career The Cortex team builds and delivers the industry’s most advanced SecOps platform, consisting of XSIAM, XSOAR, and XPANSE. As a Senior DevOps Engineer, you will be responsible for designing, building, and maintaining the...
-
Principal DevOps Engineer
3 days ago
Santa Clara, CA, United States Roche Full timeAt Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure...
-
Principal DevOps Engineer
5 days ago
Santa Clara, CA, United States Roche Full timeAt Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure...
-
Principal DevOps Engineer
2 weeks ago
Santa Clara, CA, United States Roche Full timeAt Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure...