Senior HPC Infrastructure Engineer
4 weeks ago
Guardant Health is a leading precision oncology company focused on helping conquer cancer globally through the use of its proprietary tests, vast data sets, and advanced analytics.
The company's oncology platform leverages capabilities to drive commercial adoption, improve patient clinical outcomes, and lower healthcare costs across all stages of the cancer care continuum.
Guardant Health has commercially launched several tests, including Guardant360, Guardant360 CDx, Guardant360 TissueNextTM, Guardant360 ResponseTM, and GuardantOMNI tests for advanced stage cancer patients, as well as the Guardant RevealTM test for early-stage cancer patients.
The company's screening portfolio, including the ShieldTM test, aims to address the needs of individuals eligible for cancer screening.
Job Responsibilities
- Assist in managing the HPC interconnect
- Assist in integrating the HPC systems with the bandwidth on-demand system
- Work with the networking infrastructure team to manage and optimize the connectivity to and from the HPC systems and locales
- Help manage multiple HPC clusters and cluster file systems
- Help research, develop, and implement the next generation HPC solution
- Troubleshoot the production system stack down to source code level
- Maintain, monitor, and support the infrastructure environment and/or facilities
- Use and maintain enhanced production monitoring and additional capability
- Support improvements for increased system reliability and performance
Requirements
- 2+ years of Linux/Unix administration experience
- Knowledge of Unix network protocols, TCP/IP network fundamentals, core infrastructure technologies, and virtualization
- 2+ years of large-scale data storage and compute clusters (HPC) infrastructure experience
- 2+ years of working in and with on-premise and cloud-based (AWS, Google, IBM, and Azure) data-centers
- 2+ years of building software release and ops processes and automation toolset
- 2+ years of providing documentation of system administration
Preferred Skills
- Experience administering IBM's General Parallel File System
- Experience administering Grid Engine scheduler
- Experience administering SLURM scheduler
- Experience with using Bright Cluster Manager
- Experience with cloud bursting technologies
- Experience with wide area file systems
- Experience with Docker and container technologies
- Experience with Kubernetes, preferably with Certified Kubernetes Administrator (CKA)
- Operating infrastructure compliant with HIPAA and SOX standards
Education
B.S. in Computer Science or related field
Hybrid Work Model
Guardant Health has a hybrid work model that allows employees to work from home and collaborate in-person. All U.S. employees who live within 50 miles of a Guardant facility will be required to be onsite on Mondays, Tuesdays, and Thursdays.
The company has found that aligning scheduled in-office days allows teams to do their best work and creates focused thinking time for innovative work.
Guardant Health is committed to providing reasonable accommodations in our hiring processes for candidates with disabilities, long-term conditions, mental health conditions, or sincerely held religious beliefs.
Guardant Health is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.
-
Staff HPC Infrastructure Engineer
3 weeks ago
Palo Alto, California, United States Guardant Health Full timeJob DescriptionGuardant Health is a leading precision oncology company focused on helping conquer cancer globally through the use of its proprietary tests, vast data sets, and advanced analytics. The company's HPC team builds and operates the computational technology backbone of the organization, including scalable data storage, high-performance compute...
-
Staff HPC Infrastructure Engineer
4 weeks ago
Palo Alto, California, United States Guardant Health Full timeJob OverviewGuardant Health is a leading precision oncology company seeking a highly skilled HPC Infrastructure Specialist to join its team. The successful candidate will be responsible for designing, implementing, and maintaining the company's high-performance computing infrastructure.The ideal candidate will have a strong background in Linux/Unix...
-
HPC Infrastructure Specialist
4 weeks ago
Palo Alto, California, United States Tesla Full timeJob Title: HPC Engineer, AI InfrastructureTesla's AI Infrastructure team is responsible for designing and maintaining the high-performance computing systems that power our machine learning algorithms. As an HPC Engineer, you will play a critical role in ensuring the smooth operation of our AI infrastructure, including virtual simulations, Autopilot hardware,...
-
HPC Infrastructure Specialist
4 weeks ago
Palo Alto, California, United States Tesla Full timeAbout the RoleTesla's AI infrastructure team is seeking a highly skilled HPC Engineer to join our team. As a key member of our team, you will be responsible for maintaining and improving our AI infrastructure to support our Full-Self-Driving (FSD), Tesla Bot & Dojo engineering teams.Key ResponsibilitiesManage and operate our AI infrastructure, including...
-
Senior Infrastructure Reliability Engineer
4 weeks ago
Palo Alto, California, United States Foundry Technologies, Inc. Full timeAbout FoundryFoundry Technologies, Inc. is a leading provider of AI infrastructure solutions. We are seeking a highly skilled Senior Infrastructure Reliability Engineer to join our team.Job SummaryWe are looking for a talented engineer to design, deploy, and maintain our AI infrastructure. The ideal candidate will have a strong background in cloud...
-
Senior Blockchain Infrastructure Engineer
4 weeks ago
Palo Alto, California, United States Snarkify Full timeJob DescriptionSnarkify is seeking a highly skilled Senior Blockchain Infrastructure Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, developing, and maintaining scalable proof systems, libraries, and related tools to support Zero-Knowledge Proofs (ZKP) applications.Key Responsibilities:Design and...
-
Senior Cloud Infrastructure Engineer
3 weeks ago
Palo Alto, California, United States Foundry Technologies, Inc. Full timeAbout the RoleWe are seeking a highly skilled Senior Cloud Infrastructure Engineer to join our team at Foundry Technologies, Inc. As a key member of our infrastructure team, you will be responsible for designing, deploying, and maintaining our cloud infrastructure to support our AI workloads.Your primary focus will be on ensuring the reliability,...
-
Palo Alto, California, United States X (formerly Twitter) Full timeAt X, we're on a mission to become the trusted global digital public square, committed to protecting freedom of speech and building the future of unlimited interactivity.Our goal is to empower every user to freely create and share ideas, fostering open public discourse without barriers.We're seeking a talented Senior Software Engineer to join our Security...
-
Blockchain Infrastructure Engineer
4 weeks ago
Palo Alto, California, United States stakefish Full timeJob SummaryWe are seeking a highly skilled Blockchain Infrastructure Engineer to join our team at stakefish. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining our blockchain infrastructure.Your primary focus will be on ensuring the security, scalability, and reliability of our infrastructure, as well...
-
Senior Machine Learning Infrastructure Engineer
3 weeks ago
Palo Alto, California, United States Rivian Full timeAbout RivianRivian is a pioneering electric vehicle manufacturer dedicated to creating a sustainable future. Our mission is to keep the world adventurous forever, and we're seeking a talented Machine Learning Engineer to join our Platform Architecture team.In this role, you'll collaborate with cross-functional teams to develop cutting-edge machine learning...
-
Blockchain Infrastructure Engineer
4 weeks ago
Palo Alto, California, United States stakefish Full timeJob Title: DevOps EngineerWe are seeking a highly skilled DevOps Engineer to join our team at stakefish. As a DevOps Engineer, you will play a critical role in building and maintaining our blockchain infrastructure, ensuring the security, scalability, and reliability of our systems.Key Responsibilities:Design and implement secure and reliable infrastructure...
-
Palo Alto, California, United States Match Group Full timeAbout the RoleWe are seeking a highly skilled Sr. Software Engineer to join our Machine Learning Infrastructure team. As a key member of this team, you will be responsible for designing and developing scalable infrastructure to support the diverse needs of machine learning engineers across all Tinder business units.Key Responsibilities* Build robust and...
-
Palo Alto, California, United States Match Group Full timeAbout the RoleWe are seeking a highly skilled Sr. Software Engineer to join our Machine Learning Infrastructure team at Tinder. As a key member of our team, you will be responsible for designing and developing robust and scalable infrastructure to support the diverse needs of machine learning engineers across all Tinder business units.Key...
-
Senior Infrastructure Software Architect
3 weeks ago
Palo Alto, California, United States Foundry Technologies, Inc. Full timeAbout Foundry Technologies, Inc.Foundry Technologies, Inc. is revolutionizing the way AI companies access compute power. Our mission is to orchestrate the world's compute capacity, making it easier to use and optimized for AI workloads. We're building a new type of public cloud, one designed specifically for AI, where accessing high-performance compute is as...
-
Senior Cloud Engineer
3 weeks ago
Palo Alto, California, United States Flow MD Full timeAbout the CompanyFlow is a technology-driven company that aims to enhance living experiences across communities. We leverage technology to provide superior living conditions and foster vibrant communities.About the RoleWe are seeking a Senior/Staff Platform Engineer to join our engineering team. As an early member, you will significantly influence the...
-
Software Engineer, Infrastructure Specialist
4 weeks ago
Palo Alto, California, United States Matroid Full timeAbout MatroidMatroid is a pioneering company in the field of computer vision, aiming to empower businesses and industries with its cutting-edge solutions. Founded in 2016 by a Stanford professor, the company has raised $33.5 million from prominent investors and boasts a diverse range of customers and partners in manufacturing, industrial IoT, and...
-
Senior DevOps Engineer
11 hours ago
Palo Alto, California, United States Lanai Full timeAt Lanai, we're at the forefront of the GenAI revolution, empowering humans to achieve the extraordinary in the age of AI.We're looking for a highly skilled Senior DevOps Engineer to join our team and help shape the future of work in the AI era.The ideal candidate will have a proven track record in designing and managing AWS cloud infrastructure for scalable...
-
Principal Infrastructure Security Engineer
4 weeks ago
Palo Alto, California, United States Palantir Technologies Full timeJob DescriptionA World-Changing CompanyPalantir builds the world's leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more.The RoleAs a Principal Infrastructure Security...
-
Senior Security Engineer
3 weeks ago
Palo Alto, California, United States X (formerly Twitter) Full timeAt X, we're on a mission to protect freedom of speech and build the future of unlimited interactivity. Our goal is to empower every user to freely create and share ideas, fostering open public discourse without barriers. We're seeking a Senior Security Engineer to help us achieve this vision.Key Responsibilities:Help maintain the security of X's networks,...
-
Data Infrastructure Engineer
3 weeks ago
Palo Alto, California, United States Obsidian Security Full timeAbout Us:At Obsidian Security, we're dedicated to solving the unaddressed blindspot of SaaS Security. Our cutting-edge solution provides comprehensive and powerful protection for SaaS applications, safeguarding critical business information.We're a passionate team optimizing for impact by solving some of the biggest challenges in cybersecurity today. We...