Senior HPC Infrastructure Engineer

4 weeks ago


Palo Alto, California, United States | Guardant Health | Full time
Job Description

Guardant Health is a leading precision oncology company focused on helping conquer cancer globally through the use of its proprietary tests, vast data sets, and advanced analytics.

The company's oncology platform leverages capabilities to drive commercial adoption, improve patient clinical outcomes, and lower healthcare costs across all stages of the cancer care continuum.

Guardant Health has commercially launched several tests, including Guardant360, Guardant360 CDx, Guardant360 TissueNext™, Guardant360 Response™, and GuardantOMNI tests for advanced-stage cancer patients, as well as the Guardant Reveal™ test for early-stage cancer patients.

The company's screening portfolio, including the Shield™ test, aims to address the needs of individuals eligible for cancer screening.

Job Responsibilities

  • Assist in managing the HPC interconnect
  • Assist in integrating the HPC systems with the bandwidth on-demand system
  • Work with the networking infrastructure team to manage and optimize the connectivity to and from the HPC systems and locales
  • Help manage multiple HPC clusters and cluster file systems
  • Help research, develop, and implement the next generation HPC solution
  • Troubleshoot the production system stack down to source code level
  • Maintain, monitor, and support the infrastructure environment and/or facilities
  • Use and maintain enhanced production monitoring and additional capability
  • Support improvements for increased system reliability and performance

Requirements

  • 2+ years of Linux/Unix administration experience
  • Knowledge of Unix network protocols, TCP/IP network fundamentals, core infrastructure technologies, and virtualization
  • 2+ years of experience with large-scale data storage and compute cluster (HPC) infrastructure
  • 2+ years of working in and with on-premises and cloud-based (AWS, Google, IBM, and Azure) data centers
  • 2+ years of building software release and ops processes and automation toolsets
  • 2+ years of writing system administration documentation

Preferred Skills

  • Experience administering IBM's General Parallel File System
  • Experience administering Grid Engine scheduler
  • Experience administering SLURM scheduler
  • Experience with using Bright Cluster Manager
  • Experience with cloud bursting technologies
  • Experience with wide area file systems
  • Experience with Docker and container technologies
  • Experience with Kubernetes, preferably with Certified Kubernetes Administrator (CKA)
  • Operating infrastructure compliant with HIPAA and SOX standards

Education

B.S. in Computer Science or related field

Hybrid Work Model

Guardant Health has a hybrid work model that allows employees to work from home and collaborate in-person. All U.S. employees who live within 50 miles of a Guardant facility will be required to be onsite on Mondays, Tuesdays, and Thursdays.

The company has found that aligning scheduled in-office days allows teams to do their best work and creates focused thinking time for innovative work.

Guardant Health is committed to providing reasonable accommodations in our hiring processes for candidates with disabilities, long-term conditions, mental health conditions, or sincerely held religious beliefs.

Guardant Health is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.


