High-Performance Computing Expert
6 days ago
At the San Francisco Compute Company, we're revolutionizing the field of real-time compute trading platforms. Our vision is to empower thousands of startups and labs to train and serve large models without the need for extensive infrastructure. This innovative platform will enable organizations to scale their operations to tens of thousands of accelerators, making cutting-edge technology more accessible to a broader audience.
The Role
We're seeking a skilled High-Performance Computing Engineer to join our supercomputing team. As a key member of this team, you'll be responsible for ensuring the smooth operation of our ML training clusters, which are among the most powerful computers in the world. Your expertise will be crucial in monitoring hardware health, fixing issues promptly, and implementing automation solutions to manage hardware at scale. As we continue to grow, this role will evolve into a data-driven position, predicting failures before they occur.
About You
- You have hands-on experience managing GPU training clusters, preferably with over 1,000 GPUs.
- You value clear documentation and understand its importance in maintaining efficient systems.
- You are proficient in Linux, CUDA, NCCL, and InfiniBand, all of which are essential for this role.
- You enjoy designing self-correcting systems that optimize hardware performance.
- Familiarity with the Rust programming language, as our VM orchestrator is built in Rust.
- Experience with distributed storage systems such as Weka, VAST, Ceph, etc.
- Knowledge of HPC network architectures including eBGP, fat-tree, VXLAN, MCLAG, etc.
- Understanding of Linux virtualization technologies like KVM, QEMU, libvirt, etc.
- Expertise in performance optimization of machine learning kernels.
Our competitive salary for this role is $150,000 - $200,000 per year, commensurate with experience. We also offer a generous equity grant, retirement matching, medical, dental & vision insurance, unlimited paid time off, parental leave, daily lunch, and visa sponsorship.
The San Francisco Compute Company is committed to fostering an inclusive workplace culture that values diversity, equity, and inclusion. We strive to create a workplace free from discrimination and harassment, where everyone feels valued, respected, and empowered to contribute their best work.
-
High-Performance Computing Expert
6 days ago
San Jose, California, United States Syntricate Technologies Full time
Company Overview
Syntricate Technologies is a cutting-edge technology firm that specializes in delivering innovative solutions to complex problems. Our team of experts is passionate about leveraging the latest technologies to drive business growth and success.
Salary
We are offering a competitive salary ranging from $120,000 to $180,000 per annum, depending on...
-
High-Performance Cryptography Expert
5 days ago
San Francisco, California, United States Nexus Full time
Nexus, a pioneering scientific project in verifiable computing, seeks an accomplished Senior Cryptography Engineer to spearhead the development of cutting-edge zero-knowledge proof systems and cryptographic protocols.
Location: San Francisco
Job Overview
This role requires a highly skilled individual with a strong mathematics background and expertise in...
-
High Performance Ads Engineering Expert
7 days ago
San Francisco, California, United States Activision Full time
Job Title: High Performance Ads Engineering Expert
About Us: Activision Blizzard Media is a cutting-edge organization that empowers innovators to craft exceptional high-scale backend systems for Advertising using the latest technologies.
Salary Range: $143,060 - $264,846 per annum in the U.S., and may vary based on experience and location.
Key...
-
High Performance Computing Architect
3 days ago
San Francisco, California, United States Amazon Full time
About the Role
We are seeking a highly skilled Senior Solutions Architect to join our team. This role will be responsible for designing and implementing scalable and secure cloud-based solutions for Financial Services customers in North America.
Key Responsibilities
Represent the voice of the customer, collaborating with field and central teams to bring...
-
High Performance Computing Expert
5 days ago
San Jose, California, United States Cadence Design Systems Full time
Company Overview
Cadence Design Systems is a global leader in the electronic design automation industry, providing software, hardware, and intellectual property to design advanced semiconductor chips. Our team is passionate about solving complex technical challenges and pushing the boundaries of innovation.
Salary
The annual salary range for this...
-
San Francisco, California, United States Chan Zuckerberg Biohub Network Full time
About the Chan Zuckerberg Biohub Network
The Chan Zuckerberg Biohub Network is a pioneering research organization dedicated to advancing our understanding of biology and disease. As a collaborative environment, we bring together scientists, engineers, and physicians to tackle complex scientific challenges on a grand scale.
We are committed to cultivating an...
-
High Performance Computing Architect
7 days ago
San Francisco, California, United States Nexus Full time
About Nexus
Nexus is a pioneering scientific and engineering effort that aims to revolutionize the field of computation. Our mission is to bring truth to verifiable computation by harnessing open science and open-source software.
As a leader in HPC, we're leveraging decades of advancements in zero-knowledge cryptography to create a single software system that...
-
San Francisco, California, United States Greptile Full time
Established in San Francisco, Greptile is revolutionizing the way software teams navigate codebases. As a senior software engineer at Greptile, you will be part of a dynamic team building an AI expert that empowers developers to query their codebases using a natural language API.
About Greptile:
Greptile's mission is to provide software teams with a...
-
High-Performance Software Engineer
5 days ago
San Francisco, California, United States Fastly Full time
About Fastly
Fastly is a leading edge cloud platform that enables customers to create great digital experiences quickly, securely, and reliably. Our platform processes, serves, and secures applications at the edge of the internet, allowing customers to take advantage of modern internet capabilities.
We're Building a More Trustworthy Internet
At Fastly, we...
-
Cloud Data Solutions Expert
5 days ago
San Francisco, California, United States Snowflake Computing Full time
Build the future of data management at Snowflake Computing, a leading cloud-based data platform provider. We are seeking an experienced Cloud Data Solutions Expert to join our Professional Services team.
About the Role
Presents Snowflake technology and vision to executives and technical contributors to customers, positioning it as a trusted advisor for...
-
High-Performance Networking Architect
3 days ago
San Francisco, California, United States Magic AI Full time
At Magic AI, we are on a mission to build safe artificial intelligence that accelerates humanity's progress on the world's most important problems. Our approach combines frontier-scale pre-training, domain-specific reinforcement learning, ultra-long context, and inference-time compute to achieve this goal.
About the Role
We are seeking a highly skilled HPC...
-
San Francisco, California, United States Chan Zuckerberg Biohub Network Full time
The Chan Zuckerberg Biohub Network is a pioneering organization that brings together scientists, engineers, and physicians to tackle complex scientific challenges. We are seeking an experienced High Performance Computing (HPC) Infrastructure Engineer to join our team.
About the Opportunity
We are looking for a highly skilled HPC engineer to develop, support,...
-
Cloud Solutions Architect
5 days ago
San Francisco, California, United States Amazon Full time
Overview
At Amazon, we're committed to empowering our customers with innovative cloud solutions. As a Cloud Solutions Architect in the High Performance Computing (HPC) team, you'll play a critical role in designing and implementing scalable and efficient HPC architectures for our clients.
About the Role
We're seeking an experienced professional with a strong...
-
High-Performance Computing Software Developer
3 weeks ago
San Jose, California, United States ASML US, LLC Full time
Introduction to the Role
ASML US, LLC brings together talented individuals in science and technology to develop cutting-edge lithography machines that enable the production of faster, cheaper, and more energy-efficient microchips. Our company designs, develops, integrates, markets, and services these advanced machines, which empower our customers – the...
-
High-Performance GPU Optimization Expert
7 days ago
San Jose, California, United States Adobe Inc. Full time
About Adobe Inc.
At Adobe, we're passionate about empowering creatives to push the boundaries of what's possible. With a legacy spanning over 40 years, we've been at the forefront of innovation in digital experiences. Our commitment to creativity and inclusivity drives us to create exceptional products that transform how companies interact with customers...
-
San Francisco, California, United States Tbwa ChiatDay Inc Full time
The Media Foundation at Reddit is dedicated to delivering seamless, high-performance media experiences that meet and exceed industry standards. We are seeking a skilled iOS engineer with a deep understanding of scalable media solutions who can drive innovation, deliver top-tier video experiences to our users, and guide other teams in seamlessly integrating...
-
High-Performance Backend Software Engineer
5 days ago
San Francisco, California, United States Rippling Full time
Rippling is a cutting-edge technology company that empowers businesses to streamline their HR, IT, and Finance operations. Our innovative platform brings together all workforce systems in one place, enabling seamless management and automation of every aspect of the employee lifecycle.
About the Role
We are seeking a highly skilled Senior Software Engineer to...
-
Sales Performance Expert
5 days ago
San Francisco, California, United States Canon USA & Affiliates Full time
About the Opportunity
Canon Solutions America, a leader in print technology, solutions, and services, is seeking a Sales Performance Expert to drive sales results and consistently achieve individual and team revenue goals. As a Sales Performance Expert, you will master the core capabilities of innovative products, solutions, and technologies from Canon...
-
High-Performance Backend Developer
5 days ago
San Francisco, California, United States Amplitude Full time
Amplitude, a leading digital analytics platform, empowers companies to unlock the full potential of their products. The company's 3,200+ customers, including industry leaders like Atlassian and Under Armour, rely on Amplitude to gain actionable insights into customer behavior. As an organization, we prioritize humility, ownership, and continuous improvement...
-
Cloud Computing Expert
5 days ago
San Francisco, California, United States Tekfortune Inc Full time
We are seeking a highly skilled Cloud Computing Expert to join our team at Tekfortune Inc. In this role, you will be responsible for designing and delivering Azure-based cloud services.
Key Responsibilities:
- Design and delivery of Azure-based cloud services
- Development and shipping of software using C# and KQL (Kusto) query languages
- Azure ARM/Bicep experience...