Senior Director of AI Infrastructure
4 days ago
CZI supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.
Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow:
Building an AI-based virtual cell model to predict and understand cellular behaviorDeveloping state-of-the-art imaging systems to observe living cells in actionInstrumenting tissues to better understand inflammation, a key driver of many diseasesEngineering and harnessing the immune system for early detection, prevention, and treatment of diseaseCZI's work in science includes grantmaking programs, open-source software development, and close collaboration with the Chan Zuckerberg Biohub Network. The CZ Biohub Network includes the San Francisco, Chicago, and New York Biohubs as well as the Chan Zuckerberg Imaging Institute. CZI also collaborates with institutional partners like the Kempner Institute for the Study of Natural & Artificial Intelligence at Harvard University. Join us in accelerating science.
Our Central Tech team provides technology and security support for CZI and our grantees. We believe that Engineering, IT and Security are most effective when in sync and learning from each other on a daily basis. Across our three pillars of Infrastructure, Security, and Grantee & Partner Support, we enable our teams to achieve their goals faster and more securely. We leverage technology to automate manual processes, constantly innovate to optimize operations, provide first-class support, and build solutions to enable the scale and execution of our business partners' strategies and initiatives.
The AI/ML Infrastructure team works on building shared tools and platforms to be used across the Chan Zuckerberg Initiative, partnering and supporting the work of an extensive group of Research Scientists, Data Scientists, AI Research Scientists, as well as a broad range of Engineers focusing on Education and Science domain problems. Members of the shared infrastructure engineering team have an impact on all of CZI's initiatives by enabling the technology solutions used by other engineering teams at CZI to scale.
The OpportunityAs a Research Engineer on the AI Engineering team you will apply and optimize state-of-the-art models in artificial intelligence and machine learning to solve important problems in the biomedical sciences aligned with CZI's mission. You will work as part of a team responsible for developing and deploying AI models that use data developed by CZI and research partners all for the purpose of contributing to greater understanding of human cell function.
You will have the opportunity to work closely with teams of scientists, computational biologists, engineers within CZI and to collaborate with CZI grantees, with CZ institutes, and other external labs and organizations. Your work will inspire and enhance the production and analysis of datasets by CZ teams and collaborators. Scientific focus areas could include single cell biology, imaging, genomics, and proteomics.
What You'll DoWorking with the AI Research Scientists, iterate on, optimize, deploy, and maintain innovative machine learning models, systems, and software tools that enable the analysis and interpretation of AI models for BiologyWork with cross-functional team members to quickly iterate on system performance to meet/stay ahead of users' needs - e.g. we get feedback that the model doesn't scale to X million so working with our user researcher/scientist/product team to iterate on the solution. Partner with research scientists to build robust data loader pipelines for scalable distributed training and evaluation.Serve as an interface to product and engineering teams to understand how models may need to evolve to support multiple use cases.Develop model evaluation and interpretability frameworks that help biologists understand which data features drive model predictionsBuild reusable engineering utilities that can unlock experimentation velocity across research initiatives in the organizationOptimize model architectures to enhance performance, fine-tune accuracy, and efficiently manage infrastructure resourcesWhat You'll Bring
Experience in working with a highly interactive and cross-functional collaborative environment with a diverse team of colleagues and partners solving complex problems through applied deep learning.A track record and expertise in developing deep learning models on large-scale GPU clusters, using techniques of distributing training such as DDP, FSDP, Model parallelism, low-precision training, profiling and optimizing AI/ML code, fine tuning models.Expertise in leading end-to-end experimentation pipelines for training and evaluating deep learning models, with particular focus on experiment tracking and reproducibility.A good working knowledge of Python-based ML libraries and frameworks such as PyTorch, JAX, TensorFlow, NumPy, Pandas, and Scikit-learn.Experience in using modern frameworks for distributed computing and infrastructure management, particularly as related to ML models such as PyTorch Lightning, Deepspeed, TransformerEngine, RayScale etc.Ability to effectively balance exploratory research with robust engineering practices.A good working knowledge of general software engineering practices in a production environment.The ability to work independently and as part of a team, and have excellent communication and interpersonal skills.Have a Masters in computer science with a focus on machine learning & data analytics, or equivalent industry experience and at least 6-8 years of experience developing and applying machine learning methods.Compensation
The Redwood City, CA base pay range for a new hire in this role is $241,000 - $331,000. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.
Work ModeAs we grow, we're excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team's manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.
Benefits for the Whole YouWe're thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.
CZI provides a generous employer match on employee 401(k) contributions to support planning for the future.Annual benefit for employees that can be used most meaningfully for them and their families, such as housing, student loan repayment, childcare, commuter costs, or other life needs.CZI Life of Service Gifts are awarded to employees to "live the mission" and support the causes closest to them.Paid time off to volunteer at an organization of your choice. Funding for select family-forming benefits. Relocation support for employees who need assistance moving to the Bay AreaAnd moreIf you're interested in a role but your previous experience doesn't perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
Explore our work modes, benefits, and interview process at
#LI-Hybrid
-
AI Engagement Manager/Director
1 day ago
Redwood City, California, United States C3 AI Full time $91,000 - $238,000C3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing, deploying, and operating enterprise AI applications, C3 AI applications, a portfolio of industry-specific SaaS enterprise AI applications that enable the digital...
-
Redwood City, California, United States C3 AI Full time $188,000 - $222,000C3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing, deploying, and operating enterprise AI applications, C3 AI applications, a portfolio of industry-specific SaaS enterprise AI applications that enable the digital...
-
Redwood City, California, United States C3 Ai Full time $138,600 - $237,900 per yearC3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing, deploying, and operating enterprise AI applications, C3 AI applications, a portfolio of industry-specific SaaS enterprise AI applications that enable the...
-
Redwood City, California, United States Fireworks AI Full time $120,000 - $180,000 per yearAbout Us:Here at Fireworks, we're building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We've been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own...
-
Redwood City, California, United States Tempus AI Full time $190,000 - $290,000 per yearPassionate about precision medicine and advancing the healthcare industry?Recent advancements in underlying technology have finally made it possible for AI to impact clinical care in a meaningful way. Tempus' proprietary platform connects an entire ecosystem of real-world evidence to deliver real-time, actionable insights to physicians, providing critical...
-
Vice President, AI Infrastructure Products
6 days ago
Redwood City, California, United States Equinix Full time $276,000 - $414,000 per yearWho are we?Equinix is the world's digital infrastructure company, operating over 260 data centers across the globe. Digital leaders harness Equinix's trusted platform to bring together and interconnect foundational infrastructure at software speed. Equinix enables organizations to access all the right places, partners and possibilities to scale with...
-
Senior Revenue Accountant
5 days ago
Redwood City, California, United States C3 AI Full time $121,760 - $132,720 per yearC3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing, deploying, and operating enterprise AI applications, C3 AI applications, a portfolio of industry-specific SaaS enterprise AI applications that enable the digital...
-
Software Engineer, Multimedia
1 day ago
Redwood City, California, United States Fireworks AI Full time $170,000 - $240,000 per yearAbout Us:Here at Fireworks, we're building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We've been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own...
-
Senior Software Engineer — Robotics
3 days ago
Redwood City, California, United States Stealth Robotics-ai Full time $150,000 - $250,000 per yearWe are a stealth-stage robotics company building the next generation of embodied AI systems — machines that learn from humans and act safely and autonomously in the physical world. Our team is small, technical, and mission-driven, combining robotics, machine learning, and scalable software systems to push the frontier of intelligent automation.What you'll...
-
Director of Marketing
24 hours ago
Redwood City, California, United States GridCARE Full time $120,000 - $180,000 per yearAbout UsGridCARE is a leading venture-backed startup solving the most critical constraint in AI's growth trajectory: immediate access to power. As demand for computing skyrockets, access to energy has become the defining bottleneck in the AI infrastructure race. While leading tech companies invest billions in speculative, long-term solutions that may take...