Infrastructure Engineer
4 days ago
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.
Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.
About The Role
We are looking for a hands-on Infrastructure Engineer to join our team and support our high-performance, on-premise server and networking infrastructure. You will be responsible for maintaining, provisioning, and troubleshooting hardware and Linux systems, working closely with network and system teams. This is an in-person role, ideal for someone who enjoys working across hardware, networking, and system layers.
Key Responsibilities
- Physically install, rack, cable, and maintain blade servers and hardware components (CPUs, DIMMs, NICs, storage devices, etc.).
- Connect servers to high-speed networks (100G/400G), verify optics/DACs, and check link status.
- Configure BIOS, firmware, and out-of-band management (IPMI/iDRAC/iLO).
- Install and provision Linux OS; configure hostnames, IPs, routing, and NFS mount points.
- Debug network issues at physical and OS level (VLAN, link issues, routing, etc.).
- Use Linux tools (e.g., ip, dmesg, netstat, ping) to isolate and fix issues.
- Follow provisioning playbooks and maintain accurate records of assets and changes.
- Use scripting (Bash, Python) to automate routine tasks and improve efficiency.
- Collaborate with internal teams (network, systems, storage) and coordinate vendor RMAs.
- Document procedures and contribute to team knowledge base.
- Troubleshoot and replace failed server components with minimal downtime.
- 3-5+ years of experience in data center, lab, or infrastructure engineering roles.
- Proficient in Linux system administration and network configuration.
- Strong hands-on knowledge of x86 server hardware and enterprise networking.
- Familiar with BIOS configuration, firmware updates, and remote management tools.
- Skilled in physical setup and troubleshooting of high-speed NICs and optical links.
- Experience with VLANs, static routing, and diagnosing layer 1-3 issues.
- Ability to write scripts for automation and diagnostics (Bash, Python preferred).
- Comfortable working on-site daily and lifting/moving server hardware.
- Experience with PXE, NFS, RAID controllers, and monitoring tools.
- Familiarity with configuration management tools (e.g., Ansible).
- Prior experience in a lab or R&D hardware/software environment.
- This is a unique opportunity to work with cutting-edge infrastructure and grow into more senior technical roles. If you enjoy bridging hardware and software with hands-on work, we'd love to hear from you.
Why Join Cerebras
People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we've reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
- Build a breakthrough AI platform beyond the constraints of the GPU.
- Publish and open source their cutting-edge AI research.
- Work on one of the fastest AI supercomputers in the world.
- Enjoy job stability with startup vitality.
- Our simple, non-corporate work culture that respects individual beliefs.
Read our blog: Five Reasons to Join Cerebras in 2025.
Apply today and become part of the forefront of groundbreaking advancements in AI
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.
-
Infrastructure Engineer
2 weeks ago
Sunnyvale, CA, United States INSPYR Solutions Full timeLocation: Sunnyvale, CA (Hybrid) Duration: 6-9 months with extensions Infrastructure EngineerABOUT THIS FEATURED OPPORTUNITY The client is seeking multiple Infrastructure Engineers to support the organization. This team is working on confidential projects focused on building big data internal platforms to power large-scale application environments. The...
-
Senior Infrastructure Engineer
4 days ago
Sunnyvale, CA, United States Institute of Foundation Models Full timeAbout the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy. As part of our team, you'll have the opportunity to work on the...
-
Senior Infrastructure Engineer
3 days ago
Sunnyvale, CA, United States Institute of Foundation Models Full timeAbout the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy. As part of our team, you'll have the opportunity to work on the...
-
Machine Learning Infrastructure Engineer
5 days ago
Sunnyvale, CA, United States Apple Full timeMachine Learning Infrastructure Engineer Sunnyvale, California, United States Machine Learning and AI Want to ship amazing experiences in Apple products? Be part of the team in the Video Computer Vision (VCV) organization that focuses on people understanding from real-time video streams and building higher level reasoning algorithms. VCV delivered features...
-
Infrastructure QA Engineer
4 days ago
Sunnyvale, CA, United States Fortinet Full timeFortinet is looking for a Network&Security QA Engineer to join the Infrastructure QA team in Sunnyvale headquarters, California. This is a technical role, delivering testing service for Fortinet datacenter infrastructure. The Infrastructure QA team performs tests designed to validate Fortinet Cloud services for end-to-end scalability and resiliency. The...
-
Infrastructure QA Engineer
1 week ago
Sunnyvale, CA, United States Fortinet Full timeOverview Fortinet is looking for a Network & Security QA Engineer to join the Infrastructure QA team in Sunnyvale headquarters, California. This is a technical role, delivering testing service for Fortinet datacenter infrastructure. The Infrastructure QA team performs tests designed to validate Fortinet Cloud services for end-to-end scalability and...
-
Infrastructure QA Engineer
1 week ago
Sunnyvale, CA, United States Fortinet Full timeFortinet is looking for a Network&Security QA Engineer to join the Infrastructure QA team in Sunnyvale headquarters, California. This is a technical role, delivering testing service for Fortinet datacenter infrastructure. The Infrastructure QA team performs tests designed to validate Fortinet Cloud services for end-to-end scalability and resiliency. The...
-
Staff Cloud Infrastructure Engineer
1 week ago
Sunnyvale, CA, United States Ceribell, Inc Full timeAbout Ceribell Ceribell is a medical technology company focused on transforming the diagnosis and management of patients with serious neurological conditions. The Ceribell System is a novel, point-of-care electroencephalography ("EEG") platform specifically designed to address the unmet needs of patients in the acute care setting, and is being used in...
-
Staff Cloud Infrastructure Engineer
2 days ago
Sunnyvale, CA, United States Ceribell, Inc Full timeAbout Ceribell Ceribell is a medical technology company focused on transforming the diagnosis and management of patients with serious neurological conditions. The Ceribell System is a novel, point-of-care electroencephalography ("EEG") platform specifically designed to address the unmet needs of patients in the acute care setting, and is being used in...
-
Machine Learning Infrastructure Engineer
2 weeks ago
Sunnyvale, CA, United States Institute of Foundation Models Full timeAbout the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy. As part of our team, you'll have the opportunity to work on the...