Distributed Software Engineer
4 months ago
Cerebras Systems has pioneered a groundbreaking chip and system that revolutionizes deep learning applications. Our system empowers ML researchers to achieve unprecedented speeds in training and inference workloads, propelling AI innovation to new horizons.
Condor Galaxy 1 (CG-1), a supercomputer set to revolutionize the world of artificial intelligence. With an astounding processing power of 4 ExaFLOPs, 54 million cores, and a cutting-edge 64-node architecture, the CG-1 is the first milestone of a larger project that will redefine the possibilities of AI.
The successful completion and deployment of the CG-1, the first of nine powerful supercomputers, is a significant achievement for Cerebras. As we enter phase 2 of the project with CG2, we are taking a bold step towards creating a network of interconnected supercomputers that will collectively deliver a mind-boggling 36 ExaFLOPs of AI compute power upon completion.
The RoleCerebras Systems is a pioneer in large-scale AI Supercomputers. These multi-exaflop supercomputers are deployed in some of the biggest datacenters. These supercomputers are built using our Wafer-Scale Cluster technology - a cluster of several Wafer Scale Engine (WSE) chips. The Cluster engineering team is responsible for delivering software that are all-things related to cluster.
Responsibilities- Automate bare-metal configuration of networking, OS, and application software in large clusters of Cerebras WSE, servers, and switches
- Additional push button workflows for cluster upgrades, downgrades, and security patching with key metrics to minimize downtime on clusters
- An orchestration and scheduler system for resource allocation, job submission & placements for a multi-user environment on a cluster
- Seamless support for both on-premise and cloud mode deployment and operations
- A robust system for monitoring, detecting and handling failures for a variety of resources on the clusters (including High Availability of clusters)
- Broad cluster and job monitoring and visualization capabilities, along with alerting systems
- User facing tools to monitor the status of jobs and collect metrics
- Administrator facing tools to manage and operate large clusters
- Strong track record of software architecture, system design and development for over 6 years or more
- Strong track record of development in distributed cluster environment.
- Strong understanding of Kubernetes (K8s) software ecosystem, Prometheus and Grafana
- Strong development skills in GoLang, Python, bash
- Strong debugging skills with distributed systems
- Strong skill to develop tests for the new features and regress old features
People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:
- Build a breakthrough AI platform beyond the constraints of the GPU
- Publish and open source their cutting-edge AI research
- Work on one of the fastest AI supercomputers in the world
- Enjoy job stability with startup vitality
- Our simple, non-corporate work culture that respects individual beliefs
Read our blog: Five Reasons to Join Cerebras in 2024.
Apply today and become part of the forefront of groundbreaking advancements in AI.Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.
-
Distributed Systems Engineer
3 weeks ago
sunnyvale, United States Figure Full timeFigure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. Figure’s vision is to deploy autonomous humanoids at a global scale. Our AI team is looking...
-
Senior Software Engineer
7 days ago
Sunnyvale, United States Lynx Software Technologies Full timeThompson Software Solutions is seeking a mid to senior level Software Engineer who is ready to work with a talented team to provide innovative solutions for tomorrows problems. This position requires a software engineer to use a wide application of technical principles, theories, and concepts in the software field to develop, integrate, and test software...
-
Senior Software Engineer
1 month ago
Sunnyvale, United States Lynx Software Technologies Full timeThompson Software Solutions is seeking a mid to senior level Software Engineer who is ready to work with a talented team to provide innovative solutions for tomorrows problems. This position requires a software engineer to use a wide application of technical principles, theories, and concepts in the software field to develop, integrate, and test software...
-
Distributed Systems Engineer
1 week ago
Sunnyvale, United States Figure Full timeFigure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. Figure's vision is to deploy autonomous humanoids at a global scale. Our AI team is looking for...
-
Sunnyvale, California, United States Walmart Global Tech Full timeAbout Walmart Global TechImagine working in an environment where one line of code can make life easier for hundreds of millions of people. That's what we do at Walmart Global Tech.We're a team of software engineers, data scientists, cybersecurity experts, and service professionals within the world's leading retailer who make an epic impact and are at the...
-
Senior Software Engineer for NLP and ML
7 days ago
Sunnyvale, California, United States Intelliswift Software Full timeAbout the RoleWe are seeking a skilled Full Stack Software Engineer to join our team at Intelliswift Software. The ideal candidate will have a strong background in natural language processing (NLP) and machine learning (ML), as well as experience with web application development.This is an exciting opportunity to work on a cutting-edge project that involves...
-
Sunnyvale, United States SB Telecom America Corp. Full timeAbout Softbank: Softbank is making significant investments in infrastructure for AI. Softbank Corp. has recently established a new US center in Silicon Valley, focused on infrastructure software for AI and AI foundations for mobile networks. Our goals are to challenge the norms and create products making use of our SOTA infrastructure and cloud-native...
-
Software Engineer
24 hours ago
Sunnyvale, United States iHealth Labs Full timeiHealth Labs is on a mission to inspire and enable people to manage diabetes and hypertension. We are working side by side with patients and doctors in the US. iHealth is a Sunnyvale-based healthcare and technology startup that introduced the first smartphone-connected blood pressure monitor in the world and is offering a line of award-winning mobile health...
-
Software Engineer
1 day ago
Sunnyvale, United States TBWA\Chiat\Day Full timeFigure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. We are looking for solid, experienced software engineers with a penchant for solving complex...
-
Senior Software Engineer
3 weeks ago
sunnyvale, United States VDart Full timePosition: Senior Software EngineerLocation: Sunnyvale, CA (Onsite)Mode of Hire: ContractJob Description:As a Senior Engineer, you will participate in feature development, be responsible for overall codebase quality by participating in design reviews, code reviews, setting coding guidelines and general technical discussion, as well as be a champion for...
-
Senior Software Engineering Lead
20 hours ago
Sunnyvale, California, United States Walmart Full timeJob Summary:We are seeking a highly experienced Senior Software Engineering Lead to join our team as a Principal Software Engineer/Architect. The successful candidate will lead and direct large-scale, complex, cross-functional projects by reviewing project requirements and translating them into technical solutions.About the Role:Main Responsibilities:Define...
-
Staff Software Engineer
2 months ago
sunnyvale, United States Walmart Global Tech Full timeAbout the team:Join our Walmart’s Display Ad team of skilled engineers and help shape the performance optimization strategies of our cutting-edge systems. If you're a passionate and driven individual with a knack for uncovering system bottlenecks and fine-tuning performance, we encourage you to apply and be a part of our innovative journey.Position...
-
Staff Software Engineer
2 months ago
sunnyvale, United States Walmart Global Tech Full timeAbout the team:Join our Walmart’s Display Ad team of skilled engineers and help shape the performance optimization strategies of our cutting-edge systems. If you're a passionate and driven individual with a knack for uncovering system bottlenecks and fine-tuning performance, we encourage you to apply and be a part of our innovative journey.Position...
-
Staff Software Engineer
4 months ago
Sunnyvale, United States Walmart Global Tech Full timeAbout the team:Join our Walmart’s Display Ad team of skilled engineers and help shape the performance optimization strategies of our cutting-edge systems. If you're a passionate and driven individual with a knack for uncovering system bottlenecks and fine-tuning performance, we encourage you to apply and be a part of our innovative journey.Position...
-
Walmart Senior Software Engineer
5 days ago
Sunnyvale, California, United States Walmart Full timeJob SummaryWe are seeking an experienced Senior Software Engineer - Innovative Solution Architect to join our team at Walmart.About the RoleThis role will involve designing and developing innovative software solutions to meet the business needs of Walmart. The ideal candidate will have a strong background in software engineering, experience with distributed...
-
Cloud Network Infrastructure Engineer
5 days ago
Sunnyvale, California, United States Apple Full timeBe the driving force behind Apple's cloud network infrastructureWe're seeking an exceptional Cloud Network Infrastructure Engineer to join our team as a Software Engineering Manager. This role will play a vital part in shaping the future of Apple's cloud services, ensuring they are scalable, resilient, and highly available.About the RoleIn this position, you...
-
Sunnyvale, United States Google Full timeMinimum Qualifications: Master's degree in Computer Science or Compute Architecture, or equivalent practical experience. 15 years of experience in software development, design and architecture, data structures/logarithms, and testing. 8 years of experience with QA engineering delivery. Experience building and developing large-scale infrastructure,...
-
Software Engineer
1 week ago
Sunnyvale, United States Avispa Technology Full timeSoftware Engineer 14742 A leading professional networking company is seeking a Software Engineer to join our order management system team in an on-call operations-focused role. The successful candidate will build, ship, and release code to keep our systems up to date with the company's technical stack and standards for quality software engineering. The...
-
Software Engineer
6 days ago
Sunnyvale, United States Avispa Technology Full timeSoftware Engineer 14742 A leading professional networking company is seeking a Software Engineer to join our order management system team in an on-call operations-focused role. The successful candidate will build, ship, and release code to keep our systems up to date with the company's technical stack and standards for quality software engineering. The...
-
Software Engineer
5 days ago
Sunnyvale, United States Avispa Technology Full timeJob Description Software Engineer 14742 A leading professional networking company is seeking a Software Engineer to join our order management system team in an on-call operations-focused role. The successful candidate will build, ship, and release code to keep our systems up to date with the company's technical stack and standards for quality software...