AI Systems Testing Engineer

2 weeks ago


Sunnyvale, California, United States Cerebras Full time

Cerebras Systems has revolutionized the landscape of deep learning with its innovative chip and system, enabling machine learning researchers to achieve remarkable speeds in both training and inference tasks, thus driving AI advancements to unprecedented levels.

The recently introduced Condor Galaxy 1 (CG-1) exemplifies Cerebras' dedication to advancing AI computing capabilities. Featuring an extraordinary processing power of 4 ExaFLOPS, along with 54 million cores and a 64-node architecture, the CG-1 is the first of nine supercomputers being developed in collaboration with G42. This partnership is set to redefine AI potential by establishing a network of supercomputers that will collectively provide an astounding 36 ExaFLOPS of AI computational power upon completion.

Position Overview
  • Assess and recommend Data Center equipment such as Switches, Routers, Servers, NICs, and Transceivers for next-generation infrastructure, focusing on enhancing performance and cost-effectiveness.
Key Responsibilities
  • Identify experiments, tools, and methodologies to evaluate complex AI Infrastructure equipment, including Switches, Routers, Servers, NICs, and Transceivers, that challenge the limits of hardware design and system integration.
  • Collaborate with equipment vendors to assess the performance of newly launched hardware and address any defects.
  • Design and establish test labs and test beds to evaluate vendor equipment from leading companies.
  • Work alongside architects and software engineers to develop test cases, write test scripts, execute tests, and document evaluation results from various vendors.
  • Troubleshoot, isolate, and resolve issues through collaboration with other teams and vendors.
  • Provide innovative solutions for efficient networking design tailored for AI infrastructure.
  • Design, install, configure, and maintain complex networks specifically for AI Infrastructure.
  • Develop and optimize server system benchmarks based on a comprehensive understanding of server architecture and workload characterization.
Required Qualifications and Skills
  • Over 15 years of experience in Software Development, Quality Assurance, and System Testing of Switches and Routers within a networking equipment environment.
  • A Bachelor's degree or higher in Electrical Engineering, Computer Engineering, Computer Science, or related fields.
  • In-depth understanding of RDMA congestion control mechanisms on InfiniBand and RoCE Networks.
  • Strong knowledge of networking protocols including BGP, PFC, ECN, QoS, MLAG, ECMP, and VRF.
  • Experience with computer system architecture, particularly CPU SoC or Platform Architecture, Interconnect Fabric, and Memory subsystems.
  • Proven track record in designing and implementing large-scale switching and routing networks.
  • Exceptional technical skills, problem-solving abilities, design, coding, and debugging expertise.
  • Proficiency in Linux tools such as lspci, ping, traceroute, tcpdump, ifconfig, ip link, ip route, arp, /proc/net, /proc/sys/net, vmstat, netstat, ttcp, iperf, strac, memtest, fio, ozone, and iometer.
  • Proficient in Python programming.
  • Experience with Networking Test Tools like IXIA and Smartbits.
Why Consider Cerebras

At Cerebras, we have engineered a groundbreaking architecture that is unlocking new avenues for the AI sector. Our rapid growth and numerous model releases signify a pivotal moment in our journey. Team members often cite several compelling reasons for their association with Cerebras:

  • Contribute to a pioneering AI platform that transcends traditional GPU limitations.
  • Engage in publishing and open-sourcing cutting-edge AI research.
  • Work on one of the fastest AI supercomputers globally.
  • Experience job stability combined with the dynamism of a startup environment.
  • Thrive in a straightforward, non-corporate culture that values individual beliefs.
Become part of the forefront of transformative advancements in AI.

Cerebras Systems is dedicated to fostering an inclusive and diverse environment and is proud to be an equal opportunity employer. We celebrate various backgrounds, perspectives, and skills, believing that inclusive teams create superior products and companies. We strive daily to cultivate a work atmosphere that empowers individuals to excel through continuous learning, growth, and mutual support.

This website or its third-party tools process personal data. For more details, please review our CCPA disclosure notice.



  • Sunnyvale, California, United States Cerebras Full time

    Cerebras Systems has developed an innovative chip and system that transforms deep learning applications. Our technology enables machine learning researchers to achieve remarkable speeds in both training and inference tasks, driving AI advancements to unprecedented levels.The recently announced Condor Galaxy 1 (CG-1) exemplifies Cerebras' dedication to...


  • Sunnyvale, California, United States Cerebras Full time

    Cerebras Systems is at the forefront of AI innovation, having developed a revolutionary chip and system that transforms deep learning applications. Our technology enables machine learning researchers to achieve remarkable speeds in both training and inference tasks, driving forward the evolution of artificial intelligence.The recently introduced Condor...


  • Sunnyvale, California, United States AI Technologies LLC. Full time

    Job OverviewJob ID: ConfidentialSpecialized Area: Advanced AnalyticsJob Title: Machine Learning EngineerCompany: AI Technologies LLC.Duration: 6 MonthsTransforms business requirements into actionable machine learning strategiesDevelops and implements scalable machine learning solutions to drive business growthCollaborates with cross-functional teams to...


  • Sunnyvale, California, United States Cerebras Full time

    Cerebras Systems is at the forefront of innovation in AI computing, having developed a revolutionary chip and system that transforms deep learning applications. Our technology enables machine learning researchers to achieve remarkable speeds in both training and inference tasks, driving AI advancements to unprecedented levels.The recently introduced Condor...

  • Systems Test Engineer

    2 weeks ago


    Sunnyvale, California, United States Lockheed Martin Full time

    Position Overview:At Lockheed Martin, we unite passionate individuals to drive purposeful innovation, ensuring safety and addressing the world's intricate challenges. Our workforce comprises some of the brightest minds in the sector, making Lockheed Martin an exceptional workplace.We prioritize our employees by offering diverse career paths aimed at...


  • Sunnyvale, California, United States AppLab Systems, Inc Full time

    About the RoleWe are seeking a highly skilled Machine Learning Engineer to join our team at AppLab Systems, Inc. as an AI Vision Expert. This is an exciting opportunity to work on cutting-edge projects and contribute to the development of innovative AI solutions.Key ResponsibilitiesDevelop and Optimize AI Models: Design, implement, and refine machine...


  • Sunnyvale, California, United States Lockheed Martin Full time

    Position Overview:At Lockheed Martin, we unite individuals who are driven by a commitment to innovative solutions, ensuring safety and addressing the world's intricate challenges. Our workforce comprises some of the brightest minds in the sector, making Lockheed Martin an exceptional workplace. We prioritize our employees by offering a range of career paths...


  • Sunnyvale, California, United States Infobahn SoftWorld Inc Full time

    Job Description**Job Title:** Prompt Engineer - AI Innovation**Job Summary:** We are seeking a skilled Prompt Engineer to join our team at Infobahn SoftWorld Inc. as a key member of our Converse Platform Team. The successful candidate will be responsible for designing, evaluating, and improving our conversational AI capabilities.Key Responsibilities:Develop...


  • Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-oriented provider of tailored computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies. With over 15 years of experience, we have earned the trust of our clients through our innovative problem-solving capabilities, comprehensive...


  • Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-centric provider specializing in tailor-made computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies. With over 15 years of experience, we have earned the trust of our clients through our innovative problem-solving capabilities,...


  • Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-centric provider of tailored computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies.Position Overview:The primary role of a Test Engineer is to investigate and evaluate the design, functionality, and upkeep of products, systems,...

  • Senior Test Engineer

    2 weeks ago


    Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-oriented provider specializing in tailor-made computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies.Position Overview:The primary role of a Test Engineer is to investigate and evaluate the design, functionality, and upkeep of...


  • Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-oriented provider of tailored computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies.Position Overview:The primary role of a Test Engineer is to investigate and evaluate the design, functionality, and upkeep of products, systems,...


  • Sunnyvale, California, United States Iron Systems Full time

    Company Overview:Iron Systems is a forward-thinking, client-centric provider of tailored computing infrastructure solutions, including network servers, storage systems, OEM/ODM appliances, and embedded technologies.Position Overview:The primary role of a Test Engineer is to investigate and evaluate the design, functionality, and upkeep of products, systems,...


  • Sunnyvale, California, United States NORTHROP GRUMMAN Full time

    About the RoleWe are seeking a highly skilled Systems Integration and Test Engineer / Principal Systems Integration and Test Engineer to join our dynamic team at Northrop Grumman. As a key member of our Systems Integration and Test group, you will play a critical role in ensuring the success of our propulsion and power generation equipment.Key...


  • Sunnyvale, California, United States Google Cloud - Minnesota Full time

    About the RoleAs a Software Engineering Manager at Google Cloud - Minnesota, you will be responsible for leading research explorations and applied AI efforts to develop Generative AI in partnership with Google DeepMind. This involves transforming software development workflows at Google through AI-assisted coding, debugging, testing, and chat agents.Key...


  • Sunnyvale, California, United States NORTHROP GRUMMAN Full time

    About the RoleWe are seeking a highly skilled Systems Integration and Test Engineer / Principal Systems Integration and Test Engineer to join our team at Northrop Grumman. As a key member of our Systems Integration and Test group, you will play a critical role in ensuring the quality and reliability of our propulsion and power generation systems.Key...


  • Sunnyvale, California, United States Links Technology Solutions Inc Full time

    Links Technology Solutions Inc is seeking a skilled and seasoned Lead AI Solutions Engineer to become a vital part of our client's organization. In this role, you will be instrumental in pioneering a new business segment for a Department of Defense initiative. Your leadership will be key in guiding a team of engineers to create innovative artificial...


  • Sunnyvale, California, United States L&T Technology Services Full time

    As a Battery Performance Validation Specialist, you will join the battery team, collaborating closely with cross-functional partners in systems engineering, thermal engineering, and software engineering to ensure that the battery module meets performance metrics at the system level.Key Responsibilities:Collaborate with validation engineers to implement test...


  • Sunnyvale, California, United States Bosch Group Full time

    Position Overview:The Bosch Group is seeking a dedicated engineer to enhance our research and development efforts in electrochemical technologies.Key Responsibilities:Conduct routine assessments, upkeep, and calibration of testing apparatus.Collaborate with equipment manufacturers to diagnose issues and enhance performance.Innovate and build devices for...