We have other current jobs related to this field that you can find below


  • Menlo Park, United States META Full time

    Meta is seeking an engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services. The RTP team is responsible for the Hardware Lifecycle of all Meta servers including pre-production hands-on system and hardware...


  • Menlo Park, United States META Full time

    Hardware Systems Engineer, NPI Apply to this job Location pin icon Menlo Park, CA Apply to this job Meta is seeking an engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services. The RTP team is responsible for...


  • Menlo Park, United States META Full time

    Hardware Systems Engineer, NPI Apply to this job Location pin icon Menlo Park, CA Apply to this job Meta is seeking an engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services. The RTP team is responsible for...


  • Menlo Park, United States META Full time

    Hardware Engineer, Power Apply to this job Location pin icon Menlo Park, CA •Fremont, CA Apply to this job Meta's mission is backed by a massive hardware infrastructure. Our computational challenges are big, complex, and consistently evolving. Your work has the potential to shape the compute hardware and AI hardware going into our cutting-edge data...


  • Menlo Park, California, United States META Full time

    Meta is on the lookout for a Systems Engineering Specialist to become a vital part of our Release to Production (RTP) team. Our infrastructure, comprising servers and data centers, is crucial for the seamless operation of our rapidly expanding services. The RTP team oversees the Hardware Lifecycle of all Meta servers, engaging in hands-on system and hardware...

  • Optical Engineer

    1 month ago


    Menlo Park, United States META Full time

    The Optical Technologies Group enables optical communication hardware for Meta data center networking and AI/ML systems to support Meta's mission of bringing the world together. We are looking for an Optical Engineer to qualify optical communications modules for Meta's cutting-edge, global, data center network. As an Optical Engineer, you will have a unique...


  • Menlo Park, California, United States Facebook Full time

    Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. This results in a dramatic scaling challenge that our engineers have to deal with on a daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need...


  • Menlo Park, United States Facebook Full time

    Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. This results in a dramatic scaling challenge that our engineers have to deal with on a daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need...


  • Menlo Park, California, United States Modern Mechanical Systems, Inc. Full time

    Meta is seeking a software engineer to join our AI & Systems Co–Design team to drive the definition of our next–generation compute and storage architectures. This person will work cross–functionally with internal software and platforms engineering teams to understand the workloads and infrastructure requirements. They will drive technology...

  • Optical Engineer

    2 months ago


    Menlo Park, United States META Full time

    The Optical Technologies Group enables optical communication hardware for Meta data center networking and AI/ML systems to support Meta's mission of bringing the world together. We are looking for an Optical Engineer to qualify optical communications modules for Meta's cutting-edge, global, data center network. As an Optical Engineer, you will have a unique...


  • Menlo Park, California, United States Mainspring Energy, Inc. Full time

    Position OverviewCompany BackgroundMainspring Energy, Inc. is at the forefront of revolutionizing power generation with its innovative linear generator technology. Our mission is to facilitate the transition to a sustainable, net-zero carbon energy grid, providing scalable and adaptable power solutions.Our linear generator technology is uniquely positioned...


  • Menlo Park, California, United States Character AI Full time

    About the role Responsibilities As a Multimodal Site Reliability Engineer (SRE) at Character, you will be responsible for ensuring the reliability, scalability, and performance of our app and AI multimodal services (eg, voice interfacing services). You will work closely with our development team to design and implement processes and systems that ensure the...

  • Optical Engineer

    1 month ago


    Menlo Park, California, United States Meta Full time

    The Optical Technologies Group enables optical communication hardware for Meta data center networking and AI/ML systems to support Meta's mission of bringing the world together. We are looking for an Optical Engineer to qualify optical communications modules for Meta's cutting-edge, global, data center network. As an Optical Engineer, you will have a unique...


  • Menlo Park, California, United States AI Technologies LLC. Full time

    Position Overview:Field of Expertise: Machine LearningPosition: Machine Learning EngineerContract Length: 8 MonthsEmployer: To Be Discussed LaterWe are seeking a proficient Machine Learning Engineer to contribute to our innovative team at AI Technologies LLC. This role encompasses engagement with advanced technologies in the realm of machine learning systems...


  • Menlo Park, California, United States AI Technologies LLC. Full time

    Position Overview:Field of Expertise: Machine LearningRole: Machine Learning EngineerContract Length: 8 MonthsEmployer: To Be Discussed LaterWe are seeking a proficient Machine Learning Engineer to contribute to our innovative team at AI Technologies LLC. This role entails engaging with advanced technologies in the realm of machine learning systems and...


  • Menlo Park, United States META Full time

    Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI. This results in a dramatic scaling challenge that our engineers have to deal with on a daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like GPUs together. In addition, we need...


  • Menlo Park, United States nSpire AI Full time

    Senior Full Stack Engineer (React/Next.js & Django/Python)Company Overview:Join nSpire AI ( a cutting-edge AI startup that is redefining Talent Intelligence. Led by a team of industry veterans with a storied history of success in the AI space, we're developing transformative solutions across B2C, B2B2C, and B2B sectors. As we look to continue our growth in...


  • Menlo Park, United States nSpire AI Full time

    Senior Full Stack Engineer (React/Next.js & Django/Python)Company Overview:Join nSpire AI (www.nspireai.com), a cutting-edge AI startup that is redefining Talent Intelligence. Led by a team of industry veterans with a storied history of success in the AI space, we're developing transformative solutions across B2C, B2B2C, and B2B sectors. As we look to...


  • Menlo Park, California, United States HCLTech Full time

    Position OverviewWe are seeking a Senior Solutions Engineer to leverage advanced AI and machine learning methodologies in the development and deployment of innovative generative AI solutions utilizing HCLTech's Llama and other cutting-edge LLMs.Key ResponsibilitiesGain a comprehensive understanding of HCLTech's AI and Llama frameworks, along with their...

  • Mechanical Engineer

    4 weeks ago


    Menlo Park, United States Mainspring Energy, Inc. Full time

    Job DescriptionJob DescriptionCompany OverviewDriven by our vision of the affordable, reliable, net-zero carbon grid of the future, Mainspring has developed a new category of power generation — the linear generator — that delivers local, scalable, and fuel-flexible power to help accelerate the transition to the net-zero carbon grid.The unique combination...

Hardware Systems Engineer, NPI AI

2 months ago


Menlo Park, United States Facebook Full time

Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI Training and Inference. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver our innovative services. The RTP team is responsible for the end-to-end Hardware Lifecycle of all Meta servers including prototyping of experimental HW, pre-production hands-on system and hardware debugging and stress testing, enabling production-ready system monitoring, automated provisioning and automated remediation of issues. RTP team also helps in exploring, developing and productizing high-performance software and hardware technologies for AI at datacenter scale.RTP Engineers work closely with HW/SW co-design teams, hardware designers, networking teams, system manufacturers, component vendors, capacity engineering, production engineering, production services, and data center operations teams to enable new systems that will be deployed in our production data centers. Ramping to production and solving the datacenter scaling and deployment challenges requires us to take a systems based approach to Silicon bring up and validation. The ideal candidate has hands-on experience with at least a couple of hardware/software (silicon, power, thermal, firmware) lifecycle phases: design/bring up, server integration, system validation, supporting customer deployment, production issue triage.

Hardware Systems Engineer, NPI AI

Responsibilities:

Lead and execute comprehensive end-to-end system validation. Collaborate with hardware design and silicon validation teams to define test strategy at system level. Contribute to new feature/technology development/validation across hardware/software stack. Contribute to enabling hacks for future technology explorations in AI silicon and system space such as memory, network and storage interdependencies in the context of AI workloads. Proactively create experiments and tooling to detect and diagnose hardware/firmware/software health issues. Troubleshoot, diagnose and root cause of system failures and isolate the components/failure scenarios while working with internal & external partners. Develop visibility through data visualization and implement systemic solutions to hardware health issues. Leverage production experience to drive external and internal teams to continuously improve product quality. Communicate complex technical findings to diverse stakeholders at all levels. Proactively identify and mitigate potential product risks based on testing insight.

Minimum Qualifications:

Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta. BS or advanced degree in Electrical Engineering, Computer Engineering, Computer Science, Engineering, Math, Physics or a related field or equivalent experience. 3+ years of experience in hands-on SW/FW/HW engineering to build systems/products for data center environments, consumer devices/HW, or similar. Work experience in one or more domains such as: AI ASIC development (Silicon design, bringup, characterization, validation), board level debug, firmware validation, system validation. Experience troubleshooting and debugging using lab tools. Experience in developing test specifications, procedures, and debug guides for test solutions. Experience with one or more of the following modules/domains: PCIe, Networking, Flash, Memory, CPU, GPU, DRAM (DDR4/5 or HBM)

Preferred Qualifications:

Proficient in HPC or AI system architecture and Cluster Interconnect technologies. Experience with embedded systems' architecture and components, performance optimization of algorithms, test automation, and instrument communication (oscilloscopes, protocol analyzers, traffic generators, etc.) Experience in debugging tools for systems-on-chip (SoCs) - eg. JTAG, GDB, Trace32 Knowledge of common bus protocols such as I2C, SPI, USB, and/or PCIe. Experience with Linux systems and server systems management. Experience authoring test plans for complex chipsets for functional, stress and performance testing. Experienced in the integration of lab tools for automated workflows with large scale deployments. 2+ years experience scripting automation in Python. Proficiency in continuous integration/continuous delivery tools.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com. #J-18808-Ljbffr