Production Network Engineer

3 weeks ago


Menlo Park, United States Meta Platforms Full time

Meta's AI Training and Inference Infrastructure is growing exponentially to support ever-increasing use cases of AI. We need to build, scale, and evolve the network infrastructure that connects myriad GPUs together. Simple, elegant, and scalable network design, automation, and data analytics are the keys to meeting our demands.

In this role, you will be part of a team that is responsible for conceiving design solutions and for developing, testing, and deploying the network software, systems, and tools that keep the Data Center network operating at maximum reliability, scalability, and efficiency. Engineers in this role are hybrid software and network engineers who leverage their network engineering skills to research and design new generations of network architectures and related systems, and use their software development skills to reliably introduce them at scale in production.

Production Network Engineer Responsibilities

  • Partner with network hardware, software, and vendor teams on the design and development of network topologies and network platforms (switch and optics)
  • Codify the network designs by partnering with the in-house Software Engineer, Tooling, Planning, Simulation, and Delivery teams
  • Develop test automation frameworks, integrated into a Continuous Integration/Continuous Deployment pipeline, to qualify the network hardware and software stack for both the in-house Facebook Open Switching System (FBOSS) and vendor platforms before they are pushed to production
  • Develop tests that qualify complex network migration procedures in lab/emulation before executing the same procedures in production
  • Work closely with our hardware, software, and sourcing teams to develop new networking solutions and influence the future of networking and its associated infrastructure
  • Be on call to learn from real-world production challenges and apply the lessons to improve current and future generation products

Minimum Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 6+ years of experience working on networks supporting large-scale training workloads
  • Experience in designing, deploying, and operating datacenter networks at scale
  • Experience coding in languages like Python, C++, Go
  • Experience in network automation software leveraging software-defined networking principles
  • Experience configuring and troubleshooting routing and switching protocols (BGP, IS-IS, OSPF, MPLS, RSVP-TE)
  • Working knowledge of network protocols (TCP/UDP, DHCP, DNS) and experience with IPv4 and IPv6

Preferred Qualifications

  • Understanding of AI training workloads and the demands they exert on networks
  • Understanding of RDMA congestion control mechanisms on RoCE networks
  • Working knowledge of 40/100/400G Ethernet and CWDM, DWDM, and optical transport network technologies
  • Understanding of different optics and the internals of a switch ASIC



  • Menlo Park, United States META Full time

    Summary: Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI. We need to build, scale and evolve our network infrastructure that connects myriads of GPUs together. Simple, elegant, and scalable network design, automation, and data analytics are the keys to meeting our demands. In this role, you...


  • Menlo Park, CA, United States META Full time

    Summary: Meta is seeking a Production Engineer with in-depth understanding of networking, systems, automation, and tooling to join the PE Network team. This team is responsible for deploying and managing one of the world's largest and most complex networks. Meta's network is a foundational component in achieving the company's AI goals and this role would...


  • Menlo Park, United States META Full time

    Summary: The Network Infrastructure team is responsible for designing, building and operating one of the largest networks in the world. Networking is at the core of all Meta products and experiences, and we are looking for Production Engineers who are interested in solving complex technical challenges in the Backbone or Datacenter Network domains. Production...


  • Menlo Park, United States META Full time

    The Network Infrastructure team is responsible for designing, building and operating one of the largest networks in the world. Networking is at the core of all Meta products and experiences, and we are looking for Production Engineers who are interested in solving complex technical challenges in the Backbone or Datacenter Network domains. Production Network...


  • Menlo Park, CA, United States Meta Inc Full time

    The Network Infrastructure team is responsible for designing, building and operating one of the largest networks in the world. Networking is at the core of all Meta products and experiences, and we are looking for Production Engineers who are interested in solving complex technical challenges in the Backbone or Datacenter Network domains. Production Network...


  • Menlo Park, United States Meta Full time

    Network Production Engineer. Network Production Engineer Responsibilities: Design, develop, and operate large-scale network systems to support AI training jobs/workloads. Research and deploy new technologies and network topologies to evolve and scale AI networks, collaborating closely with hardware, software, and datacenter teams. Qualify and test new...


  • Menlo Park, United States Meta Platforms Full time

    Network Production Engineer, Design. Data Center Network Engineers at Meta are hybrid software and network engineers who design, build, and operate our worldwide Data Center network. This team owns the complete lifecycle of the Data Center network, which includes areas of planning, design, product definition, QA, deployment, and monitoring. Simple, elegant,...


  • Menlo Park, United States META Full time

    Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens...


  • Menlo Park, California, United States Meta Full time $120,000 - $180,000 per year

    Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens...


  • Menlo Park, California, United States Meta Full time $120,000 - $250,000 per year

    Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps and services like Messenger, Instagram, and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens...