
Scaling Cloud Network Infrastructure for the AI Era

Oct 15, 2024 Hi-network.com

The world has changed dramatically since generative AI made its debut. Businesses are starting to use it to summarize online reviews. Consumers are getting problems resolved through chatbots. Employees are accomplishing their jobs faster with AI assistants. What these AI applications have in common is that they rely on generative AI models trained over high-performance back-end networks in the data center and served through AI inference clusters deployed in front-end data center networks.

Training models can use billions or even trillions of parameters to process massive data sets across artificial intelligence/machine learning (AI/ML) clusters of graphics processing unit (GPU)-based servers. Any delay, such as one caused by network congestion or packet loss, can dramatically impact the accuracy and training time of these AI models. As AI/ML clusters grow ever larger, the platforms used to build them need to support higher port speeds as well as higher radices (that is, more ports per device). A higher radix allows flatter topologies, which reduces the number of network layers and improves performance.
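To see why radix matters, consider a rough back-of-the-envelope model (a sketch, not a design rule): in a non-blocking Clos fabric, the maximum host count grows with the square of the switch radix at two tiers and with the cube at three, so a higher-radix switch reaches the same scale with fewer layers.

```python
# Back-of-the-envelope: how switch radix bounds the size of a
# non-blocking Clos fabric. Illustrative only; real designs also
# account for oversubscription, resiliency, and rail counts.

def max_hosts_two_tier(radix: int) -> int:
    # Each leaf splits its ports: half to hosts, half to spines.
    # Leaf count equals the spine radix, so hosts = radix * (radix / 2).
    return radix * (radix // 2)

def max_hosts_three_tier(radix: int) -> int:
    # A third tier multiplies the fan-out again (fat-tree: radix**3 / 4).
    return (radix // 2) ** 2 * radix

for r in (128, 256, 512):
    print(f"radix {r:3d}: 2-tier up to {max_hosts_two_tier(r):,} hosts, "
          f"3-tier up to {max_hosts_three_tier(r):,} hosts")
```

At a 512-port radix, two tiers already cover on the order of 131,000 hosts, a scale that a lower-radix design could only reach by adding a third tier, with its extra hops, switches, and optics.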

Meeting the demands of high-performance AI clusters

In recent years, GPU scale-out bandwidth needs have increased from 200G to 400G to 800G, accelerating connectivity requirements well beyond those of traditional CPU-based compute solutions. The density of the data center leaf must increase accordingly, while flatter topologies maximize the number of addressable nodes.

To address these needs, we are introducing the Cisco 8122-64EH/EHF with support for 64 ports of 800G. This new platform is powered by the Cisco Silicon One G200, a 5 nm 51.2T processor that uses 512 x 112G SerDes, enabling extreme scaling capabilities in just a two-rack-unit (2RU) form factor (see Figure 1). With 64 QSFP-DD800 or OSFP interfaces, the Cisco 8122 also supports 2x 400G and 8x 100G Ethernet breakout options.

 

Figure 1. Cisco 8122-64EH
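The headline capacity follows directly from the port math; the short sketch below recomputes the aggregate bandwidth and the logical port counts for the breakout options mentioned above.

```python
# Sanity-checking the Cisco 8122's headline numbers (from this post):
# 64 ports x 800G, with 2x 400G and 8x 100G breakout options.

PORTS = 64
PORT_SPEED_G = 800

print(f"Aggregate: {PORTS} x {PORT_SPEED_G}G = "
      f"{PORTS * PORT_SPEED_G / 1000:.1f} Tbps")

# Each 800G port can be broken out into lower-speed logical ports.
for lanes, speed_g in ((2, 400), (8, 100)):
    print(f"Breakout {lanes}x{speed_g}G -> {PORTS * lanes} logical ports")
```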

The Cisco Silicon One architecture, with its fully shared packet buffer for congestion control and its P4-programmable forwarding engine, along with the Silicon One software development kit (SDK), is proven and trusted by hyperscalers globally. Through major innovations, the Cisco Silicon One G200 delivers twice the performance and power efficiency of the previous-generation device, with lower latency.

With the introduction of the Cisco Silicon One G200 last year, Cisco was first to market with a 512-wide radix, which can help cloud providers lower costs, complexity, and latency by designing networks with fewer layers, switches, and optics. Advancements in load balancing, link-failure avoidance, and congestion reaction/avoidance help improve job completion times and reliability at scale for better AI workload performance (see Cisco Silicon One Breaks the 51.2 Tbps Barrier for more details).
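Cisco does not publish the internals of these mechanisms, but the idea behind flowlet-style load balancing, one widely used technique in this space, is easy to sketch: rebind a flow to a new link only after an idle gap, so packets within a burst stay ordered while long-lived flows can migrate off congested links. The toy Python below illustrates the concept; the gap threshold and load metric are assumptions, not Silicon One parameters.

```python
# Toy flowlet-style load balancer (an illustration of the concept,
# not Cisco's in-silicon implementation). A flow is re-hashed onto a
# new link only after an idle gap, so packets within a burst stay in
# order while long-lived flows can migrate off congested links.

from math import inf

LINKS = 8
FLOWLET_GAP_S = 0.0005         # idle gap that opens a new flowlet (assumed)

link_load = [0] * LINKS        # bytes sent per link (toy load metric)
last_seen: dict[int, float] = {}   # flow id -> last packet timestamp
assigned: dict[int, int] = {}      # flow id -> current egress link

def pick_link(flow_id: int, size: int, now: float) -> int:
    """Return the egress link for a packet, rebinding on flowlet gaps."""
    idle = now - last_seen.get(flow_id, -inf)
    if flow_id not in assigned or idle > FLOWLET_GAP_S:
        # New flowlet: safe to move the flow to the least-loaded link
        # without reordering packets inside a burst.
        assigned[flow_id] = min(range(LINKS), key=link_load.__getitem__)
    last_seen[flow_id] = now
    link_load[assigned[flow_id]] += size
    return assigned[flow_id]
```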

The Cisco 8122 supports open network operating systems (NOSs), such as Software for Open Networking in the Cloud (SONiC), and other third-party NOSs. Broad application programming interface (API) support lets cloud providers bring their own management and visibility tooling to operate the network efficiently. With these customizable options, we are making it easier for hyperscalers, and for other cloud providers adopting the hyperscaler model, to meet their requirements.
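As a concrete example of that operational tooling: SONiC stores its state in Redis databases, and interface counters can be read directly from COUNTERS_DB. The minimal sketch below assumes local access on the switch and the stock database layout; the DB index and key names can vary by release, so verify them against your build.

```python
# Minimal sketch: reading interface counters on a SONiC switch, which
# stores state in Redis. Assumes local access and the stock database
# layout; verify the index and key names against your SONiC build.

import redis  # pip install redis

COUNTERS_DB = 2  # conventional SONiC index for COUNTERS_DB (assumption)

db = redis.Redis(host="127.0.0.1", port=6379, db=COUNTERS_DB,
                 decode_responses=True)

# Map front-panel names (e.g., Ethernet0) to SAI object IDs.
name_map = db.hgetall("COUNTERS_PORT_NAME_MAP")

for port, oid in sorted(name_map.items()):
    stats = db.hgetall(f"COUNTERS:{oid}")
    print(port,
          "rx_octets:", stats.get("SAI_PORT_STAT_IF_IN_OCTETS"),
          "tx_octets:", stats.get("SAI_PORT_STAT_IF_OUT_OCTETS"))
```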

In addition to scaling out back-end networks, the Cisco 8122 can also be used for mainstream workloads in front-end networks, such as email and web servers, databases, and other traditional applications.

Improving customer outcomes

With these innovations, cloud providers can benefit from:

  • Simplification: Cloud providers can streamline networks by scaling with fewer, higher-capacity compact systems, fewer networking layers and optics, and less cabling. Fewer platforms to manage also reduces complexity, which can help lower operational costs.
  • Flexibility: An open platform lets cloud providers choose the network operating system (NOS) that best suits their needs and develop custom automation tools to operate the network through APIs.
  • Network velocity: Scaling the infrastructure efficiently removes potential bottlenecks and delays that would otherwise slow response times and degrade AI workload outcomes. Advanced congestion management, optimized reliability capabilities, and increased scalability help enable better network performance for AI/ML clusters.
  • Sustainability: The power efficiency of the Cisco Silicon One G200 can help cloud providers meet data center sustainability goals. The higher radix reduces the number of devices through a flatter structure, helping control power consumption (see the sketch after this list).
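
To make the sustainability point concrete, here is a hedged toy model showing how a higher radix shrinks the switch count, and with it the power and optics footprint, of a non-blocking two-tier fabric. The host count is hypothetical, and the model ignores resiliency, oversubscription, and rail/plane design.

```python
# Toy model: switch count for a non-blocking 2-tier fabric at a fixed
# host count, across radices. The host count is hypothetical, and the
# model ignores resiliency, oversubscription, and rail/plane design.

from math import ceil

def two_tier_devices(hosts: int, radix: int) -> tuple[int, int]:
    down = radix // 2                     # host-facing ports per leaf
    leaves = ceil(hosts / down)
    spines = ceil(leaves * down / radix)  # spine ports to absorb all uplinks
    return leaves, spines

HOSTS = 8_000  # hypothetical GPU-node count
for r in (128, 256, 512):
    leaves, spines = two_tier_devices(HOSTS, r)
    print(f"radix {r:3d}: {leaves} leaves + {spines} spines = "
          f"{leaves + spines} switches for {HOSTS:,} hosts")
```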

The future of cloud network infrastructure

We are giving cloud providers the flexibility to meet critical cloud network infrastructure requirements for AI training and inferencing with the Cisco 8122-64EH/EHF. With this platform, cloud providers can better control costs, latency, space, power consumption, and complexity in both front-end and back-end networks. At Cisco, we are investing in silicon, systems, and optics to help cloud providers build scalable, high-performance data center networks that deliver high-quality results and insights quickly for both AI and mainstream workloads.

The Open Compute Project (OCP) Global Summit takes place October 15-17, 2024, in San Jose. Come visit us in the community lounge to learn more about our exciting new innovations; customers can sign up to see a demo.

 

Explore Cisco 8000 Series


Hot tags: cloud scale, cloud providers, AI, 800G, 800G Ethernet, Cisco 8000, AI/ML clusters, back-end networks, 800G networking
