Site icon Converge Digest

Cisco Expands AI Infrastructure with New GPU Servers and AI PODs for Enterprise

Cisco introduced substantial updates to its AI infrastructure portfolio at the Cisco Partner Summit in Los Angeles, unveiling a dedicated family of AI servers and AI PODs designed to accelerate and simplify AI adoption for enterprise customers. The newly launched UCS C885A M8 servers are built specifically for GPU-intensive AI workloads and harness NVIDIA’s HGX supercomputing platform, integrating H100 and H200 Tensor Core GPUs to deliver high-performance AI computing. These eight-way accelerated servers represent Cisco’s first dedicated foray into the AI server market and are equipped with NVIDIA NICs or SuperNICs to optimize AI networking and bolster security through NVIDIA BlueField-3 DPUs. Meanwhile, Cisco’s AI PODs provide modular, use-case-specific stacks of compute, networking, storage, and cloud management, allowing businesses to deploy AI solutions with greater speed and reduced complexity.

The increasing volume of AI workloads across industries has amplified demand for secure, scalable, and programmable infrastructure capable of supporting advanced AI applications. Cisco’s latest offerings are designed to bridge this gap, providing modular, adaptable options for enterprises whether they’re beginning AI integration or expanding existing capabilities. Managed through Cisco Intersight, both the UCS servers and AI PODs bring centralized control and automation, simplifying everything from initial configuration to daily operations, making AI adoption smoother. According to Cisco’s AI Readiness Index, 89% of enterprises plan to implement AI workloads within the next two years, though only 14% currently have infrastructure that’s fully capable of handling these advanced tasks. Cisco aims to provide the technological foundation to support this rapid AI growth.

Cisco’s new infrastructure offerings come as part of a broader ecosystem strategy, collaborating closely with technology partners to facilitate innovation in the enterprise AI space. The new systems integrate seamlessly with Cisco’s Nexus switching platforms, including the 800G-capable models powered by the Cisco Silicon One G200 chip, which ensure robust data transfer speeds and reliability for demanding AI applications. Both the UCS C885A M8 servers and the AI PODs will be available for order in late 2024, with Cisco offering financing solutions to help customers manage payment risks and cash flow. Cisco also highlights partner solutions to enhance AI deployment and build out capabilities efficiently.

“A new era of intelligent generative AI applications is driving transformation for enterprises worldwide. With the debut of its dedicated AI server portfolio, AI PODs, and expanding systems with full-stack NVIDIA acceleration, Cisco is advancing the infrastructure enterprises need to grow in the AI age,” stated Bob Pette, vice president, Enterprise Platforms at NVIDIA.

Exit mobile version