• Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io
No Result
View All Result
Converge Digest
Friday, April 10, 2026
  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io
No Result
View All Result
Converge Digest
No Result
View All Result

Home » IBM Launches Spyre Accelerator to Power Generative and AgenticAI

IBM Launches Spyre Accelerator to Power Generative and AgenticAI

October 8, 2025
in Data Centers, Enterprise
A A

IBM is bringing its in-house AI accelerator to market with the commercial launch of the Spyre Accelerator, designed to deliver low-latency inference for generative and agentic AI workloads. The chip will debut on October 28 for IBM z17 and LinuxONE 5 systems, followed by Power11 servers in December. Built to operate alongside the company’s Telum II processor, Spyre enables enterprises to run AI models directly on mainframes and servers without compromising on security or throughput.

Developed through the IBM Research AI Hardware Center, Spyre integrates 32 accelerator cores and 25.6 billion transistors on a 5nm system-on-chip, packaged in a 75-watt PCIe card. Up to 48 cards can be clustered in IBM Z or LinuxONE systems and 16 cards in Power servers to scale inferencing workloads. IBM said Spyre maintains on-prem data control while improving efficiency and performance for real-time AI applications such as fraud detection, transaction analytics, and retail automation.

On Power systems, Spyre integrates with IBM’s AI services catalog, enabling one-click deployment of enterprise AI workflows. The accelerator also enhances generative AI throughput with on-chip mixed-matrix acceleration (MMA) and can process up to 8 million documents per hour for knowledge-base integration. The product reflects IBM’s strategy to pair its semiconductor innovation pipeline with enterprise infrastructure that supports AI at scale across hybrid cloud environments.

• Commercial availability: Oct. 28 for z17/LinuxONE 5; Dec. for Power11

• Architecture: 32 AI cores, 25.6B transistors, 5nm SoC, 75W PCIe form factor

• Scalability: Up to 48 cards (IBM Z/LinuxONE) or 16 cards (Power)

• Integration: Works with Telum II CPU and MMA accelerators

• Primary workloads: Generative and agentic AI, fraud detection, retail automation

• Origin: Developed by IBM Research AI Hardware Center, Yorktown Heights

“With the Spyre Accelerator, we’re extending the capabilities of our systems to support multi-model AI — including generative and agentic AI,” said Barry Baker, COO of IBM Infrastructure. “This innovation positions clients to scale their AI-enabled mission-critical workloads with uncompromising security, resilience, and efficiency.”

🌐  Analysis: Spyre marks a major milestone in IBM’s push to embed AI acceleration directly into its enterprise systems rather than relying on external GPUs. The approach mirrors broader industry trends toward heterogeneous compute, as seen in NVIDIA’s Grace Hopper and AMD’s MI300 platforms, but emphasizes secure, on-prem inference for regulated industries. IBM’s integration of Spyre with z17 and LinuxONE further strengthens its hybrid cloud portfolio and positions the company as a unique AI infrastructure provider bridging mainframes and AI compute.

🌐 We’re tracking the latest developments in semiconductors. Follow our ongoing coverage at: https://convergedigest.com/category/semiconductors/

Tags: IBM
ShareTweetShare
Previous Post

AST SpaceMobile and Verizon Sign Definitive Agreement for Space-Based Cellular Broadband

Next Post

Cisco Advances Open Networking with SONiC Support on 8000 Series Switches

Jim Carroll

Jim Carroll

Editor and Publisher, Converge! Network Digest, Optical Networks Daily - Covering the full stack of network convergence from Silicon Valley

Related Posts

IBM and Cisco Aim for Networked, Fault-Tolerant Quantum by Early 2030s
Quantum

IBM and Cisco Aim for Networked, Fault-Tolerant Quantum by Early 2030s

November 20, 2025
IBM Doubles Quantum R&D Speed With 300 mm Fab
Quantum

IBM Doubles Quantum R&D Speed With 300 mm Fab

November 12, 2025
Airtel and IBM Partner to Expand AI-Ready Cloud Infrastructure
All

Airtel and IBM Partner to Expand AI-Ready Cloud Infrastructure

October 23, 2025
Video: AI Automation for Real-Time, Multi-Vendor Networks
Video

#AINetworking25: What’s Next for AI-Driven Networking – Director’s Cut

October 6, 2025
IBM Launches Power11 Server Family with AI Acceleration
Data Centers

IBM Launches Power11 Server Family with AI Acceleration

July 8, 2025
Lumen Brings AI to the Edge with IBM watsonx and Sub-5ms Infrastructure
Clouds and Carriers

Lumen Brings AI to the Edge with IBM watsonx and Sub-5ms Infrastructure

May 6, 2025
Next Post
Cisco Advances Open Networking with SONiC Support on 8000 Series Switches

Cisco Advances Open Networking with SONiC Support on 8000 Series Switches

Categories

  • 5G / 6G / Wi-Fi
  • AI Infrastructure
  • All
  • Automotive Networking
  • Blueprints
  • Clouds and Carriers
  • Data Centers
  • Enterprise
  • Explainer
  • Feature
  • Financials
  • Last Mile / Middle Mile
  • Legal / Regulatory
  • Optical
  • Quantum
  • Research
  • Security
  • Semiconductors
  • Space
  • Start-ups
  • Subsea
  • Sustainability
  • Video
  • Webinars

Archives

Tags

5G All AT&T Australia AWS Blueprint columns BroadbandWireless Broadcom China Ciena Cisco Data Centers Dell'Oro Ericsson FCC Financial Financials Huawei Infinera Intel Japan Juniper Last Mile Last Mille LTE Mergers and Acquisitions Mobile NFV Nokia Optical Packet Systems PacketVoice People Regulatory Satellite SDN Service Providers Silicon Silicon Valley StandardsWatch Storage TTP UK Verizon Wi-Fi
Converge Digest

A private dossier for networking and telecoms

Follow Us

  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io

© 2025 Converge Digest - A private dossier for networking and telecoms.

No Result
View All Result
  • Home
  • Events Calendar
  • Blueprint Guidelines
  • Privacy Policy
  • Subscribe to Daily Newsletter
  • NextGenInfra.io

© 2025 Converge Digest - A private dossier for networking and telecoms.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version