IBM Introduces the Spyre Accelerator for Commercial Availability
IBM (NYSE: IBM) announced general availability of the Spyre Accelerator to run generative and agentic AI with low-latency inferencing while prioritizing security and resilience for core workloads.
Key facts: Spyre will be generally available on Oct 28, 2025 for IBM z17 and LinuxONE 5, and in early December 2025 for Power11. Each Spyre is a 5nm system-on-chip with 32 accelerator cores, 25.6 billion transistors, and ships on a 75-watt PCIe card. Systems can cluster up to 48 cards on IBM Z/LinuxONE or 16 cards on IBM Power.
IBM positions Spyre for on-prem AI acceleration, integration with Telum II on mainframes, use cases like advanced fraud detection and retail automation, and one-click AI services on Power.
IBM (NYSE: IBM) ha annunciato la disponibilità generale del Spyre Accelerator per eseguire AI generativa e agentica con inferenze a bassa latenza, mantenendo sicurezza e resilienza per i carichi core.
Fatti chiave: Spyre sarà disponibile in generale il 28 ottobre 2025 per IBM z17 e LinuxONE 5, e all'inizio di dicembre 2025 per Power11. Ogni Spyre è un sistema su chip a 5 nm con 32 core acceleratori, 25,6 miliardi di transistor, e si sviluppa su una scheda PCIe da 75 watt. I sistemi possono collegare fino a 48 schede su IBM Z/LinuxONE o 16 schede su IBM Power.
IBM posiziona Spyre come accelerazione AI on-prem, integrazione con Telum II sui mainframe, casi d'uso come rilevamento avanzato delle frodi e automazione al dettaglio, e servizi AI con un clic su Power.
IBM (NYSE: IBM) anunció la disponibilidad general del Spyre Accelerator para ejecutar IA generativa y agentiva con inferencia de baja latencia, priorizando la seguridad y la resiliencia para cargas de trabajo centrales.
Datos clave: Spyre estará disponible en general el 28 de octubre de 2025 para IBM z17 y LinuxONE 5, y a principios de diciembre de 2025 para Power11. Cada Spyre es un sistema en chip de 5 nm con 32 núcleos aceleradores, 25,6 mil millones de transistores, y se presenta en una tarjeta PCIe de 75 W. Los sistemas pueden agruparse hasta 48 tarjetas en IBM Z/LinuxONE o 16 tarjetas en IBM Power.
IBM sitúa Spyre como aceleración de IA on-prem, integración con Telum II en mainframes, casos de uso como detección avanzada de fraude y automatización minorista, y servicios de IA con un clic en Power.
IBM (NYSE: IBM)은 핵심 워크로드에 대한 보안 및 회복력을 우선시하면서 저지연 추론으로 생성적이고 에이전트형 AI를 실행하기 위한 Spyre Accelerator의 일반 공급을 발표했습니다.
주요 정보: Spyre는 IBM z17 및 LinuxONE 5용으로 2025년 10월 28일에 일반 공급되며, Power11용으로는 2025년 12월 초에 공급됩니다. 각 Spyre는 32개의 가속 코어, 약 256억 개의 트랜지스터, 75와트 PCIe 카드에 탑재된 5nm 시스템 온 칩입니다. 시스템은 IBM Z/LinuxONE에서 최대 48장, IBM Power에서 16장의 카드를 클러스터링할 수 있습니다.
IBM은 Spyre를 온프레미스 AI 가속, 메인프레임의 Telum II와의 통합, 고급 사기 탐지 및 소매 자동화와 같은 사용 사례, 그리고 Power에서 원클릭 AI 서비스에 초점을 맞춰 제시합니다.
IBM (NYSE: IBM) a annoncé la disponibilité générale du Spyre Accelerator pour exécuter une IA générative et agentique avec une inférence à faible latence, tout en privilégiant la sécurité et la résilience pour les charges de travail essentielles.
Faits clés : Spyre sera disponible en général le 28 octobre 2025 pour IBM z17 et LinuxONE 5, et au début décembre 2025 pour Power11. Chaque Spyre est un système sur puce (SoC) de 5 nm avec 32 cœurs d'accélération, 25,6 milliards de transistors, et se présente sur une carte PCIe de 75 watts. Les systèmes peuvent être regroupés jusqu'à 48 cartes sur IBM Z/LinuxONE ou 16 cartes sur IBM Power.
IBM positionne Spyre comme accélération IA sur site, intégration avec Telum II sur les mainframes, des cas d'utilisation tels que la détection avancée de fraude et l'automatisation du commerce de détail, et des services IA en un clic sur Power.
IBM (NYSE: IBM) hat die allgemeine Verfügbarkeit des Spyre Accelerator bekannt gegeben, um generative und agentische KI mit geringer Latenz-Inferenz auszuführen, während Sicherheit und Resilienz für Kernarbeitslasten priorisiert werden.
Hauptfakten: Spyre wird am 28.10.2025 allgemein verfügbar sein für IBM z17 und LinuxONE 5, und im Anfang Dezember 2025 für Power11. Jeder Spyre ist ein 5-nm-System-on-Chip mit 32 Beschleuniger-Kernen, 25,6 Milliarden Transistoren und kommt auf einer 75-Watt PCIe-Karte daher. Systeme können bis zu 48 Karten auf IBM Z/LinuxONE oder 16 Karten auf IBM Power clustern.
IBM positioniert Spyre als On-Premises-KI-Beschleunigung, Integration mit Telum II auf Mainframes, Anwendungsfälle wie fortschrittliche Betrugserkennung und Einzelhandel-Automatisierung, sowie One-Click-KI-Dienste auf Power.
IBM (NYSE: IBM) أعلنت عن التوفر العام لـ Spyre Accelerator لتشغيل الذكاء الاصطناعي التوليدي والوكيل مع استدلال منخفض الكمون مع إعطاء الأولوية للأمان والصلابة لحِمل العمل الأساسي.
الحقائق الرئيسية: سيكون Spyre متاحاً بشكل عام في 28 أكتوبر 2025 لـ IBM z17 و LinuxONE 5، وفي أوائل ديسمبر 2025 لـ Power11. كل Spyre هو شريحة نظام على شريحة 5 نانومتر تحتوي على 32 نواة مسرِّعة، 25.6 مليار ترانزستر، ويأتي في بطاقة PCIe بقدرة 75 واط. يمكن أن تتحد الأنظمة حتى 48 بطاقة على IBM Z/LinuxONE أو 16 بطاقة على IBM Power.
تضع IBM Spyre كتعجيل AI داخلي، وتكامل مع Telum II على الماكينرات، وحالات الاستخدام مثل اكتشاف الاحتيال المتقدم وأتمتة البيع بالتجزئة، وخدمات AI بنقرة واحدة على Power.
IBM (NYSE: IBM) 宣布将通用提供 Spyre Accelerator,以低延迟推断运行生成型和代理型 AI,同时优先考虑核心工作负载的安全性和弹性。
要点: Spyre 将于 2025年10月28日 在 IBM z17 和 LinuxONE 5 上通用,并在 2025年12月初 为 Power11 提供。每个 Spyre 都是一个 5nm 的片上系统,具备 32 个加速核心、254 亿个晶体管,并以 75 瓦特 PCIe 卡 形式出货。系统可以在 IBM Z/LinuxONE 上聚集最多 48 张卡,在 IBM Power 上聚集 16 张卡。
IBM 将 Spyre 定位为本地 AI 加速、与主机 Telum II 的集成、用例包括高级欺诈检测和零售自动化,以及在 Power 上的一键 AI 服务。
- General availability on Oct 28, 2025 for z17 and LinuxONE 5
- Commercial 5nm chip with 32 cores and 25.6B transistors
- Scales to 48 cards in IBM Z/LinuxONE systems
- On-prem AI for low-latency inferencing and mission-critical workloads
- Power11 availability delayed to early December 2025
- Maximum scale on Power systems limited to 16 cards
Insights
IBM's Spyre Accelerator reaches commercial availability, enabling low-latency on‑prem generative and agentic AI on IBM Z, LinuxONE and Power.
IBM Spyre Accelerator is now scheduled for general availability on
Functionally, Spyre targets low-latency inferencing for generative and agentic AI while keeping data on-prem to preserve security and resilience. The announcement ties performance claims to specific hardware facts: core counts, process node, card power, and cluster limits. The release also cites integration points such as Telum II coupling on mainframes and a Power AI services catalog with one-click installs.
Risks and dependencies include delivery timing across platforms and the actual deployed stack performance versus stated throughput. Key monitorable items include availability on
Coming this Fall to IBM Z, LinuxONE and Power, IBM Spyre Accelerator Enables Enterprises to Scale Generative and Agentic AI Workloads
Today's IT landscape is changing from traditional logic workflows to agentic AI inferencing. AI agents require low-latency inference and real-time system responsiveness. IBM recognized the need for mainframes and servers to run AI models along with the most demanding enterprise workloads without compromising on throughput. To address this demand, clients need AI inferencing hardware that supports generative and agentic AI while maintaining the security and resilience of core data, transactions, and applications. The accelerator is also built to enable clients to keep mission-critical data on-prem to mitigate risk while addressing operational and energy efficiency.
The IBM Spyre Accelerator reflects the strength of IBM's research-to-product pipeline, combining breakthrough innovation from the IBM Research AI Hardware Center with enterprise-grade development from IBM Infrastructure. Initially introduced as a prototype chip, Spyre was refined through rapid iteration, including cluster deployments at IBM's
The IBM Research prototype has evolved into an enterprise-grade product for use in IBM Z, LinuxONE and Power systems. Today, the Spyre Accelerator is a commercial system-on-a-chip with 32 individual accelerator cores and 25.6 billion transistors. Produced using 5nm node technology, each Spyre is mounted on a 75-watt PCIe card, which makes it possible to cluster up to 48 cards in an IBM Z or LinuxONE system or 16 cards in an IBM Power system to scale AI capabilities.
"One of our key priorities has been advancing infrastructure to meet the demands of new and emerging AI workloads," said Barry Baker, COO, IBM Infrastructure & GM, IBM Systems. "With the Spyre Accelerator, we're extending the capabilities of our systems to support multi-model AI – including generative and agentic AI. This innovation positions clients to scale their AI-enabled mission-critical workloads with uncompromising security, resilience, and efficiency, while unlocking the value of their enterprise data."
"We launched the IBM Research AI Hardware Center in 2019 with a mission to meet the rising computational demands of AI, even before the surge in LLMs and AI models we've recently seen," said Mukesh Khare, GM of IBM Semiconductors and VP of Hybrid Cloud, IBM. "Now, amid increasing demand for advanced AI capabilities, we're proud to see the first chip from the Center enter commercialization, designed to deliver improved performance and productivity to IBM's mainframe and server clients."
For IBM clients, Spyre Accelerators offer fast, secured processing with on-prem AI acceleration. This marks a significant milestone, allowing businesses to leverage AI at scale while keeping data on IBM Z, LinuxONE and Power systems. In mainframe systems, coupled with the Telum II processor for IBM Z and LinuxONE, Spyre offers enhanced security, low latency, and high transaction rate processing power. Leveraging this advanced hardware and software stack, businesses can use Spyre to scale multiple AI models to power predictive use cases such as advanced fraud detection and retail automation.
On IBM Power-based servers, Spyre customers can leverage a catalog of AI services, enabling end-to-end AI for enterprise workflows. Clients can install the AI services from the catalog with just one click.1 Spyre Accelerator for Power, combined with an on-chip accelerator (MMA), also accelerates data conversion for generative AI to deliver high throughput for deep process integrations. Additionally, with a prompt size of 128, it enables the ingestion of more than 8 million documents for knowledge base integration in an hour2. This performance, combined with the IBM software stack, security, scalability, and energy efficiency, supports clients on their journey to integrating generative AI frameworks into their enterprise workloads.
To learn more about the IBM Spyre Accelerator, visit http://www.ibm.com/solutions/ai-accelerator.
Additional resources:
About IBM
IBM is a leading provider of global hybrid cloud and AI, and consulting expertise. We help clients in more than 175 countries capitalize on insights from their data, streamline business processes, reduce costs and gain a competitive edge in their industries. Thousands of governments and corporate entities in critical infrastructure areas such as financial services, telecommunications and healthcare rely on IBM's hybrid cloud platform and Red Hat OpenShift to affect their digital transformations quickly, efficiently and securely. IBM's breakthrough innovations in AI, quantum computing, industry-specific cloud solutions and consulting deliver open and flexible options to our clients. All of this is backed by IBM's long-standing commitment to trust, transparency, responsibility, inclusivity and service. Visit www.ibm.com for more information.
Media Contacts
Willa Hahn, willa.hahn@ibm.com
Chase Skinner, Chase.Skinner@ibm.com
1 AI service of the IBM-supported catalog is delivered as one or a set of containers that can be deployed with a single deployment command. The provided UI for the catalog executes such commands in the backend based on a single click within the UI page of the respective AI service.
2 Based upon internal testing running 1M unit data set with prompt size 128, batch size 128 using 1-card container. Individual results may vary based on workload size, use of storage subsystems and other conditions.
View original content to download multimedia:https://www.prnewswire.com/news-releases/ibm-introduces-the-spyre-accelerator-for-commercial-availability-302576909.html
SOURCE IBM