STOCK TITAN

NVIDIA Unveils Rubin CPX: A New Class of GPU Designed for Massive-Context Inference

Rhea-AI Impact
(Neutral)
Rhea-AI Sentiment
(Positive)
Tags

NVIDIA (NASDAQ:NVDA) has unveiled the Rubin CPX, a revolutionary GPU specifically designed for massive-context AI processing. The new GPU is integrated into the Vera Rubin NVL144 CPX platform, delivering 8 exaflops of AI compute power and 100TB of fast memory in a single rack.

The Rubin CPX offers 30 petaflops of compute power with NVFP4 precision and features 128GB of GDDR7 memory. It provides 3x faster attention capabilities compared to previous GB300 NVL72 systems. The platform enables unprecedented monetization potential, generating $5B in token revenue for every $100M invested.

Leading AI companies including Cursor, Runway, and Magic are already exploring Rubin CPX's capabilities for software development and video processing applications. The platform will be available by the end of 2026.

NVIDIA (NASDAQ:NVDA) ha presentato la Rubin CPX, una GPU rivoluzionaria pensata per l'elaborazione AI su contesti di grande dimensione. Questa GPU è integrata nella piattaforma Vera Rubin NVL144 CPX, offrendo 8 exaflops di potenza di calcolo AI e 100TB di memoria ad accesso rapido in un singolo rack.

La Rubin CPX fornisce 30 petaflops di potenza con precisione NVFP4 e dispone di 128GB di memoria GDDR7. Garantisce capacità di attenzione 3x più veloci rispetto ai precedenti sistemi GB300 NVL72. La piattaforma apre nuove opportunità di monetizzazione, generando $5B di ricavi da token per ogni $100M investiti.

Aziende leader nel settore AI come Cursor, Runway e Magic stanno già testando le potenzialità della Rubin CPX per lo sviluppo software e l'elaborazione video. La piattaforma sarà disponibile entro la fine del 2026.

NVIDIA (NASDAQ:NVDA) ha presentado la Rubin CPX, una GPU revolucionaria diseñada para el procesamiento de IA con contextos masivos. Esta GPU está integrada en la plataforma Vera Rubin NVL144 CPX, que ofrece 8 exaflops de potencia de cómputo para IA y 100TB de memoria rápida en un solo rack.

La Rubin CPX proporciona 30 petaflops de cómputo con precisión NVFP4 y cuenta con 128GB de memoria GDDR7. Ofrece capacidades de atención 3x más rápidas que los anteriores sistemas GB300 NVL72. La plataforma permite un potencial de monetización sin precedentes, generando $5B en ingresos por tokens por cada $100M invertidos.

Compañías líderes en IA como Cursor, Runway y Magic ya están explorando las capacidades de Rubin CPX para desarrollo de software y procesamiento de vídeo. La plataforma estará disponible para finales de 2026.

NVIDIA (NASDAQ:NVDA)가 대규모 컨텍스트 AI 처리를 위해 설계한 혁신적인 GPU인 Rubin CPX를 공개했습니다. 이 GPU는 Vera Rubin NVL144 CPX 플랫폼에 통합되어 단일 랙에서 8 exaflops의 AI 연산 성능과 100TB의 고속 메모리를 제공합니다.

Rubin CPX는 NVFP4 정밀도로 30 petaflops의 연산 성능을 제공하며 128GB의 GDDR7 메모리를 탑재했습니다. 이전의 GB300 NVL72 시스템보다 주의(attention) 처리 속도가 3배 빠릅니다. 이 플랫폼은 전례 없는 수익화 가능성을 열어, $100M 투자당 $5B의 토큰 수익을 창출할 수 있습니다.

Cursor, Runway, Magic 등 주요 AI 기업들이 이미 Rubin CPX의 기능을 소프트웨어 개발 및 영상 처리에 적용하는 방안을 검토하고 있습니다. 플랫폼은 2026년 말까지 제공될 예정입니다.

NVIDIA (NASDAQ:NVDA) a dévoilé la Rubin CPX, un GPU révolutionnaire conçu pour le traitement IA sur de très grands contextes. Ce GPU est intégré à la plateforme Vera Rubin NVL144 CPX, offrant 8 exaflops de puissance de calcul IA et 100TB de mémoire rapide dans un seul rack.

La Rubin CPX fournit 30 petaflops de calcul en précision NVFP4 et embarque 128GB de mémoire GDDR7. Elle propose des capacités d'attention 3x plus rapides que les systèmes GB300 NVL72 précédents. La plateforme ouvre un potentiel de monétisation inédit, générant 5 milliards $ de revenus token pour chaque 100 millions $ investis.

Des acteurs majeurs de l'IA, comme Cursor, Runway et Magic, explorent déjà les possibilités de la Rubin CPX pour le développement logiciel et le traitement vidéo. La plateforme sera disponible d'ici la fin 2026.

NVIDIA (NASDAQ:NVDA) hat die Rubin CPX vorgestellt, eine wegweisende GPU, die speziell für KI-Verarbeitung mit sehr großem Kontext entwickelt wurde. Die GPU ist in die Vera Rubin NVL144 CPX-Plattform integriert und liefert 8 Exaflops KI-Rechenleistung sowie 100TB schnellen Speicher in einem einzigen Rack.

Die Rubin CPX bietet 30 Petaflops Rechenleistung mit NVFP4-Präzision und verfügt über 128GB GDDR7-Speicher. Sie ermöglicht eine 3x schnellere Attention-Leistung im Vergleich zu den bisherigen GB300 NVL72-Systemen. Die Plattform eröffnet beispiellose Monetarisierungsmöglichkeiten und generiert $5B Token-Einnahmen für je $100M investiert.

Führende KI-Unternehmen wie Cursor, Runway und Magic prüfen bereits den Einsatz der Rubin CPX für Softwareentwicklung und Videoverarbeitung. Die Plattform wird bis Ende 2026 verfügbar sein.

Positive
  • Revolutionary 8 exaflops of AI compute power, 7.5x more than previous systems
  • Exceptional monetization potential with $5B token revenue per $100M investment
  • 3x faster attention capabilities compared to previous systems
  • Significant industry adoption with partnerships from Cursor, Runway, and Magic
  • Integration with complete NVIDIA AI stack and extensive developer ecosystem
Negative
  • Long time to market - not available until end of 2026
  • High initial investment requirements for implementation
  • Requires significant infrastructure upgrades for full capability utilization

Insights

NVIDIA's Rubin CPX represents a breakthrough in AI processing for long-context applications, positioning them to dominate the next frontier of AI infrastructure.

NVIDIA has strategically positioned itself at the forefront of the next wave of AI computing with the announcement of Rubin CPX, a purpose-built GPU designed specifically for massive-context processing. This isn't merely an incremental improvement—it's a fundamental architectural shift that creates an entirely new product category optimized for handling million-token workloads.

The technical specifications are nothing short of extraordinary. The Vera Rubin NVL144 CPX platform delivers 8 exaflops of AI compute—7.5x more than their GB300 NVL72 systems—with 100TB of fast memory and 1.7 petabytes per second of memory bandwidth in a single rack. For context, this level of performance was unimaginable in commercial systems just a few years ago.

Most critically, NVIDIA has directly tied this technological advancement to revenue potential, claiming $5 billion in token revenue for every $100 million invested—a 50x return. This positions Rubin CPX as essential infrastructure for any company processing long-context data, particularly in the burgeoning fields of AI coding assistants and generative video.

The partnerships with cutting-edge AI companies like Cursor, Runway, and Magic demonstrate immediate real-world applications. Cursor will leverage the technology for massive-scale code understanding, while Runway and Magic will use it for long-context video generation and autonomous coding agents, respectively. These applications represent massive markets that current hardware simply can't adequately address.

While availability isn't expected until late 2026, this announcement serves as a powerful market signal that NVIDIA intends to maintain its dominance in AI infrastructure through continued innovation rather than resting on current products. The company is effectively future-proofing its business by creating purpose-built hardware for the next generation of AI applications requiring massive context windows.

News Summary:

  • The NVIDIA Rubin CPX GPU is purpose-built to handle million-token coding and generative video applications.
  • The NVIDIA Vera Rubin NVL144 CPX platform packs 8 exaflops of AI performance and 100TB of fast memory in a single rack.
  • Companies can monetize at an unprecedented scale, with $5B in token revenue for every $100M invested.
  • AI innovators like Cursor, Runway and Magic are exploring how Rubin CPX can accelerate their applications.

SANTA CLARA, Calif., Sept. 09, 2025 (GLOBE NEWSWIRE) -- AI Infra Summit -- NVIDIA® today announced NVIDIA Rubin CPX, a new class of GPU purpose-built for massive-context processing. This enables AI systems to handle million-token software coding and generative video with groundbreaking speed and efficiency.

Rubin CPX works hand in hand with NVIDIA Vera CPUs and Rubin GPUs inside the new NVIDIA Vera Rubin NVL144 CPX platform. This integrated NVIDIA MGX system packs 8 exaflops of AI compute to provide 7.5x more AI performance than NVIDIA GB300 NVL72 systems, as well as 100TB of fast memory and 1.7 petabytes per second of memory bandwidth in a single rack. A dedicated Rubin CPX compute tray will also be offered for customers looking to reuse existing Vera Rubin 144 systems.

“The Vera Rubin platform will mark another leap in the frontier of AI computing — introducing both the next-generation Rubin GPU and a new category of processors called CPX,” said Jensen Huang, founder and CEO of NVIDIA. “Just as RTX revolutionized graphics and physical AI, Rubin CPX is the first CUDA GPU purpose-built for massive-context AI, where models reason across millions of tokens of knowledge at once.”

NVIDIA Rubin CPX enables the highest performance and token revenue for long-context processing — far beyond what today’s systems were designed to handle. This transforms AI coding assistants from simple code-generation tools into sophisticated systems that can comprehend and optimize large-scale software projects.

To process video, AI models can take up to 1 million tokens for an hour of content, pushing the limits of traditional GPU compute. Rubin CPX integrates video decoder and encoders, as well as long-context inference processing, in a single chip for unprecedented capabilities in long-format applications such as video search and high-quality generative video.

Built on the NVIDIA Rubin architecture, the Rubin CPX GPU uses a cost‑efficient, monolithic die design packed with powerful NVFP4 computing resources and is optimized to deliver extremely high performance and energy efficiency for AI inference tasks.

Advancements Offered by Rubin CPX
Rubin CPX delivers up to 30 petaflops of compute with NVFP4 precision for the highest performance and accuracy. It features 128GB of cost-efficient GDDR7 memory to accelerate the most demanding context-based workloads. In addition, it delivers 3x faster attention capabilities compared with NVIDIA GB300 NVL72 systems — boosting an AI model’s ability to process longer context sequences without a drop in speed.

Rubin CPX is offered in multiple configurations, including the Vera Rubin NVL144 CPX, that can be combined with the NVIDIA Quantum‑X800 InfiniBand scale-out compute fabric or the NVIDIA Spectrum-X™ Ethernet networking platform with NVIDIA Spectrum-XGS Ethernet technology and NVIDIA ConnectX®-9 SuperNICs™. Vera Rubin NVL144 CPX enables companies to monetize at an unprecedented scale, with $5 billion in token revenue for every $100 million invested.

Industry Leaders Look to Rubin CPX
AI innovators are exploring how Rubin CPX can accelerate their applications, ranging from large-scale software development to the analysis of dynamic visual content to better understand moving images.

Cursor, an AI-powered software company that offers an advanced code editor, sees the benefits of Rubin CPX to boost developer productivity with intelligent code generation and collaborative tools directly in the coding environment.

“With NVIDIA Rubin CPX, Cursor will be able to deliver lightning-fast code generation and developer insights, transforming software creation,” said Michael Truell, CEO of Cursor. “This will unlock new levels of productivity and empower users to ship ideas once out of reach.”

Runway, an American generative AI company, will use NVIDIA technologies to enable creators to produce cinematic content and sophisticated visual effects with unmatched scale and efficiency.

“Video generation is rapidly advancing toward longer context and more flexible, agent-driven creative workflows,” said Cristóbal Valenzuela, CEO of Runway. “We see Rubin CPX as a major leap in performance, supporting these demanding workloads to build more general, intelligent creative tools. This means creators — from independent artists to major studios — can gain unprecedented speed, realism and control in their work.”

Magic is an AI research and product company developing foundation models to power AI agents that can automate software engineering.

“With a 100-million-token context window, our models can see a codebase, years of interaction history, documentation and libraries in context without fine-tuning,” said Eric Steinberger, CEO of Magic. “This enables users to coach the agent at test time through conversation and access to their environments, bringing us closer to autonomous agentic experiences. Using a GPU like NVIDIA Rubin CPX greatly accelerates our compute workloads.”

Software Support
NVIDIA Rubin CPX will be supported by the complete NVIDIA AI stack — from accelerated infrastructure to enterprise‑ready software. The NVIDIA Dynamo platform efficiently scales AI inference, dramatically boosting throughput while cutting response times and model serving costs.

The processors will be able to run the latest in the NVIDIA Nemotron™ family of multimodal models that provide state-of-the-art reasoning for enterprise-ready AI agents. For production-grade AI, Nemotron models can be delivered with NVIDIA AI Enterprise, a software platform that includes NVIDIA NIM™ microservices as well as AI frameworks, libraries and tools that enterprises can deploy on NVIDIA-accelerated clouds, data centers and workstations.

Built on decades of innovation, the Rubin platform extends NVIDIA’s developer ecosystem — with NVIDIA CUDA‑X™ libraries, a community of over 6 million developers and nearly 6,000 CUDA applications.

Availability
NVIDIA Rubin CPX is expected to be available at the end of 2026.

Learn more by watching NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck’s keynote at AI Infra Summit on Sept. 9 at 10am PT.

About NVIDIA
NVIDIA (NASDAQ: NVDA) is the world leader in accelerated computing.

For further information, contact:
Kristin Uchiyama
NVIDIA Corporation
+1-408-313-0448
kuchiyama@nvidia.com

Certain statements in this press release including, but not limited to, statements as to: Vera Rubin systems continuing to deliver extraordinary performance and efficiency; with Rubin CPX, building a GPU uniquely suited for million-token context processing, cutting the cost of inference and unlocking advanced capabilities for developers and creators everywhere; the benefits, impact, performance, and availability of NVIDIA’s products, services, and technologies; expectations with respect to NVIDIA’s third party arrangements, including with its collaborators and partners; expectations with respect to technology developments; and other statements that are not historical facts are forward-looking statements within the meaning of Section 27A of the Securities Act of 1933, as amended, and Section 21E of the Securities Exchange Act of 1934, as amended, which are subject to the “safe harbor” created by those sections based on management’s beliefs and assumptions and on information currently available to management and are subject to risks and uncertainties that could cause results to be materially different than expectations. Important factors that could cause actual results to differ materially include: global economic and political conditions; NVIDIA’s reliance on third parties to manufacture, assemble, package and test NVIDIA’s products; the impact of technological development and competition; development of new products and technologies or enhancements to NVIDIA’s existing product and technologies; market acceptance of NVIDIA’s products or NVIDIA’s partners’ products; design, manufacturing or software defects; changes in consumer preferences or demands; changes in industry standards and interfaces; unexpected loss of performance of NVIDIA’s products or technologies when integrated into systems; and changes in applicable laws and regulations, as well as other factors detailed from time to time in the most recent reports NVIDIA files with the Securities and Exchange Commission, or SEC, including, but not limited to, its annual report on Form 10-K and quarterly reports on Form 10-Q. Copies of reports filed with the SEC are posted on the company’s website and are available from NVIDIA without charge. These forward-looking statements are not guarantees of future performance and speak only as of the date hereof, and, except as required by law, NVIDIA disclaims any obligation to update these forward-looking statements to reflect future events or circumstances.

Many of the products and features described herein remain in various stages and will be offered on a when-and-if-available basis. The statements above are not intended to be, and should not be interpreted as a commitment, promise, or legal obligation, and the development, release, and timing of any features or functionalities described for our products is subject to change and remains at the sole discretion of NVIDIA. NVIDIA will have no liability for failure to deliver or delay in the delivery of any of the products, features or functions set forth herein.

© 2025 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo and all other NVIDIA trademarks mentioned herein are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. Other company and product names may be trademarks of the respective companies with which they are associated. Features, pricing, availability and specifications are subject to change without notice.

A photo accompanying this announcement is available at https://www.globenewswire.com/NewsRoom/AttachmentNg/3266451c-18af-4394-8290-db8d9ae220b4


FAQ

What are the key features of NVIDIA's new Rubin CPX GPU?

The Rubin CPX GPU delivers 30 petaflops of compute power, 128GB of GDDR7 memory, and enables processing of million-token software coding and generative video. It provides 3x faster attention capabilities compared to previous systems.

How much performance improvement does the Vera Rubin NVL144 CPX platform offer?

The platform delivers 8 exaflops of AI compute power, providing 7.5x more AI performance than NVIDIA GB300 NVL72 systems, with 100TB of fast memory and 1.7 petabytes per second of memory bandwidth in a single rack.

What is the revenue potential for NVIDIA's Rubin CPX platform?

The platform enables companies to generate $5 billion in token revenue for every $100 million invested, representing significant monetization potential.

When will NVIDIA's Rubin CPX be available?

NVIDIA Rubin CPX is expected to be available at the end of 2026.

Which companies are already working with NVIDIA's Rubin CPX?

Leading AI companies including Cursor (for code generation), Runway (for video generation), and Magic (for software engineering automation) are exploring Rubin CPX's capabilities.
Nvidia Corporation

NASDAQ:NVDA

NVDA Rankings

NVDA Latest News

NVDA Latest SEC Filings

NVDA Stock Data

4.10T
23.24B
4.32%
68.69%
0.79%
Semiconductors
Semiconductors & Related Devices
Link
United States
SANTA CLARA