NVIDIA CEO Jensen Huang introduced the new Vera Rubin AI Platform during his keynote at CES 2026 in Las Vegas on January 5. This signals a move to releasing new AI chips each year after the launch of the Blackwell platform.  

Highlights from the Vera Rubin launch 

  • The platform is named after astronomer Vera Rubin and is built to meet the growing computing needs of next-generation agentic AI.  
  • The Rubin platform, featuring the Rubin GPU, is designed for high efficiency. It offers 5x better AI inference performance (50 petaflops of NVFP4) and trains models 3.5 times faster than the earlier Blackwell architecture.  
  • The platform includes 6 main components:  
  • Rubin GPU  
  • Vera CPU with 88 ARM cores  
  • NVLink 6 switch  
  • ConnectX-9 SuperNIC  
  • Bluefield 4 DPU  
  • Spectrum 6 Ethernet switch  
  • Huang said Rubin is already in full production, and shipments to major partners are expected in the second half of 2026.  
  • The Rubin platform has been designed to lower the cost of generating AI tokens by up to 10x compared to previous architectures.  

Background: After Blackwell Ultra 

NVIDIA introduced the Vera Rubin platform as it continues to release new products every year. For 2026, the main high-performance product is the Blackwell Ultra GB300.  

  • Blackwell Ultra, which has 128GB of GDDR7 memory and 3x faster attention capabilities, will be widely deployed in early 2026 to support advanced long-context AI.  
  • Blackwell Ultra will be the main product for 2026, but the Rubin platform was announced at CES to prepare for an even more powerful AI platform coming in late 2026.  

Major partners for the Rubin platform include Microsoft, AWS, Google Cloud, Meta, and XAI. The systems will be integrated into Dell and HPE servers.  

Six-Chip Architecture Design 

The Rubin architecture is a six-chip system that combines advanced components for improved performance and efficiency. NVIDIA says the Rubin platform uses extreme codesign across the six chips to slash training time and inference token costs.  

Component Specification 
CPU  NVIDIA VERA CPU, 88 cores, designed for agent reasoning.  
GPU  Nvidia Rubin GPU (2 units per system)  
Networking  NVIDIA NVLink 6 switch.  
SuperNIC  NVIDIA ConnectX9 SuperNIC  
DPU  Nvidia BlueField 4 DPU.  
Ethernet switch  NVIDIA Spectrum 6 Ethernet Switch  

Performance Breakthrough Over Blackwell 

NVIDIA’s own tests show that Vera Rubin performs much better than the current Blackwell generation, establishing new standards for AI processing.  

Performance metric Rubin vs Blackwell Improvement. 
AI Training Performance  3.5x faster  
 
AI inference performance.  5x faster  
Peak performance.  50 petaflops  
Inference Compute Efficiency  8x more per watt.  
Operational cost.  Lower cost per result using fewer components.  

The improved performance meets the evolving needs of AI systems, especially for networks that process large data sets in multiple steps. During his talk, Huang said, “Vera Rubin is intended to address the basic challenge that we have: the amount of computation necessary for AI is skyrocketing.”  

Product Status and Market Development 

The Rubin Architecture is now in full production, having finished its testing phase. Huang told the CES audience, “Today I can tell you that Vera Rubin is in full production, and said more expansion is planned for later this year.”  

Deployment information. Details. 
Production status  Full production active.  
Timeline  Second Half 2025 Expansion  
Early customers.  Amazon Web Services Anthropic OpenAI.  
Supercomputer Integration  HPE’s Blue Lion Doudna at Lawrence Berkeley National Lab.  
Product Availability  DJX SuperPod systems and modular components.  

Cutting-Edge Storage and Infrastructure Solutions 

New architecture brings major upgrades to storage and connectivity thanks to upgraded Bluefield and NVLink systems. Dion Harris, NVIDIA’s Senior Director of AI Infrastructure Solutions, explained the importance of these changes: “As you start to enable new types of workflows like agent-based AI or long-term tasks that put a lot of stress and requirements on your KV cache,”  

The Vera CPU is designed for agent-based thinking tasks, and the two Rubin GPUs offer powerful parallel processing for demanding AI workloads.  

Strategic Market Impact 

NVIDIA’s rapid development has made it the world’s most valuable company, and the Rubin architecture is poised to further extend its lead in the AI market. Instead of upgrading just one part, the new 6-chip design offers a complete solution for next-generation AI across robotics, healthcare, and heavy industry.  

Leading cloud providers and research groups are already planning to adopt Rubin technology, demonstrating strong confidence in its capabilities and NVIDIA’s ongoing leadership as AI technology evolves.

Source: Nvidia Launches Vera Rubin Architecture at CES 2026 with Major Performance Gains 

Amazon

Leave a Reply

Your email address will not be published. Required fields are marked *