<p>➀ NVIDIA launched the Rubin CPX GPU, a specialized accelerator for massive-context AI models, delivering 30 PetaFLOPS of NVFP4 performance and 128 GB of GDDR7 memory on a monolithic die;</p><p>➁ The GPU is optimized for disaggregated inference, separating compute-bound context phases and memory bandwidth-bound generation phases to enhance throughput, reduce latency, and improve resource utilization;</p><p>➂ Integrated with NVIDIA Vera CPUs and Rubin GPUs in the Vera Rubin NVL144 CPX platform, it provides 8 exaflops of AI compute, 7.5x faster than previous systems, and scales to 100TB of memory and 1.7PB/s memory bandwidth per rack.</p>
Related Articles
- Next-Gen GPU Platform Redefines AI Performanceabout 1 month ago
- NVIDIA GeForce RTX 5090 and the Age of Neural Rendering at Hot Chips 2025about 2 months ago
- Nvidia introduces compact Blackwell professional graphics cards — RTX Pro 4000 SFF and Pro 2000 GPUs arrive at SIGGRAPH 20252 months ago
- NVIDIA Starts to Tackle GPU Power Smoothing with the NVIDIA GB300 NVL723 months ago
- AI Gigafactory Bollox4 months ago
- Onward and upward for Nvidia5 months ago
- AMD's New Sense of Urgency: MI450X, Chance to Beat NVIDIA, and NVIDIA's New Moat6 months ago
- Next-Gen Ti Graphics Cards6 months ago
- Prototype of a Particularly Sustainable and Energy-Autonomous E-Bike Terminal Developed at HKA6 months ago
- Nvidia writes off $5.5 billion in GPUs as US gov't chokes off supply of H20s to China6 months ago