➀ NVIDIA introduces the Rubin CPX GPU, a specialized accelerator for AI inference's context phase using cost-effective GDDR7 memory;
➁ GDDR7 reduces power and cost by 50% compared to HBM3E/HBM4, avoiding CoWoS packaging and enabling distributed AI processing;
➂ The Vera Rubin NVL144 CPX system integrates CPX GPUs with 8 ExaFLOPS performance and Dynamo orchestration for automatic workload optimization.