Nvidia reportedly cancels quad-die Rubin Ultra GPU in favor of dual-GPU design, report claims — complex design purportedly scrapped over 'manufacturing execution concerns'

tomshardware.com 2026-06-30 Anton Shilov

Entities

Technologies:3nm EUV chiplet HBM4E HBM4 liquid cooling Kyber rack-scale system

Tags

Nvidia Rubin Ultra GPU design AI accelerator Chiplet packaging Manufacturing execution concerns HBM4E memory Data center GPU Liquid cooling Rack-scale computing Semiconductor manufacturing Artificial intelligence chips

News Summary

According to SemiAnalysis, Nvidia had originally planned to launch its Rubin Ultra AI accelerator in 2027 with a quad-chiplet design aimed at doubling performance. However, due to manufacturing execut... Read original →

Industry Analysis

Nvidia’s pivot from a quad-chiplet Rubin Ultra to a dual-GPU design isn’t merely a yield concession—it signals that 3nm chiplet integration has hit hard limits in both physics and supply chain execution. This delays HBM4E’s volume adoption, forcing memory suppliers like SK Hynix to recalibrate capacity plans. Liquid-cooled Kyber racks now serve as the performance backstop, marking a strategic shift from per-GPU FLOPS to system-level thermal orchestration in AI data centers. AMD stands to gain: its MI500 series could capture premium training workloads, especially as U.S. and EU policies favor locally resilient AI infrastructure with flexible chiplet ecosystems. Crucially, this reversal underscores that advanced packaging can’t indefinitely compensate for scaling bottlenecks. Over the next 18 months, the industry will pivot from ‘more chiplets’ to ‘smarter architectures,’ redirecting capex from wafer fabs toward cooling and power delivery systems.

Read Original Article →

This page displays AI-generated summaries and metadata for research purposes. Original content belongs to the respective publishers.