According to the linked article below, NVIDIA's upcoming ARM-based Grace server chip is now shipping, and comes in two different configurations: "a Grace Superchip module with two Grace CPUs and a Grace+Hopper Superchip with one Grace CPU connected to a Hopper H100 GPU".
The article goes on to say the latter config features a unified memory architecture; NVIDA's own announcement calls it a "CPU+GPU coherent memory model". The unified RAM is 240 GB LPDDR5x, and provides 500 GB/s bandwidth to both the CPU and GPU. In addition, the GPU has 96 GB of dedicated HBM RAM, with a 4000 GB/s bandwidth (see @dada_dave's post immediately below, including the linked slides):
From a big-picture viewpoint, I'm curious where this is qualitatively similar to Apple's approach, and where it differs (other than that the GPU has its own dedicated memory in addition to the memory it shares with the CPU).
wccftech.com
The article goes on to say the latter config features a unified memory architecture; NVIDA's own announcement calls it a "CPU+GPU coherent memory model". The unified RAM is 240 GB LPDDR5x, and provides 500 GB/s bandwidth to both the CPU and GPU. In addition, the GPU has 96 GB of dedicated HBM RAM, with a 4000 GB/s bandwidth (see @dada_dave's post immediately below, including the linked slides):
From a big-picture viewpoint, I'm curious where this is qualitatively similar to Apple's approach, and where it differs (other than that the GPU has its own dedicated memory in addition to the memory it shares with the CPU).

NVIDIA Grace CPU Delivers Up To 30% Higher Performance At 70% Better Efficiency Versus Latest x86 Data Center Chips
NVIDIA has announced sampling of its Grace CPU Superchip which will be delivering some major performance efficiency gains versus x86 chips.

Last edited: