The Hardcard
Member
- Joined
- Jun 1, 2024
- Posts
- 21
I am referring to Strix Halo and GB10/N1X.
EDIT: - accidentally hit the reply button too soon.
These are both large CPU/GPU combos with varying degrees of unified memory, essentially akin to Apple’s Max, except both only have 256-bit buses so about half the bandwidth depending on the memory speed.
This puts them at a significant disadvantage for LLMs, before we even get to linking Maxes to make an Ultra. Even wilder, GB10 and Halo were specced with generative AI in mind, whereas I don’t believe it was in Apple’s sights for the Max.
EDIT: - accidentally hit the reply button too soon.
These are both large CPU/GPU combos with varying degrees of unified memory, essentially akin to Apple’s Max, except both only have 256-bit buses so about half the bandwidth depending on the memory speed.
This puts them at a significant disadvantage for LLMs, before we even get to linking Maxes to make an Ultra. Even wilder, GB10 and Halo were specced with generative AI in mind, whereas I don’t believe it was in Apple’s sights for the Max.
Last edited: