Comment by p1necone

4 days ago

I'm holding out for someone to ship a gpu with dimm slots on it.

DDR5 is well over an order of magnitude slower than really good VRAM. That's one big reason.

  • DDR5 is ~8GT/s, GDDR6 is ~16GT/s, GDDR7 is ~32GT/s. It's faster but the difference isn't crazy and if the premise was to have a lot of slots then you could also have a lot of channels. 16 channels of DDR5-8200 would have slightly more memory bandwidth than RTX 4090.

    • Yeah, so DDR5 is 8 GT/s and GDDR7 is 32 GT/s. Bus width is 64 bits vs 384. That already makes the VRAM 4 × 6 = 24 times faster.

      You can add more channels, sure, but each channel makes it less and less likely that the system will boot. Look at modern AM5 struggling to boot at over 6000 with more than two sticks.

      So you'd need an insane six channels just to match the bus width, at which point your only way to stay stable would be lowering the speed so much that you're back to the same order-of-magnitude difference, really.

      Now we could instead solder that RAM, move it closer to the GPU and cross-link channels to reduce noise. We could also increase the speed and oh, we just invented soldered-on GDDR…


  • But it would still be faster than splitting the model up across a cluster, right? I've also wondered why they haven't just shipped GPUs like CPUs.

    • Man I'd love to have a GPU socket. But it'd be pretty hard to get a standard going that everyone would support. Look at CPU sockets: we barely got crossover for like two generations.

      But boy, a standard GPU socket so you could easily BYO cooler would be nice.


  • For AI, really good VRAM isn't really a requirement. If a middle-ground memory module could be made, it'd be pretty appealing.
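The bandwidth arithmetic upthread can be sanity-checked with a quick sketch. Peak bandwidth is just per-pin transfer rate times bus width; the ~1008 GB/s figure for the RTX 4090 is its published spec, and the DDR5/GDDR7 rates are the nominal ones quoted above:

```python
def bandwidth_gbs(gt_per_s: float, bus_width_bits: int) -> float:
    """Peak bandwidth in GB/s: transfers per second times bus width in bytes."""
    return gt_per_s * bus_width_bits / 8

# One DDR5-8000 channel (64-bit bus) vs GDDR7 on a 384-bit bus:
ddr5_channel = bandwidth_gbs(8.0, 64)     # 64.0 GB/s
gddr7_384bit = bandwidth_gbs(32.0, 384)   # 1536.0 GB/s

print(gddr7_384bit / ddr5_channel)        # 24.0 — the "4 × 6" factor above

# 16 channels of DDR5-8200, as suggested upthread, vs the RTX 4090's ~1008 GB/s:
print(round(16 * bandwidth_gbs(8.2, 64), 1))  # 1049.6
```

So the 24× gap and the "16 channels slightly beats a 4090" claim both check out; the dispute in the thread is only whether that many unsoldered channels could ever run at full speed.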

Would that be worth anything, though? What about the overhead of the extra clock cycles needed for loading from and storing to RAM? It might not amount to a net performance benefit, and I bet it could also complicate heat management.