
Comment by arjie

15 hours ago

I was looking into this for LLMs, but it's clearly a graphics-focused card. The memory bandwidth is too low for that much RAM to be useful in an LLM context. The 5090 I have has the same amount of RAM but far more bandwidth, which makes it much more useful.

Oh wow, I really would've expected higher memory bandwidth. That's only ~2-3x the little DGX Spark-alike I have to play with.
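The bandwidth point above can be made concrete: single-batch LLM decoding is memory-bound, since every generated token has to stream the full set of weights from VRAM. A rough ceiling is bandwidth divided by model size. The sketch below uses the RTX 5090's published ~1.8 TB/s figure; the other bandwidth and the model size are illustrative assumptions, not specs of the card being discussed.

```python
def decode_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper-bound tokens/s for memory-bound, batch-1 decoding:
    each token requires reading all weights once from VRAM."""
    return bandwidth_gb_s / model_size_gb

# Assumed workload: a ~32B-parameter model quantized to ~4 bits (~17 GB).
model_gb = 17.0

cards = [
    ("hypothetical ~270 GB/s card", 270.0),     # assumed, DGX Spark-class
    ("RTX 5090 (~1792 GB/s, spec)", 1792.0),    # published spec
]
for name, bw in cards:
    print(f"{name}: ~{decode_tokens_per_sec(bw, model_gb):.0f} tok/s ceiling")
```

Real throughput lands below this ceiling (KV-cache reads, kernel overhead), but the ratio explains why the same 32 GB is worth much more on a high-bandwidth card.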

> it's clearly a graphics-processing focused card.

Yes, that's what the G in GPU stands for. It's great to see that there are still manufacturers that understand this.

It’s 32 GB for people who can’t go for scalped 5090s but have a 3090 budget.

I have a pair of them with a 9480 and the only thing I have to do is keep the cache happy.

  • Eh. Trading CUDA for 8 more gigs seems like a bad deal, unless you know for certain that what you want to run will run on it.

    • Until NVidia prices get better, I’ll build out with the Intel stack and keep the cache (and prompt processing speeds) happy.

      As for software, anything that has a SYCL or Vulkan backend, and/or can be Intel-optimized (especially to the same degree as llama.cpp) can run well.
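      For reference, llama.cpp's SYCL backend is enabled at build time with a CMake flag. The sketch below follows the steps in llama.cpp's SYCL backend documentation, assuming the Intel oneAPI toolkit is installed at its default path; verify flags against the current repo docs before relying on them.

      ```shell
      # Load the oneAPI environment (icx/icpx compilers, SYCL runtime) -
      # path assumes the default oneAPI install location.
      source /opt/intel/oneapi/setvars.sh

      # Configure llama.cpp with the SYCL backend and Intel compilers,
      # then build in Release mode.
      cmake -B build -DGGML_SYCL=ON \
            -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx
      cmake --build build --config Release
      ```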