← Back to context

Comment by butvacuum

1 day ago

Also, isn't memory a bit moot? At scale I thought that the ASICs frequently sat idle waiting for memory.

You're doing operations on the memory once it's been transferred to gpu memory. Either shuffling it around various caches or processors or feeding it into tensor cores or other matrix operations. You don't want to be sitting idle.