Comment by dmitrygr
13 hours ago
Memory bandwidth between cores matters for literally all workloads that are not single-core (read: all). And FP8 matters not at all, because inference on CPU is too slow to be of any use whatsoever in the days of proper accelerators.