Comment by dmitrygr
14 hours ago
mem bw between cores matters for .... literally all workloads that are not single-core (read: all). And FP8 matters not at all cause inference on cpu is too slow to be of any use whatsoever in the days of proper accelerators
No comments yet
Contribute on Hacker News ↗