Comment by kristianp

1 day ago

You're only going to get an incremental improvement with an M5 Pro mini compared to an M4 Pro mini. Memory bandwidth goes from 273 GB/s to 307 GB/s, about a 12.5% improvement for LLMs.
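For anyone who wants to check the arithmetic, here is a rough sketch. It assumes token generation is purely memory-bandwidth bound, so every generated token streams all the weights once. The 20 GB model size and the function names are made up for illustration, and real throughput will land below these ceilings.

```python
# Back-of-envelope estimate; the bandwidth-bound model is a simplifying assumption.

def bandwidth_speedup(old_gbps: float, new_gbps: float) -> float:
    """Relative improvement in memory bandwidth (0.10 means 10%)."""
    return new_gbps / old_gbps - 1.0


def decode_ceiling_tokens_per_sec(bandwidth_gbps: float, model_gb: float) -> float:
    """Upper bound on tokens/s if each token reads all weights from memory once."""
    return bandwidth_gbps / model_gb


M4_PRO_GBPS = 273.0  # figure quoted in the comment
M5_PRO_GBPS = 307.0  # figure quoted in the comment
MODEL_GB = 20.0      # hypothetical quantized model size, not from the thread

speedup = bandwidth_speedup(M4_PRO_GBPS, M5_PRO_GBPS)
m4_ceiling = decode_ceiling_tokens_per_sec(M4_PRO_GBPS, MODEL_GB)
m5_ceiling = decode_ceiling_tokens_per_sec(M5_PRO_GBPS, MODEL_GB)

print(f"bandwidth improvement: {speedup:.1%}")
print(f"decode ceiling: {m4_ceiling:.2f} -> {m5_ceiling:.2f} tokens/s")
```

The ratio between the two ceilings is the same as the bandwidth ratio no matter what model size you plug in, which is why the bandwidth bump translates more or less directly into the expected token generation gain.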

M5s have the neural accelerator, which boosts prefill speed a lot. But it's true that token generation itself won't change that much.