← Back to context Comment by turblety 1 day ago Are you running the full 65GB model on a MacBook Pro? What tokens per second do you get? What specs? M5? 4 comments turblety Reply spullara 19 hours ago I am running the full model on an 128GB M3 Max. jonaustin 1 day ago On an m4 pro 128gb: 75 t/s.Caveat: That's just for the first prompt. iAMkenough 1 day ago If they're running 120B on a M5 (32GB max of memory today), I'd like to know how. thaw13579 1 day ago Probably an M4 which has up to 128GB currently
iAMkenough 1 day ago If they're running 120B on a M5 (32GB max of memory today), I'd like to know how. thaw13579 1 day ago Probably an M4 which has up to 128GB currently
I am running the full model on an 128GB M3 Max.
On an m4 pro 128gb: 75 t/s.
Caveat: That's just for the first prompt.
If they're running 120B on a M5 (32GB max of memory today), I'd like to know how.
Probably an M4 which has up to 128GB currently