Comment by vessenes
12 hours ago
I'm thinking the next step would be to include this as a 'junior dev' and let Opus farm simple stuff out to it. It could be local, but if it's on Cerebras, it could also be realllly fast.
Cerebras already has GLM 4.7 in the code plans
Yep. But this is like 10x faster; 3B active parameters.
Cerebras is already at 200-800 tps. Do you need it even faster?