Comment by dalemhurley
6 days ago
This is a win for agents, speed and intelligence is crucial to the loop. If the time and token cost is small you can iterate many times to correct mistakes.
Got to wonder why Wall Street is dumping NVIDIA.
6 days ago
This is a win for agents, speed and intelligence is crucial to the loop. If the time and token cost is small you can iterate many times to correct mistakes.
Got to wonder why Wall Street is dumping NVIDIA.
I mean they are only running a small version of codex can they run the full one? Or the technology isn't there yet?
1000 tokens/sec for a highly specialised model is where we are going to see agents requiring.
Dedicated knowledge, fast output, rapid iteration.
I have been trying out SMOL models as coding models don't need to the full corpus of human history.
My most recent build was good but too small.
I am thinking of a model that is highly tuned to coding and agentic loops.