Comment by dalemhurley

6 days ago

This is a win for agents, speed and intelligence is crucial to the loop. If the time and token cost is small you can iterate many times to correct mistakes.

Got to wonder why Wall Street is dumping NVIDIA.

2 comments

dalemhurley

SamDc73 6 days ago

I mean they are only running a small version of codex can they run the full one? Or the technology isn't there yet?

dalemhurley 19 hours ago

1000 tokens/sec for a highly specialised model is where we are going to see agents requiring.
Dedicated knowledge, fast output, rapid iteration.
I have been trying out SMOL models as coding models don't need to the full corpus of human history.
My most recent build was good but too small.
I am thinking of a model that is highly tuned to coding and agentic loops.