Comment by Moosdijk
1 day ago
I meant either going straight to the likeliest output (flash), or iteratively generating multiple outputs and choosing the best one (thinking/pro).
1 day ago
That's not how these models work.
Thinking models generate thinking tokens first, reasoning through the problem step by step, and then produce the answer based on that reasoning.
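To make the distinction concrete, here is a toy sketch. Everything in it is made up for illustration: `next_token_probs` is a hypothetical lookup table standing in for a real model, and the `<think>` / `</think>` markers are just placeholder tokens. It contrasts a single decoding pass that includes reasoning tokens with the "generate several outputs and pick the best" idea (usually called best-of-N), which needs a separate scoring function.

```python
import random

def next_token_probs(context):
    """Hypothetical toy model: next-token distribution for a given context."""
    table = {
        # Answering directly: the likeliest token happens to be wrong.
        ("Q",): {"5": 0.6, "4": 0.4},
        # Answering after reasoning tokens: the right answer becomes likely.
        ("Q", "<think>"): {"2+2": 1.0},
        ("Q", "<think>", "2+2"): {"</think>": 1.0},
        ("Q", "<think>", "2+2", "</think>"): {"4": 0.9, "5": 0.1},
    }
    # Any context not listed above simply ends the sequence.
    return table.get(tuple(context), {"<eos>": 1.0})

def greedy_decode(context, max_tokens=8):
    """One pass, always taking the likeliest next token."""
    context = list(context)
    output = []
    for _ in range(max_tokens):
        probs = next_token_probs(context)
        token = max(probs, key=probs.get)
        if token == "<eos>":
            break
        context.append(token)
        output.append(token)
    return output

def sample_decode(context, rng, max_tokens=8):
    """One pass, sampling each next token from the distribution."""
    context = list(context)
    output = []
    for _ in range(max_tokens):
        probs = next_token_probs(context)
        token = rng.choices(list(probs), weights=list(probs.values()))[0]
        if token == "<eos>":
            break
        context.append(token)
        output.append(token)
    return output

def best_of_n(context, n, score, seed=0):
    """Best-of-N: sample n full outputs, keep the one the scorer likes most."""
    rng = random.Random(seed)
    candidates = [sample_decode(context, rng) for _ in range(n)]
    return max(candidates, key=score)

# No thinking tokens: a single pass that answers immediately.
direct = greedy_decode(["Q"])

# Thinking: still a single pass with the same loop, but reasoning tokens
# come first and the answer is conditioned on them.
thinking = greedy_decode(["Q", "<think>"])

# Best-of-N: several independent outputs plus an external scorer.
picked = best_of_n(["Q"], n=20, score=lambda out: out == ["4"])
```

In this toy, `direct` is `["5"]` and `thinking` is `["2+2", "</think>", "4"]`. Both come from the same decoding loop run once; the only difference is that the thinking run spends tokens on reasoning before the answer. `best_of_n` is a different technique layered on top of sampling, and it only works if you have some way to score candidates.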