Comment by Moosdijk

1 day ago

I meant going to the likeliest output (flash) or (iteratively) generating multiple outputs and (iteratively) choosing the best one (thinking/pro).

That's not how these models work.

Thinking models first produce thinking tokens to reason their way to the answer, then write the answer itself.
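To make the distinction concrete, here is a toy sketch of what "thinking tokens" means. Everything in it is made up for illustration: `toy_next_token` is a hard-coded stand-in for a real model, and the `<think>`/`</think>` delimiters are an assumption (some open models use tags like these, others don't). It only shows the shape of the process: one generation pass in which the reasoning tokens come out first and the answer tokens follow, with no step that generates several candidates and picks one.

```python
from typing import Callable, List, Tuple

# Hypothetical, hard-coded "model output". A real model would sample each
# token from a learned distribution conditioned on the tokens so far.
SCRIPT = ["<think>", "17", "+", "25", "=", "42", "</think>", "42", "<eos>"]


def toy_next_token(tokens: List[str]) -> str:
    """Stand-in for a model's next-token step (assumption, not a real API)."""
    return SCRIPT[len(tokens)]


def generate(next_token: Callable[[List[str]], str], max_tokens: int = 32) -> List[str]:
    """Single autoregressive pass: append one token at a time until <eos>."""
    tokens: List[str] = []
    while len(tokens) < max_tokens:
        tok = next_token(tokens)
        if tok == "<eos>":
            break
        tokens.append(tok)
    return tokens


def split_thinking(tokens: List[str]) -> Tuple[List[str], List[str]]:
    """Separate the reasoning tokens from the final answer tokens."""
    end = tokens.index("</think>")
    thinking = tokens[1:end]      # everything between <think> and </think>
    answer = tokens[end + 1:]     # everything after </think>
    return thinking, answer


tokens = generate(toy_next_token)
thinking, answer = split_thinking(tokens)
print("thinking:", " ".join(thinking))
print("answer:  ", " ".join(answer))
```

The point is that the thinking and the answer are part of the same token stream. The answer is conditioned on the reasoning that precedes it, not chosen from a pool of finished outputs.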