Comment by brookst
1 day ago
That’s… not how thinking models work. They tend to be iterative and serial, not parallel and then pick-one.
1 day ago
> That’s… not how thinking models work. They tend to be iterative and serial, not parallel and then pick-one.
Parallel test-time compute is exactly what SOTA models do, including Claude 4 Opus extended, o3 Pro, Grok 4 Heavy, and Gemini 2.5 Pro.