Comment by kgeist
2 months ago
Qwen3 32b with reasoning (which I run locally) gives the correct answer. A pretty good model for its size.
Pretty sure GPT5 with reasoning should be able to solve it, too. I guess the real problem here is that GPT5's router doesn't understand that it's a problem which requires reasoning.
Can it count all 3 letter words in a paragraph? Can it average number of words inside parentheses in a paragraph?
I asked Qwen3 32b (with reasoning) to count all 3 letter words in your post. Its answer was:
Sounds about right.
Then I took this answer and asked to count the average number of words inside parentheses. The answer was:
Correct again. Everything on the first try.