Comment by MountDoom
14 hours ago
I remember people making the exact same argument about asking LLMs math questions back when they couldn't figure out the answer to 18 times 7. "They are text token predictors, they don't understand numbers, can we put this nonsense to rest."
The whole point of LLMs is that they do more than we suspected they could. And there is value in making them capable of handling a wider selection of tasks. When an LLM started to count the number of "r"s in "strawberry", OpenAI was taking a victory lap.
They're better at maths now, but you still shouldn't ask them maths questions. Same with spelling: whether they improve or not doesn't matter if you want a specific, precise answer. It's the wrong tool, and the better it gets, the bigger the trap when it fails unexpectedly.
> When an LLM started to count the number of "r"s in "strawberry", OpenAI was taking a victory lap.
Were they? Or did they feel icky about spending way too much post-training time on such a specific and uninteresting skill?
It's not as specific of a skill as you would think. Being both aware of tokenizer limitations and capable of working around them is occasionally useful for real tasks.
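For a concrete sense of the limitation: the model reasons over subword token IDs, not characters, so a letter count is never directly visible to it. A minimal sketch (assuming the tiktoken library and the cl100k_base encoding, purely for illustration):

    import tiktoken

    # The model sees integer IDs for subword pieces, not individual characters.
    enc = tiktoken.get_encoding("cl100k_base")
    tokens = enc.encode("strawberry")
    print(tokens)                             # a short list of integer IDs
    print([enc.decode([t]) for t in tokens])  # subword pieces (exact split depends on the encoding)
    # Nothing in this representation directly exposes how many "r" characters there are.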
What tasks would those be, that wouldn't be better served by using e.g. a Python script as a tool, possibly just as a component of the complete solution?
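(For reference, the kind of thing meant by "a Python script as a tool" - a minimal, hypothetical sketch, not tied to any particular tool-calling framework: the model delegates exact counting to deterministic code instead of guessing from tokens.)

    # Hypothetical tool the model could call instead of counting characters itself.
    def count_letter(word: str, letter: str) -> int:
        """Return how many times `letter` occurs in `word`."""
        return word.lower().count(letter.lower())

    print(count_letter("strawberry", "r"))  # 3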