Comment by th0ma5
1 day ago
Since when do people like the fuzziness of outputs? I think you make an interesting point but it also seems to imply that benchmarking will never truly be possible, which I think is true unless we can also make them observable which also as you say gives up the mystique.
There is also the possibility that LLM might help us understand language better and make this domain more rigorous. Only researchers can see if thats gonna be happening
A lot of tasks are fuzzy by nature, there are multiple valid results, multiple interpretation of the situation / context etc. We’re gonna discover new areas where computers will be useful finally