Comment by andrewla
2 months ago
> OpenAI definitely tarnished the name of GPT-5 by allowing these issues to occur
For a certain class of customer, maybe that is true.
But the fact that this occurs is very encouraging: they are not micro-optimizing to solve cosmetic problems that serve no functional purpose. Instead, they are letting these phenomena serve as external benchmarks of a sort, to evaluate how well the LLM handles tasks that are outside its training data and beyond what one would expect its capabilities to be.