Comment by red75prime
4 hours ago
> basically how well a universal function approximator can fit to a function we define
That's what you've got wrong. We don't define functions that an LLM approximates. Autoregressive pretraining approximates an unknown function that produces text (that is what the brain does). RL doesn't approximate functions, it optimizes objective by finding an unknown function that performs better.
No comments yet
Contribute on Hacker News ↗