Comment by acjohnson55
2 years ago
My understanding is that LLMs are basically approximations of Markov chains where the state and probability distribution is thousands of words long. If you could directly compute and use that matrix, you'd get the same result. But that would be insane.
No comments yet
Contribute on Hacker News ↗