Comment by vman512
1 day ago
Sounds right. The policy for rejection can depend on what you want - you might accept the top K highest probability tokens or top P probability mass. Or you can do something like importance sampling and probabilistically reject based on the ratio of likelihoods
No comments yet
Contribute on Hacker News ↗