Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library
← Back to context

Comment by rugina

1 year ago

I think NM translation was broken all along. Not in the neural network part but in choosing the right answer. https://aclanthology.org/2020.coling-main.398.pdf

2 comments

rugina

Reply

astrange  1 year ago

Since LLMs are loosely based on NM models, it seems research on newer sampling methods like Mirostat might help here.

earngurus234  1 year ago

[dead]

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities