Slacker News

Comment by colechristensen

12 hours ago

No, they just need to be trained to have adversarial self-review "thinking" processes.

You ask an LLM "What's wrong with your answer?" and you get pretty good results.
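
A minimal sketch of that self-review loop, for illustration only. The `ask` callable is a hypothetical stand-in for whatever sends a prompt to a model and returns its text reply; the function name and prompt wording are assumptions, not any particular vendor's API.

```python
from typing import Callable

# The follow-up question quoted in the comment above.
CRITIQUE_PROMPT = "What's wrong with your answer?"


def answer_with_self_review(ask: Callable[[str], str], question: str) -> dict:
    """Draft an answer, ask the model to critique it, then ask for a revision.

    `ask` is a hypothetical callable: prompt text in, model reply out.
    """
    # Step 1: get the first-pass answer.
    draft = ask(question)

    # Step 2: show the model its own answer and ask what is wrong with it.
    critique = ask(
        f"Question: {question}\nYour answer: {draft}\n{CRITIQUE_PROMPT}"
    )

    # Step 3: ask for a final answer that takes the critique into account.
    revised = ask(
        f"Question: {question}\nYour answer: {draft}\n"
        f"Critique: {critique}\nGive a final answer."
    )

    # Return all three so the caller can compare draft and revision.
    return {"draft": draft, "critique": critique, "revised": revised}
```

Returning the draft alongside the revision lets a caller compare the two rather than assume the revised answer is the better one.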

2 comments


binary0010  12 hours ago

Or the original output was perfect and the adversarial "rethinking" switches it to an incorrect result.

  • byzantinegene  11 hours ago

    this seems to happen far more than i would like
