← Back to context

Comment by olmo23

12 hours ago

How do you prevent degenerate strategies? I could trivially give a model a SHA256 hash and ask it to provide the source input.

In class you'd probably want a rule saying at least one LLM should be able to figure out the answer, but in a head-to-head I'm not sure how to solve it.

Maybe make the LLM:s write questions that they can solve (without seeing the question writing context) but not other LLm:s.

On the other hand then maybe a good strategy would be to write questions that the LLM just happen to have in a nich dataset in its training ”what did user5455 say to user6835?”

Nevermind my idea.