Comment by pama
1 year ago
The author explains what they did: where possible (open models that support grammar-constrained inference), restrict the model's output to valid moves; otherwise, sample the model for a valid move up to ten times and, if none of those attempts is valid, pick a random valid move.
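For readers who want to see the retry-then-fallback part concretely, here is a minimal sketch in Python. It is not the author's code: `choose_move`, `sample_move`, and `legal_moves` are hypothetical names, the model call is stood in for by any zero-argument callable, and the grammar-constrained path is not covered.

```python
import random
from typing import Callable, Optional, Sequence


def choose_move(
    sample_move: Callable[[], str],
    legal_moves: Sequence[str],
    max_attempts: int = 10,
    rng: Optional[random.Random] = None,
) -> str:
    """Return a legal move, preferring whatever the model suggests.

    sample_move is a hypothetical stand-in for one model query; it is
    assumed to return a move string such as "e4".
    """
    if not legal_moves:
        raise ValueError("no legal moves available")
    if rng is None:
        rng = random.Random()

    legal = set(legal_moves)
    for _ in range(max_attempts):
        candidate = sample_move().strip()
        if candidate in legal:
            # The model produced a valid move within the attempt budget.
            return candidate

    # Every attempt was invalid: fall back to a uniformly random legal move.
    return rng.choice(list(legal_moves))
```

With `max_attempts=10` this matches the "up to ten times" described above; the fallback guarantees the game can always continue, at the cost of occasionally playing a move the model never chose.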