Comment by AMavorParker
1 hour ago
The teachers never attempt to solve their own problems, only the students solve problems.
Regarding the TrueSkill of the teachers, the self-play settings we operate in in this paper are zero-sum competitive which means that the population skills cannot both increase together, as the objective of one population is adversarial against the other -- generating difficult tasks (teachers) but making difficult tasks easy (students learning to solve them)
No comments yet
Contribute on Hacker News ↗