Comment by cratermoon
9 days ago
It's pretty well known that the AI companies are heavy users of Amazon MTurk for their RLHF post-training.
He meant the opposite: you could be paid as a worker on MTurk but actually just be feeding it LLM output.
In this case it's model distillation: training a new model on the outputs of another.