Comment by vanuatu
2 days ago
all the labs "clean" their pretraining data, and you can have your pretraining data to be minimally ai generated but also spam synthetic post-training data
2 days ago
all the labs "clean" their pretraining data, and you can have your pretraining data to be minimally ai generated but also spam synthetic post-training data
No comments yet
Contribute on Hacker News ↗