Comment by mistrial9
10 months ago
regarding training data -- is the main base model here trained only in FineWeb-2 ? or is it more also ..
10 months ago
regarding training data -- is the main base model here trained only in FineWeb-2 ? or is it more also ..
No comments yet
Contribute on Hacker News ↗