Comment by orbital-decay
8 hours ago
DS v4 is an undertrained snapshot, as their model card mentions. The full version is supposed to be released later and to support multimodal input. That said, hallucination rate likely depends far more on the training policy and the optimization tradeoffs made than on scale.