Comment by macleginn
5 hours ago
They are poor at generalising from a small number of examples; this is why the real generalisation power is achieved in pre-training.