Comment by BobbyJo
2 months ago
This is less a deficiency of the model and more a deficiency of the encoder, IMO. You can consider the encoder part of the model, but I think the semantics of our conversation require differentiating between the two.