Comment by _0ffh
2 years ago
The trick is to make sure the recursive dependency stays linear, that's how you enable parallel training.
2 years ago
The trick is to make sure the recursive dependency stays linear, that's how you enable parallel training.
No comments yet
Contribute on Hacker News ↗