Comment by RestartKernel
7 hours ago
What are the costs looking like to run this? I wonder whether you would be able to use this approach within a mixture-of-experts model trained end-to-end in ensemble. That might take out some guesswork insofar the roles go.
No comments yet
Contribute on Hacker News ↗