Comment by monocasa
3 days ago
Not immediately, but it's not a much larger amount of work for llama than a new foundational model which typically has a tweaked compute graph.
3 days ago
Not immediately, but it's not a much larger amount of work for llama than a new foundational model which typically has a tweaked compute graph.
No comments yet
Contribute on Hacker News ↗