Comment by rob_c
17 days ago
Not only can I guarantee the models are bad with numbers; unless it's a highly tuned and modified version, they're also too slow for this arena. Stick to using attention-based transformers in better model designs, which have much lower latencies than pre-trained LLMs...