Comment by CamperBob2
6 hours ago
Everything uses transformers and despite all the novel architectures that have come out in those years, transformers are still the best and I'm not sure how to come to terms with that. Does it mean that researchers wasted their time on useless dead end architectures, or are they ahead of the curve and commercial companies are slow to adopt them?
I don't quite follow. Are you saying researchers are wasting their time working with transformer networks now, or that they wasted too much time in the past, or...?
Even the coding agents are more primitive than expected.
What did you expect, exactly? I don't know about you, but I bought my GPU to play games, and now it's finding bugs in my C code, writing better code to replace it, and checking it into Github. That doesn't signal "primitive" to me. More like straight outta Roswell.
No comments yet
Contribute on Hacker News ↗