Comment by ftxbro
3 years ago
How is your paper different from all the ones like 'transformers are really x', where x is the author's special field of study?
3 years ago
IMO it is important to understand transformer mechanics through core ML themes like SVMs and feature selection. Our results are not an interpretation; they are mathematically rigorous and numerically verifiable. That said, we have no intention of trivializing a complex model like GPT-4 as a simple SVM. That is a tall order :)
If there is actually equivalence between different type systems and algorithms, that opens the door for simplification through unification.