Comment by ftxbro

3 years ago

how is your paper different from all the ones like 'transformers are really x' where x is the author's special field of study

IMO it is important to understand transformer mechanics through core ML themes like SVM and feature-selection. Our results are not interpretation, they are mathematically rigorous and numerically verifiable. That said, we have no intention of trivializing a complex model like GPT-4 as a simple SVM. That is a tall order :)

If there is actually equivalence between different type systems and algorithms, that opens the door for simplification through unification.