Deep learning actually simplifies the extremely complex math of previous machine learning and statistics/stochastics into a very reasonable set of operations:
matrix multiplications and some very simple activation functions
(plus automatic derivates, some magic and some scientific glasses which you can ignore)
This is actually simpler than most self attention methods.
Deep learning actually simplifies the extremely complex math of previous machine learning and statistics/stochastics into a very reasonable set of operations:
matrix multiplications and some very simple activation functions (plus automatic derivates, some magic and some scientific glasses which you can ignore)