Comment by albertzeyer

2 years ago

I would say the opposite. This paper was a very easy read, totally clear from the first reading what it is about, etc.

The background matters. Attention was already very well known in the community (machine translation), so nothing new for this paper, and it was written for such an audience which already knows these basics concepts like attention.

If you want to learn about attention, read some of the actual background papers which introduced it.