Comment by viccy-kobuletti

7 months ago

How does one begin to educate oneself on the way LLMs work beyond layman understanding of it being a "word predictor"? I use LLMs very heavily and do not perceive any differences between models. My math background is very weak and full of gaps, which i'm currently working on through khan academy, so it feels very daunting to approach this subject for a deeper dive. I try to read some of the more technical discussions (e.g waluigi effect on lesswrong), however it feels like I lack the needed knowledge to not have it completely go over my head, not taking into account some of the surface-level insights.

2 comments

viccy-kobuletti

quonn 7 months ago

Start here:

https://udlbook.github.io/udlbook/

TuringNYC 7 months ago

I had not heard of this, wow, this is GOLD!