Comment by khamidou
2 days ago
Sorry to self-promote but I did exactly that a few months back: https://khamidou.com/gpt2/
Generally, I think the Karpathy tutorials are a good starting point but they're very mathy (despite people telling you you only need high school math to understand llms, a lot of the abstractions and concepts he uses are a bit foreign to programmers).
I found out rebuilding inference of a known model taught me a lot more than passively sitting through the videos and maybe retyping his code. You should try it with something simple, like a model from a few years back!
No comments yet
Contribute on Hacker News ↗