Comment by growingswe
19 hours ago
Great stuff! I wrote an interactive blogpost that walks through the code and visualizes it: https://growingswe.com/blog/microgpt
> By the end of training, the model produces names like "kamon", "karai", "anna", and "anton". None of them are copies from the dataset.
All 4 are in the dataset, btw
You should totally submit that to HN as an article, if you haven't already.
We've put it in the second-chance pool (https://news.ycombinator.com/item?id=26998308), so it will get a random placement on HN's front page.
This is awesome! Normally I'm pretty critical of LLM-assisted-blogging, but this one's a real winner.
That’s beautifully done, thanks for posting. As helpful again to an ML novice like me as Karpathy’s original.
Great!
really nice, thanks