Comment by growingswe

2 months ago

Great stuff! I wrote an interactive blogpost that walks through the code and visualizes it: https://growingswe.com/blog/microgpt

8 comments

growingswe

> By the end of training, the model produces names like "kamon", "karai", "anna", and "anton". None of them are copies from the dataset.

All 4 are in the dataset, btw
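For anyone who wants to check a claim like this themselves, here is a minimal sketch. It assumes you have the training names and the generated samples as plain Python lists. The function name `split_by_novelty` and the toy lists are made up for illustration, and they are not the real dataset or the script's actual output.

```python
def split_by_novelty(samples, training_names):
    """Split samples into (novel, memorized), comparing case-insensitively."""
    seen = {name.strip().lower() for name in training_names}
    novel, memorized = [], []
    for s in samples:
        # A sample counts as memorized if it appears verbatim in the training set.
        (memorized if s.strip().lower() in seen else novel).append(s)
    return novel, memorized


# Toy stand-ins (hypothetical, NOT the real training data or model output).
toy_training_names = ["anna", "anton", "emma", "olivia"]
toy_samples = ["anna", "anton", "zyxq"]

novel, memorized = split_by_novelty(toy_samples, toy_training_names)
print("novel:", novel)
print("memorized:", memorized)
```

With the real names file loaded into `training_names`, anything that lands in `memorized` is a copy rather than a new name.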

mym1990 2 months ago

This is likely because the blog is AI-generated and keys off this line from Karpathy: "As a preview, by the end of the script our model will generate (“hallucinate”!) new, plausible-sounding names." The LLM just repackaged that into something that is obviously wrong, which is kind of ironic.

This is awesome! Normally I'm pretty critical of LLM-assisted blogging, but this one's a real winner.

You should totally submit that to HN as an article, if you haven't already.

dang 2 months ago

We've put it in the second-chance pool (https://news.ycombinator.com/item?id=26998308), so it will get a random placement on HN's front page.

That’s beautifully done, thanks for posting. It's as helpful to an ML novice like me as Karpathy’s original.

Great!

really nice, thanks