Comment by coldtea
19 days ago
>Those little black boxes of AI can be significantly demystified by, for example, watching a bunch of videos (https://karpathy.ai/zero-to-hero.html) and spending at least 40 hours of hard cognitive effort learning about it yourself.
That's like saying you can understand humans by watching some physics or biology videos.
No it’s not
Nobody has built a human so we don’t know how they work
We know exactly how LLM technology works
We know _how_ it works but even Anthropic routinely does research on its own models and gets surprised
> We were often surprised by what we saw in the model
https://www.anthropic.com/research/tracing-thoughts-language...
Which is…true of all technologies since forever
9 replies →
We know why they work, but not how. SotA models are an empirical goldmine, we are learning a lot about how information and intelligence organize themselves under various constraints. This is why there are new papers published every single day which further explore the capabilities and inner-workings of these models.
You can look at the weights and traces all you like with telemetry and tracing
If you don’t own the model then you have a problem that has nothing to do with technology
2 replies →