Comment by coldtea

19 days ago

>Those little black boxes of AI can be significantly demystified by, for example, watching a bunch of videos (https://karpathy.ai/zero-to-hero.html) and spending at least 40 hours of hard cognitive effort learning about it yourself.

That's like saying you can understand humans by watching some physics or biology videos.

16 comments

coldtea

AndrewKemendo 19 days ago

No it’s not

Nobody has built a human so we don’t know how they work

We know exactly how LLM technology works

abustamam 19 days ago
We know _how_ it works but even Anthropic routinely does research on its own models and gets surprised
> We were often surprised by what we saw in the model
https://www.anthropic.com/research/tracing-thoughts-language...
- AndrewKemendo 19 days ago
  
  Which is…true of all technologies since forever
  
  9 replies →
soulofmischief 19 days ago
We know why they work, but not how. SotA models are an empirical goldmine, we are learning a lot about how information and intelligence organize themselves under various constraints. This is why there are new papers published every single day which further explore the capabilities and inner-workings of these models.
- AndrewKemendo 19 days ago
  
  You can look at the weights and traces all you like with telemetry and tracing
  If you don’t own the model then you have a problem that has nothing to do with technology
  
  2 replies →