Comment by peterisza
21 hours ago
right, 888 kB would be impossible for local inference
however, it is really not that impressive for just a client
It's not completely impossible, depending on what your expectations are. That language model that was built out of redstone in Minecraft had... looks like 5 million parameters. And it could do mostly coherent sentences.
Ummm...
Which is a lot more than 888 kB... Supposing your ESP32 could use qint8 (LOL), that's still 1 byte per parameter, and the k in kB stands for thousand, not million.
https://www.youtube.com/watch?v=VaeI9YgE1o8
Yes, I know how much a kilobyte is. But cutting down to 2 million 3-bit parameters or something like that would definitely be possible.
And a 32-bit processor should be able to pack and unpack parameters just fine.
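A minimal sketch of what that packing might look like in C, assuming 3-bit weights stored 10 to a 32-bit word; the layout, function names, and the budget math in the comments are my own assumptions, not anything from an actual implementation:

```c
#include <stdint.h>
#include <stdio.h>

/* Budget check (assumed figures): 2,000,000 params at 10 per 32-bit
 * word = 200,000 words = 800,000 bytes, just under 888 kB. For
 * comparison, qint8 at 1 byte/param would need 2 MB, and the 5M-param
 * redstone model would need 5 MB -- hence the 3-bit squeeze. */

#define PARAMS_PER_WORD 10  /* floor(32 / 3); 2 bits per word wasted */

/* Pack one 3-bit value (0..7) into slot i of word w. */
static inline uint32_t pack3(uint32_t w, int i, uint8_t v) {
    return w | ((uint32_t)(v & 0x7) << (3 * i));
}

/* Unpack slot i of word w back into a 3-bit value. */
static inline uint8_t unpack3(uint32_t w, int i) {
    return (w >> (3 * i)) & 0x7;
}

int main(void) {
    uint8_t weights[PARAMS_PER_WORD] = {5, 0, 7, 3, 1, 6, 2, 4, 7, 1};
    uint32_t word = 0;
    for (int i = 0; i < PARAMS_PER_WORD; i++)
        word = pack3(word, i, weights[i]);
    for (int i = 0; i < PARAMS_PER_WORD; i++)
        printf("%d ", unpack3(word, i));  /* prints 5 0 7 3 1 6 2 4 7 1 */
    printf("\n");
    return 0;
}
```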
Edit: Hey, look what I just found: https://github.com/DaveBben/esp32-llm ("a 260K parameter tinyllamas checkpoint trained on the tiny stories dataset")
I disagree; in the future it might be possible. Perhaps not in English, but in some more formal (yet fuzzy) language with some basic epistemology.
I mean, there is a lambda calculus self-interpreter in 29 bytes. How many additional logical rules are required for AGI inference? Maybe not as many as people think. Understanding about 1000 concepts of Basic English (or, say, Lojban) might well be sufficient. It is possible this can be encoded in 800 kB; we just don't know how.
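For scale, a back-of-envelope sketch of that budget; the 1000-concept and 800 kB figures come from the comment above, and the 3-bit conversion is my own assumption:

```c
#include <stdio.h>

/* Rough arithmetic only: how many bytes (and 3-bit parameters)
 * each concept could get under an 800 kB total budget. */
int main(void) {
    const long budget_bytes = 800 * 1000;       /* 800 kB */
    const long concepts     = 1000;
    long per_concept = budget_bytes / concepts; /* 800 bytes each */
    printf("%ld bytes (~%ld 3-bit params) per concept\n",
           per_concept, per_concept * 8 / 3);   /* ~2133 params */
    return 0;
}
```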