Comment by verdverm
20 days ago
getting tricks embedded into the weights is expensive, it doesn't happen in a single pass
that's why we teach them new tricks on the fly (in-context learning) with instruction files
Right, it sounds like an artificial limitation.
it's more a mathematical / algorithmic limitation
I’ll counter it’s an architectural issue