Comment by cousin_it
2 years ago
Unlike most articles about machine learning, this one feels like it's written in my language.
The section on Lempel-Ziv is cool. Making a large LZ model is probably not very interesting because LZ is too strict, it matches strings exactly. What could be more interesting is something like "jpeg for text", strategically allowing some non-exactness to achieve a reduction in size. Which is probably what LLMs do.
JPEG is FFT which has also been used in DNNs (Fnet encoder/decoder).