Comment by wg0

2 years ago

Jaw dropping... so essentially DNNs also just "compress" the information? Is that the takeaway here?

Why does this conclusion follow?

Of course similar text compresses more efficiently, but NNs don't work with compressed (varying-size) representations; they work with vector representations which happen to be close in similarity space.
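
A quick way to see the "similar text compresses better" point, using nothing beyond Python's standard library (the byte counts in the comments are rough):

```python
import os
import zlib

similar = b"the cat sat on the mat. " * 100   # highly redundant text
noise = os.urandom(len(similar))              # same length, near-maximal entropy

print(len(similar), len(zlib.compress(similar)))  # 2400 -> a few dozen bytes
print(len(noise), len(zlib.compress(noise)))      # 2400 -> roughly 2400 bytes, no gain
```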

  • they work with compressed representations: you take arbitrary information with varying entropy and map it into a fixed-size vector representation. That's a compression.
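
For what it's worth, a toy numpy sketch of the fixed-size part; the hashing embedding here is just a stand-in for whatever a real network learns:

```python
import numpy as np

DIM = 64
rng = np.random.default_rng(0)
table = rng.normal(size=(10_000, DIM))   # toy embedding table (a real NN learns this)

def embed(text):
    # Hash each token to a row and mean-pool: any input length -> one DIM-sized vector.
    ids = [hash(tok) % 10_000 for tok in text.split()]
    return table[ids].mean(axis=0)

print(embed("the cat").shape)                                             # (64,)
print(embed("a much longer sentence with many more words in it").shape)   # still (64,)
```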

Well, yeah, but the training process means that the compression is both lossy and much less efficient than a standard compression method like gzip. You could even train your NN until it can losslessly recall its training data, but we generally call that "overfitting" in the lingo.

  • The way you'd do compression with a NN is to use it to predict the probability of the next symbol, and feed that prediction into an arithmetic coder to produce the compressed representation. This process is lossless, and better prediction quality translates directly into better compression.
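
A rough sketch of that link (the arithmetic coder itself is left out; it would emit within a couple of bits of the ideal code length computed here):

```python
import math
from collections import Counter

text = "abracadabra abracadabra abracadabra"

def bits_needed(predict):
    """Ideal code length: an arithmetic coder gets within ~2 bits of this total."""
    return sum(-math.log2(predict(text[:i], c)) for i, c in enumerate(text))

def uniform(context, symbol):
    # Weak model: every symbol in a 27-letter alphabet is equally likely.
    return 1 / 27

freq = Counter(text)
def unigram(context, symbol):
    # Better model: use symbol frequencies (a stand-in for "predicts better").
    return freq[symbol] / len(text)

print(bits_needed(uniform))   # ~166 bits
print(bits_needed(unigram))   # ~78 bits: better prediction -> smaller output
```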

Yes, the biggest mindfuck is autoencoders: you literally brute-force train a lossy compressor.
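
For concreteness, a toy version of that, assuming PyTorch and with random data standing in for a real dataset:

```python
import torch
from torch import nn

# Squeeze 784-dim inputs through an 8-dim bottleneck and train the decoder to
# reconstruct them; the bottleneck vector *is* the lossy compressed representation.
model = nn.Sequential(
    nn.Linear(784, 128), nn.ReLU(),
    nn.Linear(128, 8),                 # encoder: 784 floats -> 8 floats
    nn.Linear(8, 128), nn.ReLU(),
    nn.Linear(128, 784),               # decoder: 8 floats -> 784 floats
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.rand(256, 784)               # stand-in data; use real images in practice
for _ in range(200):                   # "brute-force" = just minimize reconstruction error
    loss = nn.functional.mse_loss(model(x), x)
    opt.zero_grad()
    loss.backward()
    opt.step()
```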