Comment by adzm
3 hours ago
It is worth noting that as the length of the data increases, it becomes extremely unlikely that the index and length of the sequence within pi would actually be smaller than the data itself.
That seems easy enough to solve. Simply record the index and length in pi of the index and length in pi.
See also: Recursion
Back in college, I thought I could compress my phone number by telling people its index in pi, but my 7-digit phone number is at an 8-digit index.
I didn’t have the compute to find my 10-digit number with the area code.
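For anyone who wants to try the phone-number experiment on a small scale, here is a rough Python sketch. It is not from TFA: `pi_digits` and `find_in_pi` are names made up for illustration, the digits come from Machin's arctangent formula in integer arithmetic, and it searches decimal digits of pi only. The 20,000-digit default is an arbitrary assumption, far too small for a real 7-digit number.

```python
def pi_digits(n, guard=10):
    """Return the first n decimal digits of pi after the '3.' as a string.

    Uses Machin's formula pi = 16*atan(1/5) - 4*atan(1/239) in scaled
    integer arithmetic; `guard` extra digits absorb truncation error.
    """
    scale = 10 ** (n + guard)

    def arctan_inv(x):
        # atan(1/x) * scale, summed from the alternating Taylor series.
        power = scale // x          # scale / x^(2j+1), starting at j = 0
        total = power
        x2 = x * x
        k, sign = 3, -1
        while power:
            power //= x2
            total += sign * (power // k)
            sign = -sign
            k += 2
        return total

    pi_scaled = 16 * arctan_inv(5) - 4 * arctan_inv(239)
    # str(pi_scaled) starts with the leading "3"; skip it.
    return str(pi_scaled)[1:n + 1]


def find_in_pi(pattern, n_digits=20000):
    """0-based index of `pattern` in the first n_digits decimals, or -1."""
    return pi_digits(n_digits).find(pattern)


if __name__ == "__main__":
    # Compare how many digits the data has with how many the index needs.
    for pattern in ["7", "42", "867"]:
        idx = find_in_pi(pattern)
        if idx < 0:
            print(pattern, "not found in the digits searched")
        else:
            print(pattern, "has", len(pattern), "digits; index", idx,
                  "has", len(str(idx)), "digits")
```

On average a specific k-digit string first shows up somewhere around position 10^k, so the index tends to need about as many digits as the data, which matches the 7-digit number landing at an 8-digit index.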
TFA addresses this
> Now, we all know that it can take a while to find a long sequence of digits in π, so for practical reasons, we should break the files up into smaller chunks that can be more readily found.
> In this implementation, to maximise performance, we consider each individual byte of the file separately, and look it up in π.
The index of your 20-line file is <20TB number>
Unless, in turn, you locate the index itself in pi at a much smaller index. And so on...
Find k candidate indices for your data, then locate each of them. If the smallest one is a significantly smaller index space, repeat.
Can't tell if you're in on the joke or not, but for anyone who is genuinely wondering whether this might work: a 1-byte index value can represent at most 256 different indexes, but if you're trying to store 9 bits of data, there are already 512 different possible values, and each one needs its own index or you won't be able to tell them apart. Those pigeons aren't gonna fit.
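The counting argument above can be checked by brute force. A minimal sketch, assuming a made-up `toy_encode` standing in for any scheme that squeezes a 9-bit value into one byte (both function names are hypothetical, not anything from pifs):

```python
def first_collision(encode, n_inputs):
    """Return the first pair of distinct inputs that share an encoding.

    Returns None if all n_inputs values get different encodings.
    """
    seen = {}
    for value in range(n_inputs):
        code = encode(value)
        if code in seen:
            return seen[code], value
        seen[code] = value
    return None


def toy_encode(value):
    # Hypothetical "compressor": maps any integer to a single byte (0..255).
    return (value * 167 + 13) % 256


if __name__ == "__main__":
    # 256 inputs fit into 256 byte values without clashing...
    print(first_collision(toy_encode, 256))
    # ...but 512 inputs (9 bits) cannot, whatever the encoder does.
    print(first_collision(toy_encode, 512))
```

Swapping in a different `toy_encode` changes which pair collides, but with 512 inputs and only 256 possible outputs some pair always will.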
yes I believe that's the joke
He’s aware, he just added some curious information.
Point taken about the index potentially being really long. Why would the length be longer than the data? Don’t you need to find the right sequence?