Comment by tester756

1 month ago

Why not just make screenshoot of every PDF page?

6 comments

tester756

It could still be identifiable, for example if the document has been prepared such that the intended recipient's identity is encoded into subtle modulation of the widths of spaces.

yyyk 1 month ago

That's outside this threat model? The idea here is trying to foil outside analysis, not limit the document authors (which are allowed to add/update and even write openly 'the intended recipient's identity').
sincerely 1 month ago
Print and re-scan wouldn’t fix that though.
- jeffbee 1 month ago
  
  That was my point. If you want to erase its origin you need to semantically extract the contents and reduce them to their most basic representation.
tester756 1 month ago

Sure, but all those not-essential information hidden in PDFs format are removed
idiotsecant 1 month ago

In PDF file format?