← Back to context

Comment by dredmorbius

2 days ago

There are various permutations.

There are scanned-in books whose index pages don't precisely match the digital pages. Good PDFs will account for that offset themselves, but manual recalculation may be necessary.

Worse are books half-assedly converted from print to digital. These often include an index (useful for all the reasons others have mentioned elsewhere in this thread), but the "faithful" reproduction of the print text means that the page enumeration in the index bears a nonconstant relationship to the digital text. The offsets are not constant.

Then there are ePubs with the above feature. The sane thing to do would be to link the index entry to occurances. Often you'll find, again, print-edition page mentions which are of little use in locating the passage within your digital edition.

One of the underlying problems is that the print notion of "page" is increasingly archaic. For languages / typographies in which paragraphs are a useful convention, paragraph numbering might be preferable (this should be constant across formats). Direct symlinks are of course useful, but these conceal information revealed in a conventional (print) index such as passages where a topic is discussed at some length, or clusters of appearances, as well as cross-references or associated references which a well-constructed index will reveal.

I can't remember examples like those, but one I deal with is where the index has a different kind of numbering scheme like "E403.1-a". But at least cmd+f maybe works in that case, unless of course that string shows up everywhere.

  • Technical / government docs often feature such numbering schemes. I believe part of the history is that those documents were often composed in segments or sections, often by independent teams, such that a fixed page enumeration wasn't readily available and/or would change frequently.

    There's a pre-digital publishing trend of loose-leaf or removeable bindings, with publications prepared in sections or with periodic supplements, which began in the late 19th / early 20th century. Those would typically be organised and numbered by section. The concept is somewhat trite now, but I think of it as a significant stage and evolution of publishing, somewhere between fixed-format codices, periodicals, and eventually databases and wikis.