Comment by JTrehan

1 month ago

I'm still working on my PDF search engine for desktop: https://www.docgoblin.com/ I'm implementing a bookmark utility right now and hope to add support multiple E-books format in the near future.

A one time payment app - interesting (I'm also working on something with similar moneytization solution). How are things going? I'd love to know the experience of another solopreneur, what stack are you using? I wonder - What are you using to parse PDFs and extract the text? I found that is a nightmare when was doing something similar for WithAudio (my app). - Are you just extracting the text or you are doing any post processing to identify which lines belong to the same paragraph or not?

  • Things are going slow, but it is a passion project so it's ok :) A few people have bought a licence and it seems most people who try the app are very happy with it so I'm happy too.

    The app is entirely in Java, with javaFX for the UI and Lucene for the search engine. To read and render PDFs I use PDFium.

I'm working on a self-hostable ebook library (https://github.com/colibri-hq/colibri), and currently tinkering with searching over book content. Have you written about your approach to search somewhere, perhaps? Would be very interested in learning how others go about this. Kudos for DocGlobin, looks great :-)