← Back to context

Comment by dbl000

4 days ago

Echoing what everyone else has said here - awesome site, love how fast it was.

I did notice that when I put in a single book in a series (in my case Going Postal, Discworld #33) that tended to dominate the rest of the selection. That does make sense, but I don't want recommendations for a series I'm already well into.

Also noticed that a few books (Spycraft by Nadine Akkerman and Pete Langman, Tribalism is Dumb by Andrew Heaton) that I know are in goodreads and reviewed didn't show up in the search. I tried both author's name and the title of the book. Maybe they aren't in the dataset.

It did stumble with some books more niche books (The Complete Yes Minister). Trying the "Similar" button gave me more books that were _technically_ similar because they were novelizations of British comedy shows, but not what I was looking for.

For more common books though it lined up very well with books already on my wishlist!

Yes I would say the handling of series is probably the biggest problem. Once my test metrics got to a point I was happy with and my quality spot checks passed (can I follow the models recommendations from one generic history book to Steven Runciman, also making sure popular books don't always dominate the results), I was ready to release because I had been working on this project for so long. The solution is probably using the transformer model to generate 100-200 candidates and then having a reranker on top.

  • Not just series, but I seem to mostly get a list of other books from the same authors.

    The recommendations from other authors are good, but as far as I can tell I’ve read every single one of them.

    Continuing to aggressively add everything it recommends eventually does seem to result in some interesting books I wasn’t familiar with, but I also end up with more and more books that are of zero interest to me.

    For what it’s worth, I started with:

      Infinite Jest David Foster Wallace
      Europe Central William T. Vollmann
      Gravity’s Rainbow Thomas Pynchon
      White Noise Don DeLillo
      One Hundred Years of Solitude Gabriel García Márquez
    

    It is possible that there simply aren’t many books like these in existence, so the pool of relevant recommendations gets exhausted fairly quickly. I’d guess trending towards unrelated popular books is also just a feature of the source data, that largely sums up my experience with goodreads anyway.

    Very cool project though. I did end up ordering a couple of new books, so thank you very much.

  • Releasing is the right choice, well done with this it’s really cool.

    I’ve only had a short play but a solution to this problem might be to show authors rather than books. Or select authors outside of the list the user has shared and then a top n (1,3,5) for each of those.

    I feel like that’s how you’d recommend to someone else - type of book -> unknown author -> best matching few books from them.

    After that the other side would be trying to find some diversity (if you think I’d like author X, personally you might suggest three different styles of book from them rather than three very similar books from them)

Going Postal is awesome. The flood of the mails and the test to be the post master where you would have a slide mail into the hole where a vicious dog is barking.