Comment by gwern
3 years ago
I think I would skip trying to do any entity or term extraction initially. That's going well beyond basic linking. Stick with something like TF-IDF similarity or LDA topic modeling to keep things fast and straightforward until you have the UI/UX worked out, which is always more work than it looks like. (The reason I used neural net embeddings was that the OA API had just added them, and it was a fun excuse to try that out.)
No comments yet
Contribute on Hacker News ↗