Comment by sethkim
4 months ago
Under-discussed superpower of LLMs is open-set labeling, which I sort of consider to be inverse classification. Instead of using a static set of pre-determined labels, you're using the LLM to find the semantic clusters within a corpus of unstructured data. It feels like "data mining" in the truest sense.
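To make the contrast with fixed-label classification concrete, here is a minimal sketch, assuming the labeler is just some callable that returns a free-form string. `open_set_label` and `toy_label_fn` are hypothetical names, and `toy_label_fn` is a keyword stub standing in for an LLM call so the example runs offline; a real model could return labels nobody listed in advance.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List


def open_set_label(
    docs: Iterable[str], label_fn: Callable[[str], str]
) -> Dict[str, List[str]]:
    """Group documents by whatever label label_fn returns.

    No label set is declared up front: the keys of the result are
    discovered from the data, which is the "inverse classification" idea.
    """
    clusters: Dict[str, List[str]] = defaultdict(list)
    for doc in docs:
        # Light normalization so trivial case/whitespace differences
        # do not create separate clusters.
        label = label_fn(doc).strip().lower()
        clusters[label].append(doc)
    return dict(clusters)


def toy_label_fn(doc: str) -> str:
    """Hypothetical stand-in for an LLM call (assumption, not a real API)."""
    text = doc.lower()
    if "refund" in text or "charge" in text:
        return "Billing"
    if "crash" in text or "error" in text:
        return "Stability"
    return "Other"


docs = [
    "I was charged twice, please refund me",
    "App crashes on launch",
    "Love the new dark mode",
    "Error 500 when saving",
]
clusters = open_set_label(docs, toy_label_fn)
```

The grouping function never sees a list of allowed labels, so swapping the stub for a real model changes which clusters appear without changing the code around it.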
OP here. This is exactly right! You've beautifully encapsulated the idea I stumbled upon.
Problem is, these don't bin properly.
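One reading of this complaint is that free-form labels come back as near-duplicates ("Billing issue" vs. "billing issues") that never collapse into a single bin. A rough sketch of one possible mitigation, assuming plain string similarity is good enough: fold each label into the first earlier label it closely resembles. `bin_labels` and the 0.8 threshold are illustrative assumptions, not something from the thread, and string similarity will miss synonyms that share few characters.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Optional


def bin_labels(labels: List[str], threshold: float = 0.8) -> Dict[str, str]:
    """Map each raw label to a canonical label.

    A label joins the first existing canonical label whose similarity
    ratio is at least `threshold`; otherwise it becomes a new canonical
    label. The result depends on input order (greedy, first match wins).
    """
    canonical: List[str] = []
    mapping: Dict[str, str] = {}
    for raw in labels:
        # Normalize case and collapse runs of whitespace before comparing.
        norm = " ".join(raw.lower().split())
        match: Optional[str] = next(
            (c for c in canonical
             if SequenceMatcher(None, norm, c).ratio() >= threshold),
            None,
        )
        if match is None:
            canonical.append(norm)
            match = norm
        mapping[raw] = match
    return mapping


raw_labels = ["Billing issue", "billing issues", "Login failure"]
bins = bin_labels(raw_labels)
```

Here the two billing variants land in the same bin while the login label stays separate. For labels that mean the same thing but are spelled differently, a similarity measure over embeddings would likely be needed instead of character overlap.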