Comment by jazzyjackson
13 hours ago
I figure an LLM would be way better at classifying insults than regexing against a bad word list. Why would character level be desirable?
13 hours ago
I figure an LLM would be way better at classifying insults than regexing against a bad word list. Why would character level be desirable?
I'd imagine for simplicity - just skip the tokenizer and feed bytes.
Might a character-level LLM be better at recognizing poorly spelled (or deliberately misspelled) profanity?