Comment by redox99

4 months ago

Character level LLMs are used for detecting insults and toxic chat in video games and the like.

6 comments

redox99

Can you give an example of a video game explicitly using character-level LLMs? There were prototypes of char-rnns back in the day for chat moderation but it has significant compute overhead.

redox99 4 months ago

It's something I heard through the grapevine. But there's only a few big enough competitive games where toxicity is such a big deal, so it's not hard to guess.
Character level helps with players disguising insults.
Compute wise it's basically the same, but multiply token count by 4. Which doesn't really matter for short chat in video games.

viraptor 4 months ago

Yes, for small messages and relatively small scope dictionary, character level will work. But that's very different from what's tested here.

jazzyjackson 4 months ago

I figure an LLM would be way better at classifying insults than regexing against a bad word list. Why would character level be desirable?

vanviegen 4 months ago

I'd imagine for simplicity - just skip the tokenizer and feed bytes.
duskwuff 4 months ago

Might a character-level LLM be better at recognizing poorly spelled (or deliberately misspelled) profanity?