← Back to context

Comment by caturopath

9 days ago

LLMs are great with minority languages compared to almost anything else. Including better than the by the natural language generation employed to use Abstract Wikipedia, which whiffs at relatively large languages like Zulu and Xhosa, let alone many of the rarer languages that popular LLMs speak fluently.

This program is aimed at getting actual humans to write their actual language. Nothing beats that.

At a minimum, it provides more material to train an LLM on.

LLMs are really bad at smaller European languages even, e.g. scandinavian ones, or Finnish. Much worse than the NLP situation before LLMs.

  • My experience with their Norwegian has been fantastic, I'd be shocked if the other Scandanavian languages aren't at least as good.