Comment by dotancohen

9 days ago

  > give me the information on this page in my preferred language

I'm sure that works great for European languages and other languages with huge corpora. Those are not the target languages of the program in question.

LLMs are great with minority languages compared to almost anything else. That includes the natural language generation employed by Abstract Wikipedia, which whiffs at relatively large languages like Zulu and Xhosa, let alone many of the rarer languages that popular LLMs speak fluently.

  • This program is aimed at getting actual humans to write their actual language. Nothing beats that.

    At a minimum, it provides more material to train an LLM on.

  • LLMs are really bad even at smaller European languages, e.g. the Scandinavian ones or Finnish. Much worse than the NLP situation before LLMs.

    • My experience with their Norwegian has been fantastic; I'd be shocked if the other Scandinavian languages aren't at least as good.