Comment by KaiserPro
2 years ago
> a model trained on 500B tokens of English and 50B tokens of French will not speak French like a model trained on only 50B tokens of French, but much, much better.
That's because French and English are reasonably similar and share the same context.
Whilst Latin and English are distantly related (Latin is more closely related to French), they do not share the same cultural context.
Which version of Latin are you translating? Medieval?
Whilst it's fun to do, and it has its place, there need to be massive caveats about accuracy.
It doesn't matter whether it's French or Korean. That was just an illustration.