Comment by mariano54

8 months ago

Yes we do have this issue, but it's improved a bit over chatgpt due to using multiple transcribers.

The models are improving though, and they are at a very good place for English at the moment. I expect by next year we will switch over to full voice to voice models.

2 comments

mariano54

harles 8 months ago

This reply seems to miss the question, or at least doesn’t answer it clearly. Is this service overly tolerant of mispronunciations? Foundational models are becoming more tolerant, not less, over time which is the opposite of what I’d want in this case.

mariano54 8 months ago

It's less tolerant of mispronunciations. There is custom promting to explicitly leave in mistakes and to not fix them. It's still not perfect and it (the speech to text module) sometimes corrects the user's pronunciation mistakes.