Comment by SchemaLoad

6 hours ago

I'd say speech to text is unsolvable for a more fundamental reason that it's hard to actually speak out an entire document flawlessly in one take.

Spoken language is very different to written language, which is why for example you can easily tell when an article is transcribing a spoken interview.

Yes, it's a UX thing. You'd still have to edit it by typing afterwards as well.

Similarly, raw LLM/chat interfaces are usually not the best option.