Comment by Jeff_Brown
10 hours ago
This surprises me: "These modern systems are developed to sound human, natural, and conversational. Unfortunately this seems to come at the expense of accuracy. In my testing, both models had a tendency to skip words, read numbers incorrectly, chop off short utterances, and ignore prosody hints from text punctuation. "
They also have built-in abbreviation dictionaries. For example, Acapela likes to expand AST to Atlantic Standard Time, even when the context is so obviously (not) talking about time zones.
Why does it surprise you?