Comment by Jeff_Brown

17 days ago

This surprises me: "These modern systems are developed to sound human, natural, and conversational. Unfortunately this seems to come at the expense of accuracy. In my testing, both models had a tendency to skip words, read numbers incorrectly, chop off short utterances, and ignore prosody hints from text punctuation."

2 comments

ethin 16 days ago

They also have built-in abbreviation dictionaries. For example, Acapela likes to expand AST to Atlantic Standard Time, even when the context is obviously not about time zones.
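To illustrate, here is a rough sketch of the kind of context-free lookup that could produce this. It is a guess at the mechanism, not Acapela's actual code; the `ABBREVIATIONS` table and `expand_abbreviations` function are made up for the example.

```python
import re

# Hypothetical abbreviation table; a real TTS front end ships its own list.
ABBREVIATIONS = {"AST": "Atlantic Standard Time"}

def expand_abbreviations(text, table=ABBREVIATIONS):
    """Replace every whole-word abbreviation, ignoring the surrounding context."""
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, table)) + r")\b")
    return pattern.sub(lambda m: table[m.group(1)], text)

# A sentence about compilers still gets the time-zone expansion.
print(expand_abbreviations("The parser builds an AST from the source."))
# prints: The parser builds an Atlantic Standard Time from the source.
```

Because the lookup never checks what the sentence is about, any whole-word match gets expanded, which would explain the behaviour described above.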

layer8 17 days ago

Why does it surprise you?