← Back to context

Comment by embedding-shape

1 day ago

Isn't this project the one Microsoft published but then soon after pulled it for security/safety reasons? What has changed since then?

Look at the "News" section in the readme - The original TTS model is gone from this repo (you can still find it other places), but the SST/ASR, long form TTS, and streaming TTS models are newer.

It’s confusing (at least for me) because the project covers a number of things including what you are mentioning.

  • [off topic]

    When explanations get posted directly in HN comments, I imagine someone somewhere in the world is able to learn in spite of their Internet restrictions/firewalls

    People will also post their own interpretations in response to comments, and quickly find out they missed something.

    … But if you try to automate it, like include a summary under every HN post, you encourage laziness too much and are pre-chewing too heavily. Some balance here.

    [on topic]

    (OK I’m done making excuses, time to read the article… thanks for the encouragement!)

    I thought this was not explained in the readme directly but in fact I missed it. I wasn’t going to read Microsoft entire changelog! But it was substantive, thanks to sibling commenter:

    “2025-09-05: VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have removed the VibeVoice-TTS code from this repository.”