Comment by swyx

8 months ago

i think there's only one llm backbone for voice. it's 4o.

No, there are four: the original voice mode (4o + TTS), the original AVM (4o with native audio-to-audio), the new AVM announced in this post, and the mini AVM (4o-mini with native audio-to-audio).