Comment by zaptrem
10 hours ago
My point is you should consider creating truly undetectable audio end to end with AI to be effectively impossible for the foreseeable future (i.e., I would bet money it is still trivially detectable five years from now). It won't be detectable to humans, though, only models.
in the broad strokes of ai generated, i wouldnt be so sure.
if the ai picked a bunch of samples and combined them together and mastered using an mcp to a DAW, how is that particularly distinguishable vs a person doing the same thing badly?
i can see how the llm generation pictures of spectrograms is essy to spot, but much less so with tool following.
even worse of you using a vla to have it actually play the guitar and use the recording as a sample.
theres some time and setup to make it happen sure, but somebody put that all in a studio and expose an mcp
Agreed, that’s why I specified end to end (I.e., text to waveform)