Comment by KellyCriterion

3 hours ago

Question that I do not understand:

How should a local-run Chinese Model "phone home" if someone runs it locally on the hardware? I think Im missing some understanding here?

4 comments

KellyCriterion

monsieurbanana 2 hours ago

I don't think they do at the moment, but they could be trained subtly add backdoors to code or make "phone home" api calls during dev time, triggering on certain conditions ("is user employee of xyz")

rjsw 3 hours ago

I think the fear is that it might insert some "phone home" routine into the source code that it generates.

Jtarii 2 hours ago
Has anyone demonstrated that this type of attack is even possible? Also the moment anyone detects this attack it will nuke deepseek/other chinese AI labs reputation completely, it is the most high risk low reward attack ever.
- fragmede 2 hours ago
  
  Yes.
  https://arxiv.org/abs/2401.05566
  In that paper, if it LLM was told it was 2023, then the code it generated was fine. If the prompt included the fact that it was 2024, then it intentionally wrote exploitable code.