← Back to context

Comment by int_19h

7 months ago

With most Chinese models, you can run them locally.

You can then specifically prompt the model to do a CoT before answering (or refusing to answer) the question about e.g. Tiananmen. In my experiments, both QwQ and DeepSeek will exhibit awareness of the 1989 events in their CoT, but will specifically exclude it from their final answer on the basis that it is controversial and restricted in China.

It gets even funnier if you do multi-turn, and on the next turn, point out to the model that you can see its CoT, and therefore what it thought about Tiananmen. They are still finetuned into doing CoT regardless and just can't stop "thinking about the white elephant" while refusing to acknowledge it in more and more panicked ways.