Comment by robbie-c 2 months ago Probably not, see https://www.anthropic.com/research/reasoning-models-dont-say... 1 comment robbie-c Reply kevinventullo 2 months ago Would be interesting to apply Interpretability techniques in order to understand how the model really reasons about it.
kevinventullo 2 months ago Would be interesting to apply Interpretability techniques in order to understand how the model really reasons about it.
Would be interesting to apply Interpretability techniques in order to understand how the model really reasons about it.