Comment by tripplyons

1 year ago

How do you know it didn't use a code interpreter if they don't share the chain-of-thought?

When Code Interpeter is used on ChatGPT OpenAI make it very clear that it is being used through UI hints.

I really hope they don't ever change that UI pattern, this stuff is hard enough to understand already.

If you really want to test this, you can take advantage of the fact that Code Interpeter runs in a persistent sandbox VM. Tell the o3-mini prompt to save a file, then switch to GPT-4o (which can use Code Interpreter for real) and have it run Python code to show if that file exists or not.