Comment by ddjohnson

1 year ago

One of the blog post authors here! We evaluated o3 through the API, where the model does not have access to any specific built-in tools (although it does have the capability to use tools, and allows you to provide your own tools). This is different than when using o3 through the ChatGPT UI, where it does have a built-in tool to run code.

(Interestingly, even in the ChatGPT UI the o3 model will sometimes state that it ran code on its personal MacBook Pro M2! https://x.com/TransluceAI/status/1912617941725847841)

2 comments

ddjohnson

TobiWestside 1 year ago

I see, thanks for the clarification!

jlaternman 1 year ago

Just throwing this out there. Is it possible that in some way, it does have a MacBook Pro M2? For example, that the tools the ChatGPT UI have are exposed to it through access to one, which is can run whatever tools it wants through? That might actually be quite a sensible way to expose a “tools” UI to an LLM. If what it’s saying is technically accurate on the “flagship product” (ChatGPT), it could be the API version is simply confused about its differences (no access to tools).