Comment by mlmonkey

20 hours ago

> We were cautious to only run after each model’s training cutoff dates for the LLM models

Grok is constantly being retrained and/or has internal access to web search.

You cannot backtest LLMs. You can only "live" test them going forward.

Via the API you can turn off the internal web search. We provided all the models with their own custom tools, which only returned data up to the date of the backtest.
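To make the date-capped tools idea concrete, here is a minimal sketch in Python. It is not the actual code from the backtest: `PriceBar`, `DateCappedPriceTool`, `get_history`, and the sample prices are all hypothetical names and made-up values, used only to illustrate clamping every request to the backtest date.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class PriceBar:
    day: date
    close: float


class DateCappedPriceTool:
    """Hypothetical tool: serves only bars dated on or before the backtest date."""

    def __init__(self, bars, as_of):
        self._bars = sorted(bars, key=lambda b: b.day)
        self._as_of = as_of

    def get_history(self, start, end):
        # Clamp the requested end date so a caller can never read past as_of,
        # no matter what range the model asks for.
        capped_end = min(end, self._as_of)
        return [b for b in self._bars if start <= b.day <= capped_end]


# Made-up sample data, for illustration only.
bars = [
    PriceBar(date(2024, 1, 2), 100.0),
    PriceBar(date(2024, 1, 3), 101.5),
    PriceBar(date(2024, 1, 4), 99.8),
]

# Simulated backtest date: 2024-01-03. The model asks for the whole year.
tool = DateCappedPriceTool(bars, as_of=date(2024, 1, 3))
visible = tool.get_history(date(2024, 1, 1), date(2024, 12, 31))
```

Here `visible` contains only the bars for 2 and 3 January; the 4 January bar is filtered out even though the requested range covers it. A cap like this only limits what the tool returns. It does not change what a model may already know from its training data.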