Comment by famouswaffles
1 year ago
gpt-3.5-turbo-instruct had something like 5 (or fewer) illegal moves in 8205
https://github.com/adamkarvonen/chess_gpt_eval
I expect the other models to be much worse, if GPT-4's performance is any indication.
And the most notable part of that:
> Most of gpt-4's losses were due to illegal moves
3.5-turbo-instruct definitely has some better chess skills.
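For scale, here is a rough sketch of what the "5 in 8205" figure above works out to as a rate. It assumes 8205 is the total number of moves the model played and takes 5 as the upper bound on illegal ones; the function name is just illustrative and not something from the linked repo.

```python
def illegal_move_rate(illegal: int, total: int) -> float:
    """Return the fraction of moves that were illegal."""
    if total <= 0:
        raise ValueError("total must be positive")
    return illegal / total


# Figures quoted in the comment: at most 5 illegal moves,
# out of 8205 (assumed here to be total moves played).
rate = illegal_move_rate(5, 8205)
print(f"{rate:.4%}")  # 0.0609%
```

So that is at most roughly one illegal move per 1641 moves.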