← Back to context

Comment by AlexCoventry

2 days ago

Can you give an example? Have you tried it recently with the higher-end models?

I have asked chatgpt reasoning model to solve chess endgames where white had king and a queen vs king and a rook on a 7x8 chessboards. So to compute value for all positions and find the position which is the longest win for white.

Not creative, not novel and not difficult algorithmic task. But it requires some reasoning, planning and precision.

It failed miserably.

My favorite example is implementing NEAT with keras dense layers instead of graphs. Last time I tried with claude 3.7, it wrote code to mutate the output layer (??). I tried to prevent that a few times and gave up.