Comment by Casteil

1 month ago

Qwen3.5/3.6 are really prone to looping and 'overthinking'. Gemma4 doesn't seem to have the same problems.

Gemma also doesn't have the same 'agentic' capabilities of qwen3.6.

Simple test failed: sending "1","2","3" as separate messages using an openclaw harness.

I tested a few other "follow these instructions" tests. Qwen3.5/6 were able to follow along, gemma was not able to.