Comment by itemize123

12 hours ago

You seem to be the only one here who has used it - how did it compare to Opus and GPT 5.5? In theory it should be at least on par, if not better at times, right?

I only had time to use it for a couple of deep reviews of large Rust projects and a few agentic coding tasks (implement plan X, refactor Y in fashion Z) before my quota ran out. My impression is that the reviews were quite strong - maybe Opus 4.8+ or around GPT 5.5 (for my particular use case) - but very slow. For implementation I found it weaker; it made a few mistakes that I haven't seen frontier models make in a long time.