Comment by b0a04gl
2 days ago
i gave it a 4 step research task with branching subtasks. told it upfront what the goal was. halfway through it forgot why it was doing step 2. asked it to summarise progress so far and it hallucinated a step i never mentioned. restarted from scratch with memory enabled. same thing. no state carryover. no grounding. if you don’t constantly babysit the thread and refeed everything, it breaks. persistent memory is surface-level. no real continuity. just isolated task runner. autonomy without continuity is not autonomy
Sounds pretty useless
must’ve taken years to refine that diagnostic toolkit. meanwhile the most are stuck tracing emergent behaviour in stochastic models, but yeah, glad you solved it in 3 words.