Comment by Legend2440

2 months ago

Isn't this just subagents? You call another LLM to go read a file and extract some piece of information or whatever, so that you don't clutter up the main context with the whole file.

Neat idea, but not a new idea.

11 comments

Legend2440

adagradschool 2 months ago

Yes! Contrary to the anthropomorphized subagents, I view them as ways of managing context primarily. I'm exploring this idea in Scope[0] to have observable subagents that recursively break down the task to avoid having to compact. One thing I haven't been able to figure out is how to evaluate/improve this planning step. I am using markdown files to encode heuristics for planning but it feels too unstructured for me to measure. Would love it if someone pointed me to some existing literature/projects around this idea!

[0] https://github.com/adagradschool/scope

schmuhblaster 2 months ago

Hi, I stumbled on this article in my twitter feed and posted it because I found it to be very practical, despite the somewhat misleading title. (and I also don't like encoding agent logic in .md files). For my side project I am experimenting with describing agents / agentic workflows in a Prolog-based DML [1]
[1] https://www.deepclause.ai

seeknotfind 2 months ago

Yeah, from the title, it sounds like perhaps the entire operation is differentiable and therefore trainable as a whole model and that such training is done. However, upon close inspection, I can't find any evidence that more is done than calling the model repeatedly.

AlexCoventry 2 months ago
No, there's no training going on, here, as far as I can tell. E.g., they use GPT-5 as their base model. Also, AFAICT from a quick skim/search there's no mention of loss functions or derivatives, FWIW.
- alextheparrot 2 months ago
  
  The derivative being a grad(ient) student sampling scaffolds against evals + qualitative observations: most prompt-based llm papers

wiesbadener 2 months ago

They state:

> RLMs are not agents, nor are they just summarization. The idea of multiple LM calls in a single system is not new — in a broad sense, this is what most agentic scaffolds do. The closest idea we’ve seen in the wild is the ROMA agent that decomposes a problem and runs multiple sub-agents to solve each problem. Another common example is code assistants like Cursor and Claude Code that either summarize or prune context histories as they get longer and longer. These approaches generally view multiple LM calls as decomposition from the perspective of a task or problem. We retain the view that LM calls can be decomposed by the context, and the choice of decomposition should purely be the choice of an LM.

nostrebored 2 months ago

lol this is literally one of the only reason competent people are using subagents. it is literally
@summarizable(recursive=True)
def long_running_task(Subagent)
on my long horizon tasks, where the hierarchy is determined at agent execution time…

lelanthran 2 months ago

Unless that subagebt you call can call subagents itself which can call subagents themselves, ad infinitum, it's not recursive.

songodongo 2 months ago
The paper says they used a recursive depth of 1. Does that mean subagents or sub-subagents?
- johnnyfived 2 months ago
  
  A recursive depth of 1? So it's just subagents..? How exactly can this be described as recursive then?

daralthus 2 months ago

sub-agents that access (and manipulate) the SAME context (a file system or variables in the REPL)