Comment by AndyNemmity

1 month ago

I don't understand the use case. I think if you don't use agents, and skills currently effectively, then perhaps this is useful.

If you're using them though, we no longer have the problem of Claude forgetting things.

That makes sense, and I agree that for a single agent using skills well, Claude’s native context handling has gotten much better.

This wasn't mentioned in the first post, but the use case we’re focused on isn’t really “Claude forgetting,” but context living beyond a single agent or tool. Even if Claude remembers well within a session, that context is still owned by that agent instance.

The friction shows up when you switch tools or models (Claude → Codex / Cursor / etc.), run multiple agents in parallel, or want context created in one place to be reused elsewhere without re-establishing it.

In those cases, the problem isn’t forgetting so much as fragmentation. If someone is happy with one agent and one tool, there are probably a bunch of memory solutions to choose from. The value of this external memory network that you can plug into any model or agent shows up once context needs to move across tools and people.

I'm curious how those replace this? I've barely used either, and would love to hear more.

  • Okay, Claude.md is an md file with instructions.

    Agents are an md file with instructions.

    Skills are an md file with instructions.

    Commands are.. you get the point.

    We're just dealing with instructions. Claude.md is handled by Claude Code. It is forgotten almost entirely often when the context fills.

    Okay, what is an agent? An agent is basically a Claude.md file, but you make it extremely granular. So it only has instructions of let's say, Typescript.

    We're all just doing context management here. We're trying to make sure our instructions that matter stay.

    To do that, we have to remove all other instructions from the picture.

    When you're doing typescript, you only know type script things.

    Okay, what's a skill? A skill is doing a single thing with type script. Why? So that the context is even smaller.

    Instead of the agent having every single instruction you need about typescript, you put them in skills so they only get put into context when that thing is needed.

    But skills are also where you connect deterministic programs. For example, I have a skill for creating images in nano banana.

    So when the Typescript Agent needs to create an image, it calls the skill, that calls the python script, to create images in nano banana.

    We're managing all the context to only be available when it's needed, keeping all other instructions out.

    Does that help?

    • A little. Thanks for the detailed reply. But I wouldn't expect a smaller, more targeted Claude.md would make a big difference. My impression is that we're filling up the context less with Claude.md than with the session work: back and forth about the specific task, Claude's chain-of-thought as it does that, the relevant sections of code, etc. That's what this is trying to compact and reference, iiuc.

      What I'd think would help, would be doing things in smaller chunks: an agent to do this small subtask, another to do that small subtask, and the parent task context only grows with the sub-agents reporting back, not with their chain-of-thought.

      Could be I should have a much bigger, more detailed Claude.md. Mine tend to be small project overviews and a list of TODOs, not that different from README.md: https://github.com/lkbm/sideways_math/blob/main/CLAUDE.md

      1 reply →