Comment by mercat

1 month ago

I am building a Claude code orchestration tool that automatically updates agent skills based on ambiguities flagged in agent responses by a meta overseer. The overseer reasons on top of the worker agents’ reasoning. When that happens, it sends me async requests and early on there are a lot of them. Based on my input, the system updates skills or splits them into smaller ones for tighter context control. This sets up a HITL driven training loop that constantly refreshes skills and adapts the agent swarm to new tasks.

I think of it as recursive transfer of human expertise into skills.

I know this is vague and probably raises more questions than it answers. I will share more once it reaches a semi autonomous working state.

0 comments

mercat

No comments yet

Contribute on Hacker News ↗