Comment by mercat
1 month ago
I am building a Claude code orchestration tool that automatically updates agent skills based on ambiguities flagged in agent responses by a meta overseer. The overseer reasons on top of the worker agents’ reasoning. When that happens, it sends me async requests and early on there are a lot of them. Based on my input, the system updates skills or splits them into smaller ones for tighter context control. This sets up a HITL driven training loop that constantly refreshes skills and adapts the agent swarm to new tasks.
I think of it as recursive transfer of human expertise into skills.
I know this is vague and probably raises more questions than it answers. I will share more once it reaches a semi autonomous working state.
No comments yet
Contribute on Hacker News ↗