A perusal of the source code of, say, Ollama -- or the agentic harnesses of Crush / OpenCode -- will convince you that yes, this should be an extremely a simple feature (management of contexts are part and parcel).
Also, these companies have the most advanced agentic coding systems on the planet. It should be able to fucking implement tree-like chat ...
If the client supports chat history, that you can resume a conversation, it has everything required, and it's literally just a chat history organization problem, at that point.
A perusal of the source code of, say, Ollama -- or the agentic harnesses of Crush / OpenCode -- will convince you that yes, this should be an extremely a simple feature (management of contexts are part and parcel).
Also, these companies have the most advanced agentic coding systems on the planet. It should be able to fucking implement tree-like chat ...
LM Studio has this feature for local models and it works just fine.
If the client supports chat history, that you can resume a conversation, it has everything required, and it's literally just a chat history organization problem, at that point.