Comment by TheAceOfHearts

14 hours ago

Someone should setup a plugin or something for Claude Code that makes it easy to log all inputs and outputs for people who are willing and interested in sharing their usage. I don't want Anthropic to be the only company that can train on my usage, I want to share my usage so it can be used for training all new models.

Once you have a system for collecting all logs, you just need a place where they can be submitted. Ideally it would be a freely licensed dataset that is publicly available for everyone.

Has anyone built this yet?

Discussed building it with my friends, obviously you might share secrets and other real reasons, but if gangs of corporations are already doing it, I don't see why we shouldn't just share it amongst the crowd too.

  • Yeah I could see it being a problem if you're doing work on closed source or repos with sensitive credentials. Since my usage has all been on open source projects I'd be happy to share everything I'm doing if it can help train better models.

Yikes, no thank you

  • Do you have a substantive reason why you dislike this? What is the problem if it's opt-in? Nobody is forcing you to share your usage if you don't want to.

    I'd prefer it if all the model builders could train on my usage rather than being limited to a single company. That'll hopefully help make all the models better in the long-term.

    • Very substantive - that data can be highly sensitive, and I don’t trust all model companies