Comment by redman25
16 days ago
I feel like the right response for those situations is to start asking questions of the user. It’s what a human would do if they did not understand.
16 days ago
I feel like the right response for those situations is to start asking questions of the user. It’s what a human would do if they did not understand.
I made the argument multiple times that the right answer to many prompts would be a question, and it was allowed under some rare circumstances, but far too few.
I suspect in part because the provider also didn't want to create an easy cop out for the people working on the fine-tuning part (a lot of my work was auditing and reviewing output, and there was indeed a lot of really sloppy work, up to and including cut and pasting output from other LLMs - we know, because on more than one occasion I caught people who had managed to include part of Claudes website footer in their answer...)
As in, participants would copy output from one LLM as a question to another?