Comment by ericflo
6 days ago
It's not either/or. Generally you finetune when optimized many-shot still doesn't hit your desired quality bar. And it turns out with RL, things like system prompts matter a lot, so searching over prompts is a good idea even when reinforcing the desirable circuits.
I am not an expert in fine tuning, but in the company I work for our fine tuned model didn't do any noticeable difference.