Comment by BenGosub
5 days ago
Certainly. In those cases you need to be clever and design an evaluation framework that grades on soft criteria, or maybe uses user feedback. Still, a good train-test dataset should be built up over time, and using DSPy will bring improvements even in those cases.
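As a rough illustration of grading on soft criteria, here is a minimal sketch in plain Python. It is not DSPy's API: the function name `soft_score`, the two criteria (term coverage and conciseness), and the 0.7/0.3 weights are all assumptions chosen for the example.

```python
def soft_score(answer: str, required_terms: list[str], max_words: int = 50) -> float:
    """Grade an answer on soft criteria, returning a score in [0, 1].

    Hypothetical rubric: the criteria and weights are illustrative only.
    """
    text = answer.lower()

    # Criterion 1: fraction of the required terms that the answer mentions.
    if required_terms:
        hits = sum(term.lower() in text for term in required_terms)
        coverage = hits / len(required_terms)
    else:
        coverage = 1.0

    # Criterion 2: conciseness. Full credit within the word budget,
    # decaying proportionally once the answer runs over it.
    n_words = len(answer.split())
    concision = 1.0 if n_words <= max_words else max_words / n_words

    # Weighted blend of the two criteria (weights are arbitrary).
    return 0.7 * coverage + 0.3 * concision
```

A score like this gives partial credit instead of a pass/fail match, so it can be averaged over a held-out set as that set grows.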