Comment by kensai
16 days ago
"Prompt "engineering" and working around your tools will be over in a year or so."
What do you mean by that? What is happening in just over a year or so?
I think Fable gave a bit of a sneak peek into the future.
My objective KPI: for the few days I was using Fable (18hr a day), it would frequently push back against my design ideas and propose alternatives -- and they almost always felt better to me. Back to Opus now, still 18hr days -- and I don't think it has disagreed with me meaningfully even once since Saturday. I consider myself an old hand -- and I think Fable really didn't need me to be very specific in my prompts; it would have done a good job regardless of, or even despite, my prompting.
Of course whether this is the future is anyone's guess. Maybe we will experience a butlerian jihad and there won't be any prompting whatsoever for completely different reasons :-)
Remember to go outside once in a while, my dude
Crunch time, not the norm.
The models are getting better at agentic coding, so over time using complicated harnesses and precise prompt engineering to attempt to squeeze out an extra X% performance will become irrelevant as the models approach expert-level performance. The bitter lesson in miniature.
There will always be a difference between a model's general capabilities and the particularities of your exact environment and requirements.
Closing this gap is done in the harness, through Skills, user behaviour/prompts, Agents.md, etc.
I think that this is an area worth investing time in, but it is indeed hard to know what the scope of this is.