Comment by KoolKat23
9 hours ago
Reasoning density.
I have a specific use case (financial analysis) that is at the edge of what is possible with these models, accuracy-wise.
Gemini 2 was the beginning: you could see the technology could help with this specific analysis, but it made plenty of errors (not unlike a junior analyst). Gemini 2.5 Flash was great and actually usable; the errors it made were consistent.
This is where it gets interesting. I could add points to my system prompt, and yes, that would fix those errors, but it would degrade the answer elsewhere. Often the answer wouldn't be incorrect, merely much simpler, less nuanced and less clever.
This is where multi-agent setups helped: the prompt can be broken down so that answers remain "clever". There is a big con, though: it is slow, slow to the point that I chose to stick with a single prompt (the request didn't work well run in parallel, since the other prompt surfaced factors for it to consider).
However, Gemini 3 Flash is now smart enough that I'd consider my financial analysis solved. All with one prompt.
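For illustration, a minimal sketch of why the multi-agent version was slow, assuming a two-step split where the second prompt needs the factors surfaced by the first. The function names and prompt wording are hypothetical, and `fake_ask` is a stand-in for a real model call, not any Gemini API.

```python
from typing import Callable, List


def run_chain(ask: Callable[[str], str], report: str) -> str:
    """Two dependent prompts: the second cannot start until the first returns."""
    # Step 1: a narrow prompt that only surfaces factors worth considering.
    factors = ask(f"List the factors that matter for analysing this report:\n{report}")
    # Step 2: consumes step 1's output, so the two calls must run one after the other.
    return ask(
        "Analyse the report, taking these factors into account:\n"
        f"{factors}\n\nReport:\n{report}"
    )


# Stand-in for a real model call (assumption), used only to show the call order.
calls: List[str] = []


def fake_ask(prompt: str) -> str:
    calls.append(prompt)
    return f"reply {len(calls)}"


result = run_chain(fake_ask, "Q3 revenue up 4%, margins down.")
```

Because the second prompt embeds the first reply, total latency is roughly the sum of both calls, which is the trade-off against a single prompt.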