Comment by mixtureoftakes
5 hours ago
please, sign up for a paid plan of either chatgpt or claude. gemini, while close, is still noticeably behind
you deserve opinions shaped by interactions with the best tools that are out there.
Gemini feels deep and philosophical. Especially for product management. Tell him you're a product manager and we're a team of two.
But regular reminder - All LLMs can be wrong all the time. I only work with LLMs in domains I'm expert in OR I have other sources to verify their output with utmost certainty.
Or when you don't care about results being very correct.
When I'm cooking meatballs with sauce and the recipe calls for frying them, I'll have an LLM guesstimate how long and which program to use in an air fryer to mimic the frying pan, based on a picture of balls in a Pyrex. So I can just move on with the sauce, instead of spending time browsing websites and stressing about getting it perfect.
I used to hate these non-deterministic instructions, now I treat them as their own game. When I publish my first recipe, I'll have an LLM randomize the ingredient amounts, round them up to some imprecise units and also randomize the times. Psychologists say we artists need to participate and I WILL participate.
> I only work with LLMs in domains I'm expert in
This. Should become a general rule for any non-trivial use of LLMs in a professional setting.
Agreed, Gemini is clearly a capable model, but the tool use is lagging behind the other two. Ironically it regularly gets things wrong (e.g. the current version of some software) because of an unwillingness to use web search.
Gemini is certainly not behind Claude in terms of physics.
ChatGPT and Gemini are actually fairly comparable.
Claude has been utterly useless with most math problems in my experience because, much like less capable students, it tends to get overly bogged down in tedious details before it gets to the big picture. That's great for programming, not so much for frontier math. If you're giving it little lemmas, then sure it's great, but otherwise you're just burning tokens.
Seriously, it’s not worth reaching for less intelligence. Use Extended Pro 100% of the time for things you’d spend the amount of time GP spent writing their post.