Comment by zamadatix
17 days ago
Is there a way to see/compare the shared results for all of the LLMs you've tested this prompt on in one place? The 2.0 Pro result seems decent, but I don't have a baseline to tell whether that's because it really is or because the other 2 are just "extremely bad" or something.
Search by tag: https://simonwillison.net/tags/pelican-riding-a-bicycle/