Comment by minimaxir

20 hours ago

I just pushed gemimg 0.3.2 which adds image_size support for Nano Banana Pro, and I ran a few tests on some of the images in the blog. In my testing, Nano Banana Pro correctly handled most of the image generation errors noted in my blog post: https://x.com/minimaxir/status/1991580127587921971

- Fibonacci magnets: code is correctly indented and the syntax highlighting atleast tries giving variables, numbers, and keywords different colors.

- Make me a Studio Ghibli: actually does style transfer correctly, and does it better than ChatGPT ever did.

- Rendering a webpage from HTML: near-perfect recreation of the HTML, including text layout and element sizing.

That said, there may be regressions where even with prompt engineering, the generated images which are more photorealistic look too good and land back into the uncanny valley. I haven't decided if I'm going to write a follow up blog post yet.

The system prompt hacking trick doesn't work with Nano Banana Pro unfortunately.