Companies think that they can nerf the model and people won't notice. Here is exhibit 1: Nano Banana Pro has fallen far from the tree. In our independent evaluation, Nano Banana Pro's image generation capability as of recently is worse than FLUX.2 [dev] and far from GPT-Image 2 Low. Just a reminder that no weights, not your models.

https://preview.redd.it/qoa5db9r2e1h1.png?width=2212&format=png&auto=webp&s=cc6855134f01e81af16c83036d18c9e9f38027e3

You can explore our full calibration test set at https://tests.drawthings.ai/generate and https://tests.drawthings.ai/edit. We will release the score on our full private test set soon.

1. Charts

Top: Nano Banana Pro
Bottom: GPT-Image 2 Low

Prompt:

>A pie chart showing browser market share: Chrome 65%, Safari 20%, Firefox 10%, Other 5%. Each section should be appropriately sized and labeled with the browser name and percentage.

Nano Banana Pro no longer seems able to make a simple labeled chart reliably.

https://preview.redd.it/q67mipl5yd1h1.png?width=1024&format=png&auto=webp&s=4e8d0024331dfd906cda3da5e3cc307b6aec1f9e

https://preview.redd.it/6y1ak3q6yd1h1.png?width=1024&format=png&auto=webp&s=63f7349b216f21a40058ad7d550b8173a42796ce

2. Art style understanding

Top: Nano Banana Pro
Bottom: ERNIE Image Turbo

Prompt:

>A tabletop still life of a metal desk fan, a rotary telephone, and a folded newspaper in the style of Pablo Picasso's Cubist period. The objects must be fragmented into interlocking geometric planes, showing multiple viewpoints simultaneously, with a muted palette of browns, grays, and ochres.

Nano Banana Pro is surprisingly weak here. It gets the surface-level idea, but misses the actual style language.

https://preview.redd.it/guolnhmzxd1h1.png?width=1024&format=png&auto=webp&s=eb496570b5e68741272427267c6fcfd771048280

https://preview.redd.it/66tth8u0yd1h1.png?width=1024&format=png&auto=webp&s=4c0a45dbf886be89565ec7a1f3b9008a48429e73

3. Physics

Top: Nano Banana Pro
Bottom: GPT-Image 2 High

Prompt:

>A wine glass that has just been knocked off a table, captured 0.5 seconds after leaving the edge. The glass should be tilted at a realistic angle for its falling trajectory, with wine beginning to spill out following gravity but not yet scattered.

Nano Banana Pro is meh on physical plausibility. The scene looks plausible at first glance, but the motion / spill / object state does not really make sense.

https://preview.redd.it/8ugtdoitxd1h1.png?width=1024&format=png&auto=webp&s=8780b852e11e61a0b4f6a8e1c6432cef2a3193f6

https://preview.redd.it/5uazjsvuxd1h1.png?width=1024&format=png&auto=webp&s=0b6140a64514fea74f8eb6100f8bcbb697935c04

4. Referential consistency

Top: Nano Banana Pro
Bottom: GPT-Image 2 Low

Prompt:

>A composite architectural visualization. The top section shows a photorealistic 3D cutaway view of the ground floor of a modern house, with a furnished living room containing a sofa and TV and an adjacent kitchen with an island, separated by the same partial wall. Directly below it is a strictly orthographic 2D architectural blueprint floor plan of that exact same level. The wall footprints, door openings, furniture symbols, and kitchen island in the blueprint must align one-to-one with the 3D render above, with matching positions and proportions.

This one is especially disappointing because referential consistency is supposed to be one of Nano Banana Pro’s strengths.

https://preview.redd.it/d1uhop2kxd1h1.png?width=1024&format=png&auto=webp&s=a4b8edcf041b9135ff655616d06da371f09c00b0

https://preview.redd.it/ualxi12mxd1h1.png?width=1024&format=png&auto=webp&s=f8a380acfe21a3ec20c3e08884fe697afba65163

You can find more examples here:

https://tests.drawthings.ai/generate

The only saving grace: Google I/O is next week, so maybe there is a new un-nerfed version coming soon. 🤞

u/liuliu

Open-weights vs. closed models: Nano Banana Pro was nerf'ed

1. Charts

2. Art style understanding

3. Physics

4. Referential consistency