I in contrast ChatGPT Photos 2.0 and Gemini Nano Banana, and one simply wins


Ask the typical individual what they use AI for, they usually’ll in all probability rattle off the same old suspects: drafting emails, writing up fast LinkedIn posts, summarizing assembly notes, possibly debugging a line of code, and producing photographs. From creating photographs of you hugging your previous self to turning your self right into a Pixar character to designing complete product mock-ups and advertising and marketing belongings, AI picture technology is actually now not a celebration trick.

Google’s been main this area with its Gemini Nano Banana mannequin for a very good bit now, however since OpenAI dropped ChatGPT Photos 2.0 on the twenty first of April, Google has some severe competitors. I have been utilizing Nano Banana because it launched and have seen it develop into what it’s as we speak. I have been testing ChatGPT Photos 2.0 because the day it launched and, in fact, evaluating it to Nano Banana at each flip. The outcomes genuinely shocked me.

So, what are these two fashions?

A fast refresher earlier than we get into the enjoyable stuff

Gemini in LibreChat on desktop, lamp in view

I do know lots of people who simply open the respective device, immediate it to generate no matter picture they want, and by no means actually take into consideration what’s occurring underneath the hood. So, I assumed I would start with a fast breakdown of what every mannequin truly is and what makes them completely different. Google introduced their first picture mannequin powered by Gemini referred to as Nano Banana again in August 2025, constructed on the Gemini 2.5 Flash structure. It went viral virtually instantly. The quirky title caught, individuals have been cracking jokes in regards to the banana brand, and it rapidly grew to become the go-to for AI picture technology and modifying.

Then in November, they launched Nano Banana Professional, providing superior intelligence and studio-quality inventive management. And in February 2026, Google launched Nano Banana 2, which mixes the superior options of Nano Banana Professional with the pace of Gemini Flash fashions. Nano Banana 2 can pull from Gemini’s real-world data base, powered by real-time info and pictures from internet search to extra precisely render particular topics. It may well generate correct, legible textual content for advertising and marketing mock-ups or greeting playing cards, and even translate and localize textual content inside a picture. It helps as much as true 4K decision as a part of the usual providing, and it is lots higher at following directions in comparison with the earlier fashions. The mannequin is presently the default picture technology expertise throughout Google’s merchandise.

ChatGPT Photos 2.0 is a reasonably new launch, and was introduced throughout the identical week as GPT-5.5’s launch. It is OpenAI’s first picture mannequin with native considering capabilities, which means it is able to truly planning, looking out the online, and checking its personal outputs earlier than finalizing a picture. It runs in two modes: Instantaneous and Pondering. The previous is free for everybody, whereas the latter is reserved for paid ChatGPT subscribers. Together with the considering capabilities, the mannequin can deal with textual content rendering throughout languages like Japanese, Korean, Hindi, and Bengali with near-perfect accuracy and helps as much as 2K decision.

It may well generate as much as 10 photographs from a single immediate. The brand new mannequin has a extra “up-to-date understanding” and a data cutoff of December 2025. Sam Altman described the mannequin as “going from GPT-3 to GPT-5” , which is a reasonably daring declare to make. That mentioned, ChatGPT’s preliminary picture technology mannequin is one thing that I (and a whole lot of different individuals) discovered fairly underwhelming. I would mainly by no means attain for it over Nano Banana. So, the truth that Photos 2.0 has genuinely pulled me again says lots about how massive of a leap that is.

Each the fashions have completely different picture types

You possibly can spot which mannequin made straight away

composite portrait showing a girl at age 5 in 2005 and age 21 in 2026 with a birthday cake labeled 21

Each LLM has considerably of its personal character. For example, I discover that Claude fashions are much more conversational and ChatGPT fashions really feel extra assured and structured. You possibly can inform a distinction even once you give them the identical immediate. The identical applies to their picture fashions. Give ChatGPT Photos 2.0 and Nano Banana 2 the very same immediate, and you will get two noticeably different-looking photographs. This is not simply due to the information it is educated on or due to the mannequin’s underlying structure. It is as a result of every mannequin has a default aesthetic they simply appear to gravitate towards.

In my testing, I’ve discovered that ChatGPT Photos 2.0 finally ends up producing extra grounded and naturalistic outputs. The outputs appear like actual images which were professionally edited. The lighting feels a bit imperfect in a great way, textures have a variation, and the picture simply appears to be like very polished in all the fitting methods. Nano Banana 2, alternatively, leans tougher into vibrant, saturated, eye-catching visuals. The colours are deeper, the distinction is punchier, and all the pieces tends to really feel extra stylized. However they do not really feel very sensible.

This clearly is not simply my opinion both. For example, Reddit user u/Inevitable_Gur_461 posted a GPT-Picture 2 vs Nano Banana 2 comparability on the r/ChatGPT subreddit. He used a reasonably in-depth immediate the place he needed to generate a black and white classic wedding ceremony images from the Fifties. He generated 2 photographs from ChatGPT Photos 2.0, whereas the final picture he generated was from Nano Banana 2. I may’ve recognized the Nano Banana 2 picture and not using a double look or needing to see the feedback โ€” it simply felt very… Nano Banana-ey. It simply has a sure AI look to it!

For example, here is an instance I ran myself. There was this Instagram development occurring the place you’d give picture fashions images of youthful and present you, after which ask it to generate a picture of each variations of you sitting collectively. I gave each fashions the identical immediate, the identical reference images, and requested for a similar mushy, cinematic, studio-style look.

side by side photos of a woman at age 21 in 2005 and age 21 in 2026 with a birthday cake

Whereas I admittedly wasn’t the most important fan of ChatGPT’s outcome (which is extra so due to the best way my very own photographs turned out), Nano Banana 2’s outcome simply felt very blatantly overdone. It had that telltale over-smoothed pores and skin, barely too-perfect lighting, and a normal “AI sheen” that made it apparent at first look. It felt extra akin to an expert photoshoot, which wasn’t the vibe I had requested for in any respect.

That mentioned, I am not saying one is healthier than the opposite. It comes down to non-public choice and, extra importantly, what you are making an attempt to create. Should you want one thing that appears prefer it was pulled from an actual digicam roll, I would suggest ChatGPT Photos 2.0. If you’d like one thing that is instantly eye-catching, say for a social media publish, Nano Banana 2’s type is what you want.

ChatGPT’s actual benefit is not simply extra sensible photographs

It is significantly better

Whereas extra natural-looking photographs is actually one thing you will discover straight away, it is not actually what retains me reaching for ChatGPT Photos 2.0 over Nano Banana 2. The true benefit, in my eyes, is context. ChatGPT Photos 2.0 is lots higher than Gemini at remembering precisely what you are engaged on. For example, I’ve this trademark hamster sticker I have been utilizing on messaging apps (together with Slack) that I ship to everybody at any given second. If I am freaking out, I will ship it. If I am pleased, I will ship it. If I am in tears, you realize what I am sending. I as soon as determined, why not go forward and convert the sticker to a Google Meet background?

From there on, I have been always producing variations of the sticker related to the state of affairs I am in. A hamster (or the hamsters) crying, indignant over one thing, cramming for an examination, and even celebrating my birthday by blowing candles and carrying a cap. The hamster sticker is mainly alleged to characterize… me. I began this custom off with Nano Banana 2 (earlier than GPT Photos 2.0 launched) and whereas the outcomes have been all the time spectacular (they do not should be “sensible”), I would have to connect the reference picture once more, re-describe the character, and virtually begin the dialog over each few messages. If I simply gave it the directions by describing what I needed (even when I referred to the pic), it might both generate one thing utterly off or simply default to a generic hamster that regarded nothing like my unique sticker. The context simply does not stick. With ChatGPT Photos 2.0 although, I simply dropped the unique reference picture as soon as, and I’ve merely been telling it what to do from there.

So, for example, I requested the mannequin to maneuver all of the hamsters to a college and present that they are learning. I did not embody the reference picture, or any extra particulars. Simply the immediate, and that is it. I then requested it to make it appear like all of the hamsters have been begging and saying โ€œpls????โ€ as a result of I needed to ship it to my editor. At one level, somebody referred to as the hamsters mice, so naturally I had ChatGPT generate a whole indignant protest scene the place the hamsters have been screaming โ€œWE ARE NOT MICE!!!โ€ by means of tears. The purpose is, I saved constructing on the identical working joke with no need to repeatedly clarify the characters, their vibe, or their look. ChatGPT Photos 2.0 remembered the hamster universe surprisingly effectively. The hamsters nonetheless regarded like my hamsters and have been in the identical scene within the greater image, even because the eventualities grew to become progressively extra unhinged.

One other instance is the one I touched on above โ€” the youthful and older development. I dropped a screenshot of the Instagram Reel I noticed about this to provide Nano Banan 2 some reference, and instructed it to make the output just like it. As a substitute of utilizing the screenshot as stylistic inspiration, the mannequin simply gave me the identical picture barely edited.

side by side portrait comparison of a child in 2005 and adult in 2024 with birthday cake

Identical outfits, similar state of affairs, similar individuals within the unique picture with barely completely different faces. It utterly modified how the older lady regarded. Gemini gave her curls, which I discover humorous as a result of I haven’t got curls, which means the mannequin was clearly not trying to duplicate me!

ChatGPT makes modifying photographs ridiculously easy

The half Nano Banana 2 actually must make amends for

As I simply talked about, I’ve discovered that Gemini’s Nano Banana 2 mannequin is not one of the best at retaining context and I discover that I have to always clarify the identical factor time and again and re-upload reference photographs. So, you may think about what it is like refining a picture the mannequin produced. There does not appear to be an easy approach to simply say “change this one factor” and have it truly work.

Most of the time, you will discover that it is advisable obtain the picture, add it, after which request your modifications. ChatGPT Photos 2.0, alternatively, makes this entire course of really feel easy. You click on on a generated picture, and also you get two choices: you may both describe your edit straight within the dialog panel, or use a range device to spotlight a selected a part of the picture after which describe what you need modified. The mannequin holds onto all the pieces else, and solely touches what you requested it to. This may sound minor, however it makes a large distinction.

ChatGPT Photos 2.0 wins this spherical truthful and sq.

Whereas I did not actually count on I would ever be saying this, ChatGPT’s newest picture mannequin positively wins this spherical. It is a terrific mannequin, produces scarily spectacular photographs, and takes time to suppose by means of the picture and develop it (whereas Gemini appears to be in a rush all the time).

That mentioned, I/O 2026 is true across the nook, and a brand new mannequin is predicted. Google I/O kicks off on Might nineteenth, and a number of shops are speculating that Nano Banana may get a major replace alongside what’s anticipated to be a significant Gemini mannequin announcement. So whereas ChatGPT Photos 2.0 has the sting proper now, I would not rely Google out simply but.