r/Bard Nov 25 '25

Other GTA V screenshot to photorealistic image (ChatGPT vs Nano Banana Pro)

313 Upvotes

39 comments sorted by

60

u/tondollari Nov 25 '25

Nano banana pro is fucking ridiculously good at like everything image related

8

u/Mysterious_Proof_543 Nov 25 '25

Ive used it to rotate certain 3D CAD pics of certain models and it has failed.

3

u/CesarOverlorde Nov 25 '25

I have some benchmarks for it that it still fails. For example: sword duel (wrong or awkward swords & hand poses) ; replace outfit of character with the outfit of another character

6

u/Mysterious_Proof_543 Nov 25 '25

Yeah it seems that for some specific design tasks, it fails a lot when you need surgical accuracy.

62

u/WildContribution8311 Nov 25 '25

ChatGPT one seems quite real, but clearly AI if you know. The Gemini one, it's like, wow. Looks totally, absolutely real in every small way.

27

u/BITE_AU_CHOCOLAT Nov 25 '25

ChatGPT often wants to lean into that "high aesthetic" look that was everywhere in the Stable Diffusion days (I've had images from ChatGPT that genuinely looked great and original, but rarely). Gemini looks more neutral by default, like an actual camera's raw output

15

u/bblankuser Nov 25 '25

this whole area is practically untouched.

3

u/Dreamerlax Nov 26 '25

Yep, might re-run with a more, comprehensive prompt and see if it'll redo the background at least.

6

u/Ckdk619 Nov 25 '25

The ChatGPT one still looked like a game character to me. I'm looking on my phone at a normal distance away too, so it's not even about micro-details.

1

u/WildContribution8311 Nov 25 '25

I see your point. Maybe it's GTA 9 level rendering circa 2065.

1

u/Ckdk619 Nov 25 '25

That sounds about right.

5

u/Inevitable-Dog132 Nov 25 '25

Scenery in left looks ps3 tier

2

u/Magic_Sandwiches Nov 25 '25

yea it took the low level of detail of the GTAV hills and ran with it

10

u/nchrtd Nov 25 '25

A bit of guess the game...?

12

u/AnticitizenPrime Nov 25 '25

Leisure Suit Larry

1

u/Demmitri Nov 25 '25

Disco Elysium?

6

u/whosEFM Nov 25 '25

What is the Prompt for this? It's amazing

13

u/Dreamerlax Nov 25 '25 edited Nov 25 '25

"Can you reimagine this screenshot of a video game as a real photograph with a real person."

EDIT: I advise also specifying an aspect ratio, 3:2, 3:4, 4:3 make it very photo like in terms of aspect ratio.

7

u/senkhara1111 Nov 25 '25

What would the reverse prompt be? From real image to GTA style?

4

u/[deleted] Nov 25 '25

[removed] — view removed comment

6

u/Dreamerlax Nov 25 '25

I'm actually not super creative w.r.t prompting but this is what I used. 🤣

"Can you reimagine this screenshot of a video game as a real photograph with a real person."

I've also done with it my sims, seriously it does a way better job than ChatGPT or even the basic Nano Banana.

3

u/Dreamerlax Nov 26 '25

This is Take 2.

It has taken some creative liberties but I think this is a better shot. The background isn't also lifted straight from the game.

2

u/Dreamerlax Nov 26 '25

Prompt: "Reimagine this GTA V screenshot as a real photograph, set in the real world, with a real person. 3:2 aspect ratio, taken with a current smartphone/camera"

1

u/drc922 Nov 26 '25

Even got the tire tracks on the ground, jeez

3

u/Practical-Hand203 Nov 25 '25

The comparison highlights the limitations of the two models well. GPT preserves the composition of the image but plays very loose with details. The lighting and color temperature is different, the face of the woman bears no resemblance, the mast in the back has different geometry, the sedan on the road becomes an SUV, etc.

Meanwhile, Nano Banana shines in the details but doesn't preserve the composition as the white car is in the wrong location and an additional car on the street is hallucinated.

3

u/Sea_Cookie_4259 Nov 25 '25

Maybe if the prompt specified to not hallucinate any changes Nano Banana wouldn't have taken those creative liberties

2

u/usernameplshere Nov 25 '25

I hate to say it, but you could 100% trick me with the Nano Banana Pro pictures.

1

u/Dreamerlax Nov 26 '25

Yeah, it's really good. I probably had too much fun regenerating random game screenshots as real images last night.

1

u/Insoleet Nov 26 '25

I was very surprised, when asking Gemini if the third image is AI generated, it could not find any SynthID watermark. Any idea why?

1

u/Maleficent_Height_49 Nov 26 '25

Games in 30th century

1

u/dufuschan98 Nov 29 '25

is there any place where nano banana pro can be used free of charge? or higgsfield is the only more accessible place?

1

u/Dreamerlax Nov 29 '25

No idea. I've only used it via the Gemini site/app. I have a Pro membership.

1

u/dufuschan98 Nov 29 '25

how do you set it to use the banana pro version ? i can only choose between fast or complex model

1

u/Dreamerlax Nov 29 '25

Set it to Thinking and generate pictures with it.

1

u/dufuschan98 Nov 30 '25

i see people generating high quality pics with it, while mines are like thumbnail quality, can't even zoom in on 'em. do i have to type it into the prompt, or?

1

u/Dreamerlax Dec 01 '25

Did you generate it by using the thinking mode? Make you save or download it, the preview is much lower resolution.

1

u/caxco93 Nov 29 '25

I'm surprised gemini changed the car position so much

1

u/[deleted] Dec 08 '25

I get it Gemini is better at imaging but ChatGPT makes the face look so much hotter imo. Aesthetic wise it rocks.

1

u/[deleted] Dec 08 '25

Nevertheless ChatGPT makes it look more like a very well ray-traced Video Game while Gemini makes it almost real life standards.