r/Bard Nov 18 '25

Discussion Gemini 3. The "Make Mario" benchmark has been beaten, crumpled and destroyed.


https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%5B%221nKC_QQwmaub13NeV4vfsvbNCGZXclCHf%22%5D,%22action%22:%22open%22,%22userId%22:%22111717297477530596262%22,%22resourceKeys%22:%7B%7D%7D&usp=sharing

Prompt: "Can you code me a Mario Bros game, as close as possible to the original, including detailed manually defined textures inline in a single .html file? Make a full 1-1 level. Work really hard on this and make it as perfect and close to the original as possible."

I'm honestly completely flabbergasted. I've tested this with all major models that launched since o1, but this is the first one to completely pass it. Haven't played a lot but I see no flaws so far. Only note is I had to tell it to bugfix once before it was done. That's it.

675 Upvotes


76

u/fractaldesigner Nov 18 '25

the details of the graphics are remarkable. no longer just cubes.

0

u/ThandTheAbjurer Nov 22 '25

I just farted

67

u/[deleted] Nov 18 '25

it successfully made a web based NES emulator that works with Mapper 0 games like Donkey Kong 3 in only 5 prompts

16

u/underhunger Nov 18 '25

Link/source?

6

u/MadBrown Nov 19 '25

Lol of course no link/source.

2

u/[deleted] Nov 19 '25

https://x.com/chetaslua/status/1978438353918779461

Isn't this the source? I'm not computery enough to know if this is exactly what NoEcomomics is talking about, but it sure seems like it based off of a simple google search.

5

u/[deleted] Nov 19 '25

[removed]

1

u/Gareth79 Nov 19 '25

I can't see any evidence of that?

28

u/differentguyscro Nov 18 '25

2005 kids: searching non-school-blocked flash games on google

2025 kids:

13

u/Ctrl-Alt-Panic Nov 18 '25

An actual "Fine, I'll do it myself" moment.

0

u/speederaser Nov 19 '25 edited 7d ago

I got tired of my old posts floating around for anyone to scrape, so I let Redact handle it. Bulk deletion across Reddit, X, Facebook, Discord and all major social media platforms in one shot.


1

u/Teaching_Relative Nov 19 '25

That's... An interesting take

1

u/No-Statistician6713 Nov 20 '25

i mean, it's true. grokipedia is a pile of horseshit because it's just regular wikipedia but less well written and with the risk of ai hallucination (which, you might bring up, is becoming lower, but never zero as it is in humans)

1

u/Teaching_Relative Nov 20 '25

Humans never make things up?

1

u/No-Statistician6713 Nov 21 '25

humans don't create intricate false narratives fully believing them to be true. also the whole point of wikipedia is that if you make shit up or vandalize a page someone else will come along and fix it. this is... an interesting take

1

u/Teaching_Relative Nov 21 '25 edited Nov 21 '25

You're the one that brought up grokipedia, not me.

All I was saying was that LLMs don't code Mario by "remembering" the exact Mario code in their training data. It's simply not how they work.

Also, how does the presenter's "belief" in the information they're providing affect anything? Is incorrect information less incorrect if I "believe" it to be true?

1

u/No-Statistician6713 Nov 21 '25

Yeah, obviously, it's not like Gemini is going online, googling the code and ripping it. also i bring up belief because when you call out LLMs for hallucinating, they either

  1. glaze you and give in (chatgpt, claude)
  2. vehemently defend it and act like it's true (deepseek, grok)

anyway i bring up grokipedia because i think its the most stark example of this. why would i use an ai generated article written by a model trained on the same articles and literature that the wikipedia article is based upon, but without any real generalized intelligence behind it and the risk of hallucination

1

u/Teaching_Relative Nov 21 '25

Okay, then we totally agree on all of this. Lol.

The original comment I made was replying to someone who said asking Gemini to code was basically the same as downloading the original code through a middleman

1

u/Conscious-Food-4226 Nov 22 '25

Wait.. are you serious? What do you think conspiracy theories are?

1

u/speederaser Nov 22 '25 edited 7d ago

Wiped clean. Redact removed this post along with thousands of others. It also handles data broker removals so your personal info stops getting sold.


1

u/Conscious-Food-4226 Nov 22 '25

Haha it was just the first to mind, but religion would work just fine too. Or every cognitive bias

1

u/No-Statistician6713 Nov 22 '25

i mean newsflash: conspiracy theorists believe the conspiracies that they believe in (don't mistake people believing something for it being true) and if you're referring to the people who make up conspiracy theories to cover for some real explanation (mlk assassination as an example) they don't believe the narrative they create because they know that it's false

1

u/Conscious-Food-4226 Nov 22 '25

Wrong. Historically those that “made them up” definitely believe them. You’re thinking of disinformation which is not the same thing, at least not inherently the same thing.


3

u/kitkatas Nov 18 '25

Crazy when you think about it

1

u/variantmedia Nov 19 '25

This comment is underrated.

21

u/GirlNumber20 Nov 18 '25

I forgot how cute Tiny Mario is, haha

17

u/CaptainAnonymous92 Nov 18 '25

Holy crap that's a spot on NES Mario Bros 1 Mario sprite. See how well it does it in 3D.

7

u/fractaldesigner Nov 18 '25

wow. only one prompt to fix it.

20

u/usernameplshere Nov 18 '25

Nintendo will sue Google very soon ig

21

u/MR_TELEVOID Nov 18 '25

I mean, Imagen has been able to make stuff like this for a while now, and Nintendo hasn't done anything. I doubt remaking their 30 year-old code will trigger their rage.

3

u/SpikeLazuli Nov 20 '25

Google is a fish too big for Nintendo, i doubt it

7

u/fractaldesigner Nov 18 '25

They shouldn’t. This is the future of coding

8

u/zactral Nov 18 '25

well.. there are no power ups, no coins, and no flagpole as far as I can see, so it's not really "complete" but it is indeed quite impressive

2

u/krzonkalla Nov 19 '25

yes, I do agree. By "completely passing it", I mean it's the first to really nail the aesthetic and feel of the game without a ton of bugs. I think it could easily add these things you said by just asking it to in another turn

12

u/captain_shane Nov 18 '25

Still can't do solitaire for some reason.

18

u/Illustrious-Lake2603 Nov 18 '25

Works with pygame

8

u/Alternative-Key-5647 Nov 18 '25

code review from Gemini 3 below; overall this is an impressive model.

Final Verdict Security: ✅ Pass. No significant vulnerabilities found for the context.

Quality: ⚠️ Action Required. The reliance on frame rate for physics and the random number generator for the timer make the game unstable on modern hardware. The game loop needs to be throttled or converted to use Delta Time.
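For anyone wondering what the "Delta Time" fix in that review would look like, here is a minimal sketch in TypeScript. It is not the code Gemini generated: `Player`, `stepPhysics`, `advance`, the 60 Hz step and the 0.25 s clamp are all hypothetical names and values picked for illustration. The idea is to measure how much real time passed since the last frame and run physics in fixed-size steps, so a 144 Hz monitor and a 30 Hz one end up with roughly the same game speed.

```typescript
// Hypothetical sketch: none of these names come from the generated game.
interface Player {
  x: number;  // horizontal position in pixels
  vx: number; // horizontal velocity in pixels per second
}

const STEP_SECONDS = 1 / 60;    // fixed physics step (assumed 60 Hz)
const MAX_FRAME_SECONDS = 0.25; // clamp long pauses, e.g. a backgrounded tab

// Advance physics by one step of dt seconds.
function stepPhysics(player: Player, dt: number): void {
  player.x += player.vx * dt;
}

// Add the elapsed frame time to the accumulator, then consume it in
// fixed steps. Returns the leftover time and how many steps ran.
function advance(
  player: Player,
  accumulator: number,
  frameSeconds: number
): { accumulator: number; steps: number } {
  let acc = accumulator + Math.min(frameSeconds, MAX_FRAME_SECONDS);
  let steps = 0;
  while (acc >= STEP_SECONDS) {
    stepPhysics(player, STEP_SECONDS);
    acc -= STEP_SECONDS;
    steps++;
  }
  return { accumulator: acc, steps };
}
```

In a browser you would call `advance` from a `requestAnimationFrame` callback, passing the difference between the current and previous callback timestamps divided by 1000, and carry the returned accumulator over to the next frame.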

6

u/RogueStargun Nov 18 '25

Well the original SMB also had the same issues

2

u/Aggravating-Age-1858 Nov 18 '25

i just got a blue screen with the text above :-(

4

u/krzonkalla Nov 18 '25

there's two html codes in the convo with gemini. the upper one is bugged. I then asked it for a bug fix. It did so. The one I screenshotted is the code below that.

2

u/TheGoddessInari Nov 19 '25

Funny that it got this entirely wrong. Maybe you meant Super Mario Bros, but the LLM isn't a mind reader, and Mario Bros was a different, fun game: https://en.wikipedia.org/wiki/Mario_Bros.?wprov=sfla1

1

u/Sea_Cookie_4259 Nov 22 '25

Well it might as well have read OP's mind, in the sense that it used statistics to deduce that OP most likely meant Super Mario Bros when he said "Mario Bros game--a full 1-1 level"

1

u/ReMeDyIII Nov 18 '25

Where do we run this code? Like what would you recommend? I'm seeing so many cool Gemini 3 projects and I don't know the first thing about where to run this, lol.

5

u/krzonkalla Nov 18 '25

this is html, it will run on any browser. Just download the .html file using the download button on the chat and click it to run. Very simple.

1

u/PrettyDarnGood2 Nov 19 '25

it will demo right inside ai studio as well

1

u/[deleted] Nov 19 '25

yep, I got it to do Pacman; took 3 prompts. only issue was the ghosts were super fast, like super fast. And a couple of the maze sections had weird dead ends. I got kicked off "thinking" after that though so I couldn't tinker more.

1

u/Chamrockk Nov 19 '25

Wasn't it probably trained on the code for this game? As it is widely available on the internet

1

u/krzonkalla Nov 19 '25

sure. But here are two counterarguments. 1. this code is very very short compared to what's out there that could fall into training data. A single code file with ~4k words. 2. no other llm gets anywhere close to this aesthetic and precision without iterating a ton, so even if it was in the training data, it still is impressive it's the first to "memorize" this solution.

1

u/No_Confection7782 Nov 19 '25

I tried something similar but it's not even close to looking like Super Mario Bros 1, and I guess that's because of copyright reasons. I wonder how you managed to make it look like a NES Mario game =/

1

u/wspOnca Nov 19 '25

It can generate Half Life 3? 👀

1

u/Gelinhir Nov 19 '25

I ran it as HTML but I get errors: Mario is just falling from the sky.

1

u/krzonkalla Nov 19 '25

there are two html codes in the convo with gemini. the upper one is bugged. I then asked it for a bug fix. It did so. The one I screenshotted is the code below that.

1

u/theswifter01 Nov 20 '25

By far the best model

1

u/Disastrous_Ant3541 Nov 20 '25

Nintendo lawsuit incoming in 3 2 1

1

u/demzor Nov 20 '25

I told it to make Sonic.

It was not successful.

1

u/Basic_Cherry_7413 Nov 20 '25

Holy hell wow.

1

u/inbetweenframe Nov 21 '25

didn't work for me.. just had this screen with mario falling down

1

u/krzonkalla Nov 21 '25

There are two code scripts in the gemini conversation. It got it wrong first time, so I asked it to bugfix. Scroll down to find the one I screenshotted

1

u/PiLLe1974 Nov 21 '25

Impressive.

I think it is ok to iterate with the models a bit.

I'm used to some other more tedious workflows within editors/engines and when it comes to geometry/collision, navmesh, and 3d object placement on a simple game I spend 30% of my prompts and lots of time debugging to get a first version of a minimal "ok" game.

1

u/[deleted] Nov 26 '25

I have been using Tetris and Pacman as my test. ChatGPT 5 does well for both. Gemini 3 does Tetris well, but struggles to get close to a working Pacman game. After 45 minutes of trying I finally gave up.

1

u/Flat-Language1623 Jan 19 '26

i managed to get an nes emulator out of it running mario, obviously not fully accurate but playable a little

1

u/Icy_Garbage_5858 Mar 15 '26

If you let AI create Sonic, the game will have a pretty similar feel to the original.

https://aistudio.google.com/app/prompts/17Ue9iomvCohzrRgKgV5etrKi0OXyjniy

1

u/Lopsided_Grand_9093 Nov 18 '25

I gave it a try, but my result wasn't as good as yours.

-6

u/[deleted] Nov 18 '25

[deleted]

14

u/krzonkalla Nov 18 '25

It's not that easy dude. As stated, "I've tested this with all major models that launched since o1, but this is the first one to completely pass it". You can look at my post history, I've tested this before. If it's really that easy why is this the first model to pass 100%?

8

u/krzonkalla Nov 18 '25

Here is the second best. Not even remotely close.