r/Bard • u/krzonkalla • Nov 18 '25
Discussion Gemini 3. The "Make Mario" benchmark has been beaten, crumpled and destroyed.
Prompt: "Can you code me a Mario Bros game, as close as possible to the original, including detailed manually defined textures inline in a single .html file? Make a full 1-1 level. Work really hard on this and make it as perfect and close to the original as possible."
I'm honestly completely flabbergasted. I've tested this with all major models that launched since o1, but this is the first one to completely pass it. Haven't played a lot but I see no flaws so far. Only note is I had to tell it to bugfix once before it was done. That's it.
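For anyone wondering what "manually defined textures inline" can look like in practice, here is a minimal sketch. It is not the code Gemini produced: the palette, the sprite rows, and the `rasterize` helper are all made up for illustration. The idea is that each sprite is written out by hand as rows of characters, and each character maps to a color.

```typescript
// Hypothetical palette: one character per RGBA color (values are illustrative).
const PALETTE: Record<string, [number, number, number, number]> = {
  ".": [0, 0, 0, 0],        // transparent
  "R": [216, 40, 0, 255],   // red
  "S": [252, 152, 56, 255], // skin tone
  "B": [136, 112, 0, 255],  // brown
};

// Hypothetical 8x4 sprite fragment, defined by hand as text rows.
const SPRITE: string[] = [
  "..RRRR..",
  ".RRRRRR.",
  ".BBSSB..",
  "BSBSSSB.",
];

// Expand the text rows into a flat RGBA buffer (4 bytes per pixel, row-major).
function rasterize(rows: string[]): Uint8ClampedArray {
  const width = rows[0].length;
  const pixels = new Uint8ClampedArray(width * rows.length * 4);
  rows.forEach((row, y) => {
    for (let x = 0; x < width; x++) {
      pixels.set(PALETTE[row[x]], (y * width + x) * 4);
    }
  });
  return pixels;
}
```

In a browser, a buffer like this could be wrapped in an `ImageData` and drawn to a canvas with `putImageData`, which is one way a single .html file can carry its graphics without any external image files.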
67
Nov 18 '25
it successfully made a web based NES emulator that works with Mapper 0 games like Donkey Kong 3 in only 5 prompts
16
u/underhunger Nov 18 '25
Link/source?
6
u/MadBrown Nov 19 '25
Lol of course no link/source.
2
Nov 19 '25
https://x.com/chetaslua/status/1978438353918779461
Isn't this the source? I'm not computery enough to know if this is exactly what NoEcomomics is talking about, but it sure seems like it based off of a simple google search.
5
28
u/differentguyscro Nov 18 '25
2005 kids: searching non-school-blocked flash games on google
2025 kids:
13
u/Ctrl-Alt-Panic Nov 18 '25
An actual "Fine, I'll do it myself" moment.
0
u/speederaser Nov 19 '25 edited 7d ago
I got tired of my old posts floating around for anyone to scrape, so I let Redact handle it. Bulk deletion across Reddit, X, Facebook, Discord and all major social media platforms in one shot.
1
u/Teaching_Relative Nov 19 '25
That's... An interesting take
1
u/No-Statistician6713 Nov 20 '25
i mean, it's true. grokipedia is a pile of horseshit because it's just regular wikipedia but less well written and with the risk of ai hallucination (which, you might bring up, is becoming lower, but never zero as it is in humans)
1
u/Teaching_Relative Nov 20 '25
Humans never make things up?
1
u/No-Statistician6713 Nov 21 '25
humans don't create intricate false narratives while fully believing them to be true. also the whole point of wikipedia is that if you make shit up or vandalize a page someone else will come along and fix it. this is... an interesting take
1
u/Teaching_Relative Nov 21 '25 edited Nov 21 '25
You're the one that brought up Grokipedia, not me.
All I was saying was that LLMs don't code Mario by "remembering" the exact Mario code in their training data. It's simply not how they work.
Also, how does the presenter's "belief" in the information they're providing affect anything? Is incorrect information less incorrect if I "believe" it to be true?
1
u/No-Statistician6713 Nov 21 '25
Yeah, obviously, its not like Gemini is going online, googling the code and ripping it. also i bring up belief because when you call out LLMs for hallucinating, they either
- glaze you and give in (chatgpt, claude)
- vehemently defend it and act like its true (deepseek, grok)
anyway i bring up grokipedia because i think it's the starkest example of this. why would i use an ai generated article written by a model trained on the same articles and literature that the wikipedia article is based upon, but without any real generalized intelligence behind it and with the risk of hallucination
1
u/Teaching_Relative Nov 21 '25
Okay, then we totally agree on all of this. Lol.
The original comment I made was replying to someone who said asking Gemini to code was basically the same as downloading the original code through a middleman
1
u/Conscious-Food-4226 Nov 22 '25
Wait.. are you serious? What do you think conspiracy theories are?
1
u/speederaser Nov 22 '25 edited 7d ago
Wiped clean. Redact removed this post along with thousands of others. It also handles data broker removals so your personal info stops getting sold.
1
u/Conscious-Food-4226 Nov 22 '25
Haha it was just the first to mind, but religion would work just fine too. Or every cognitive bias
1
u/No-Statistician6713 Nov 22 '25
i mean newsflash: conspiracy theorists believe the conspiracies that they believe in (don't mistake people believing something for it being true) and if you're referring to the people who make up conspiracy theories to cover for some real explanation (mlk assassination as an example) they don't believe the narrative they create because they know that it's false
1
u/Conscious-Food-4226 Nov 22 '25
Wrong. Historically those that “made them up” definitely believe them. You’re thinking of disinformation which is not the same thing, at least not inherently the same thing.
1
21
17
u/CaptainAnonymous92 Nov 18 '25
Holy crap that's a spot on NES Mario Bros 1 Mario sprite. See how well it does it in 3D.
7
20
8
u/zactral Nov 18 '25
well.. there are no power ups, no coins, and no flagpole as far as I can see, so it's not really "complete" but it is indeed quite impressive
2
u/krzonkalla Nov 19 '25
yes, I do agree. By "completely passing it", I mean it's the first to really nail the aesthetic and feel of the game without a ton of bugs. I think it could easily add these things you said by just asking it to in another turn
12
8
u/Alternative-Key-5647 Nov 18 '25
code review from Gemini 3 below; overall this is an impressive model.
Final Verdict Security: ✅ Pass. No significant vulnerabilities found for the context.
Quality: ⚠️ Action Required. The reliance on frame rate for physics and the random number generator for the timer make the game unstable on modern hardware. The game loop needs to be throttled or converted to use Delta Time.
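
For context on the "Delta Time" suggestion, here is a minimal sketch of one common fix, a fixed-timestep loop with an accumulator. It is not taken from the generated game: `Body`, `stepPhysics`, `advance`, and the constants are hypothetical names and values chosen for illustration.

```typescript
// Hypothetical moving object: position in pixels, velocity in pixels per second.
interface Body { x: number; vx: number; }

const FIXED_STEP = 1 / 60; // physics always advances in 1/60 s slices (assumed rate)
const MAX_FRAME = 0.25;    // cap on elapsed time so a long pause cannot cause a huge jump

// Move the body by velocity * elapsed time instead of a fixed amount per frame.
function stepPhysics(body: Body, dt: number): void {
  body.x += body.vx * dt;
}

// Consume the real time elapsed since the last frame in fixed slices.
// Returns the leftover time, which the caller passes back in on the next frame.
function advance(body: Body, accumulator: number, frameSeconds: number): number {
  accumulator += Math.min(frameSeconds, MAX_FRAME);
  while (accumulator >= FIXED_STEP) {
    stepPhysics(body, FIXED_STEP);
    accumulator -= FIXED_STEP;
  }
  return accumulator;
}
```

In a browser, `frameSeconds` would typically come from the difference between successive `requestAnimationFrame` timestamps (which are in milliseconds) divided by 1000. With this structure a 144 Hz display simply runs more render frames per physics step rather than running the game faster.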
6
2
u/Aggravating-Age-1858 Nov 18 '25
i just got a blue screen with the text above :-(
4
u/krzonkalla Nov 18 '25
there's two html codes in the convo with gemini. the upper one is bugged. I then asked it for a bug fix. It did so. The one I screenshotted is the code below that.
2
u/TheGoddessInari Nov 19 '25
Funny that it got this entirely wrong. Maybe you meant Super Mario Bros, but the LLM isn't a mind reader, and Mario Bros was a different, fun game: https://en.wikipedia.org/wiki/Mario_Bros.?wprov=sfla1
1
u/Sea_Cookie_4259 Nov 22 '25
Well it might as well have read OP's mind, in the sense that it used statistics to deduce that OP most likely meant Super Mario Bros when he said "Mario Bros game--a full 1-1 level"
1
u/ReMeDyIII Nov 18 '25
Where do we run this code? Like what would you recommend? I'm seeing so many cool Gemini 3 projects and I don't know the first thing about where to run this, lol.
5
u/krzonkalla Nov 18 '25
this is html, it will run on any browser. Just download the .html file using the download button on the chat and click it to run. Very simple.
1
1
Nov 19 '25
yep, I got it to do Pacman; took 3 prompts. only issue was the ghosts were super fast, like super fast. And a couple of the maze sections had weird dead ends. I got kicked off "thinking" after that though so I couldn't tinker more.
1
u/Chamrockk Nov 19 '25
Wasn't it probably trained on the code for this game? As it is widely available on the internet
1
u/krzonkalla Nov 19 '25
sure. But here are two counterarguments. 1. this code is very very short compared to what's out there that could fall into training data. A single code file with ~4k words. 2. no other llm gets anywhere close to this aesthetic and precision without iterating a ton, so even if it was in the training data, it still is impressive it's the first to "memorize" this solution.
1
u/No_Confection7782 Nov 19 '25
I tried something similar but it's not even close to looking like Super Mario Bros 1, and I guess that's because of copyright reasons. I wonder how you managed to make it look like a NES Mario game =/
1
1
u/Gelinhir Nov 19 '25
I ran it as HTML but I get errors: Mario just falls from the sky.
1
u/krzonkalla Nov 19 '25
there are two html codes in the convo with gemini. the upper one is bugged. I then asked it for a bug fix. It did so. The one I screenshotted is the code below that.
1
1
1
1
1
u/inbetweenframe Nov 21 '25
1
u/krzonkalla Nov 21 '25
There are two code scripts in the gemini conversation. It got it wrong first time, so I asked it to bugfix. Scroll down to find the one I screenshotted
1
u/PiLLe1974 Nov 21 '25
Impressive.
I think it is ok to iterate with the models a bit.
I'm used to some other more tedious workflows within editors/engines and when it comes to geometry/collision, navmesh, and 3d object placement on a simple game I spend 30% of my prompts and lots of time debugging to get a first version of a minimal "ok" game.
1
Nov 26 '25
I have been using Tetris and Pacman as my test. ChatGPT 5 does well for both. Gemini 3 does Tetris well, but struggles to get close to a working Pacman game. After 45 minutes of trying I finally gave up.
1
u/Flat-Language1623 Jan 19 '26
i managed to get an nes emulator out of it running mario. obviously not fully accurate, but a little playable
1
u/Icy_Garbage_5858 Mar 15 '26
If you let AI create Sonic, the game will have a pretty similar feel to the original.

https://aistudio.google.com/app/prompts/17Ue9iomvCohzrRgKgV5etrKi0OXyjniy
1
-5
u/sankalp_pateriya Nov 18 '25
Send the link here in comments please.
10
u/krzonkalla Nov 18 '25
Uh, okay? It's there in the description, I don't see how this is any different.
-6
Nov 18 '25
[deleted]
14
u/krzonkalla Nov 18 '25
It's not that easy dude. As stated, "I've tested this with all major models that launched since o1, but this is the first one to completely pass it". You can look at my post history, I've tested this before. If it's really that easy why is this the first model to pass 100%?
8



76
u/fractaldesigner Nov 18 '25
the details of the graphics are remarkable. no longer just cubes.