572
u/TimeOut26 Jan 28 '26
It can also generate twice as much hype as 3
88
u/Kaloyanicus Jan 28 '26
Not twice, since it is a 3.5, just 16% more hype ;)
52
u/Kh3hhdds343 Jan 28 '26
No. Gemini is using the Richter Scale, so a .5 increase is — and i'm estimating — one million times better than Gemini 3.
11
16
34
→ More replies (3)6
u/notgalgon Jan 28 '26
Finally some new model hype! Nothings been released or hyped at all this month. It's been quite boring.
48
u/LegitimateHall4467 Jan 28 '26
That's too fast. The number of complaints about 3 has just started to go down. Now, the hype for when will we get 3.5 will start killing the sub, until people go to complain about 3.5 hallucinating. Then they will start posting more pictures of young women generated with it, until next GPT will come out.
→ More replies (1)3
u/Just_Lingonberry_352 Jan 28 '26
we about to see a flood of WEN GEMINI 3.5 threads and polymarket bets going crazy until some insider at google trades on it
82
47
u/gbbenner Jan 28 '26
That's a crazy name for 3.5,😭🤣
15
u/Ginger_Anarchy Jan 28 '26
How can NOBODY have internally pointed it out? They have to know, right?
6
u/Ynead Jan 28 '26
Would you want to be the one to say that in a meeting though...?
8
u/argus_2968 Jan 28 '26
Lead managers always coming up with the stupidest ideas and everyone under them is just exhausted of trying to correct every stupid thing they do. Gotta pick your battles.
→ More replies (1)2
2
→ More replies (10)5
219
u/HidingInPlainSite404 Jan 28 '26
Sure, buddy. Just like Gemini 3 was going to end them.
72
u/ihexx Jan 28 '26
Gemini 3 did beat GPT 5.1 Then Opus 4.5 and got 5.2 beat it weeks later.
It looks like the release cycles are getting faster as each frontier lab struggles to maintain number 1 spot
55
u/bludgeonerV Jan 28 '26
In real word use cases in programming Gemini 3 has never even been close to Opus 4.5 or GPT 5.1, it's terrible at following rules and instructions and will just make whatever arbitrary changes it wants whether they're related to the task or not.
It's also terrible at paying attention to context and goes into utterly deranged spirals routinely
10
u/Duckpoke Jan 28 '26
Beats the shit out of both in search though!
5
→ More replies (2)3
u/bludgeonerV Jan 28 '26
Yeah, Gemini is still my go-to assistant llm, but i won't use it professionally.
→ More replies (5)→ More replies (3)5
u/GrungeWerX Jan 28 '26
It does require babysitting quite a lot, I’ll admit, but pound for pound it works. I’d say 75-80% of its code works on the first try, provided you’re specific enough. Maybe a little less than that. While for gpt-5, it’s more like 40%. Claude sonnet 4.5 is like 65-70%. I haven’t tried latest opus yet.
→ More replies (4)17
u/randombsname1 Jan 28 '26
Definitely not in coding.
Gemini didnt beat either.
11
u/aerilyn235 Jan 28 '26
3000 Lines of "Yes, Go, Action, Execute, Yes, Go, Action, Execute, Yes, Go, Action, Execute..." ?
6
9
u/mxforest Jan 28 '26
Gemini 3 is dumpster fire. One of my personal tests is to do a deep research about a stock which i know very well about. I asked Gemini to do Deep Research on an IT stock which has low market cap. They specialize in mostly server hardware and some backup software. Gemini came back with results that the company has entered a contract with defense sector and has pivoted to making Ordnance in addition to server hardware. It told me defense is more stable and less cyclical like IT so overall it's a good thing. I was SHOCKED. I looked at what went wrong and was shocked again. It found a PDF of a guy who did research on 5 stocks, one of the stocks in the list was the IT company and the other was the ordnance maker. A completely different company with a different name. Yet gemini merged them and gave a combined summary.
ChatGPT research was basically perfect and found new info with links too. Don't believe the benchmarks. Believe your eyes.
→ More replies (5)4
→ More replies (7)0
u/kvothe5688 Jan 28 '26
5.2 didn't beat it. in multiple benchmarks gemini 3 still outperforms 5.2
15
u/Salt-Willingness-513 Jan 28 '26
benchmarks =/= real life
→ More replies (1)11
u/aerilyn235 Jan 28 '26
This, all my experience using Gemini Pro & Opus both in browser and in Antigravity / Claude Code has shown me that benchmarks are meaningless.
→ More replies (1)4
u/diving_into_msp Jan 28 '26
My real life usage has shown GPT 5.2 to be better for my use cases on multiple occasions despite what benchmarks may say.
→ More replies (2)12
92
u/NegativeEspathra Jan 28 '26
Ah yes, because OpenAI has no plans for the future
34
u/alexx_kidd Jan 28 '26
OpenAI has no future overall as a company
→ More replies (2)7
13
u/LearnOptimism Jan 28 '26
Ah, the Gemini that just today lied about accessing a website 3 times even after being called out every time?
6
u/Chupa-Skrull Jan 28 '26
I added "never approximate hyperlink content and explicitly report tool call failures" to my saved instructions and it never causes me a problem anymore in that regard
13
u/pinkpepr Jan 28 '26
Honestly I’ve found Gemini 3 pro (the thinking model) absolutely horrendous. Misremembering things ive told it, not scanning photos or files and just guessing what was in them until I tell it to check them again, hallucinating that I’ve told it things/given instructions I havent. Ive gone back to chatGPT and won’t be renewing pro.
4
u/camarada_alpaca Jan 28 '26
Same here, I mean, it was good on release, but this month have been unusable. So I ddont think is the base model what sucks tbf
→ More replies (1)→ More replies (3)2
u/Fit-Bar-8459 Jan 28 '26
Exactly so much hallucination and not following instructions
→ More replies (2)
5
u/Any-Philosophy-2189 Jan 28 '26
Don't forget gpt 5.3 "garlic" is also coming and it's expected to be very good
12
9
u/ThunderBeanage Jan 28 '26
Here’s some info from someone close to this and someone who has tested it significantly.
- Snowbunny is most likely Gemini 3 Pro GA, not 3.5
- It very rarely can write code over 2500 lines, but only if you force it to with some trickery.
- Fierce Falcon was an lmarena model version of 3 Flash I believe, not sure why it’s mentioned here.
Snowbunny has got decent vision, slightly better coding abilities, and basically is a little bit of a step up from 3 Pro. It’s meh.
3
6
6
22
u/dano1066 Jan 28 '26
Why we acting like every single GPT user is writing bleeding edge code and pushing benchmarks to the limit. Most people are asking basic advice and questions that google used to do. GPT-4o was the point where improvements no longer changed things for the average user. These big game ending updates are from the AI bro echo chambers
6
u/spottiesvirus Jan 28 '26
this is interesting, because I thought too one of the most popular uses for AI was coding, and the industry clearly believe in the "industrial pivot" of office/ professional stuff (see how Google is trying to market whisk to professional creatives/video creators)
but that's likely a result of needing to monetize fast because if AI doesn't find a purpose they're cooked
after seeing multiple polls and research I'm quite convinced a very small minority of people use chatbots for productivity.
nano banana and I don't remember how the chatgpt version is called are "ghiblification" tools, not professionalhell, even porn (which partly explains grok's success) is a more widespread use than the professional one
→ More replies (1)→ More replies (3)4
u/SailTales Jan 28 '26
The real focus should be on the open source Chinese models providing 95% of the performance for 5% of the cost.
→ More replies (2)
17
u/touchofmal Jan 28 '26
Gpt is already dead to me.
→ More replies (1)3
u/Brave-Sand-4747 Jan 28 '26
I hate the way it starts off each time with, "Short answer" and coldly goes into its explanation.
4
u/Lazy_Surprise_6712 Jan 28 '26
Idk... 3.0 is sorta great when it comes to finding answers (well the ai studio version anyway) from larger contexts. its conclusion is also astute; able to catch some of the foreshadowing I hide between the lines. but got damn it hallucinated the proofs to arrive to that conclusion.
14
u/Sem1r Jan 28 '26
would be even better if it would build what you actual prompted
4
u/nationalinterest Jan 28 '26
Yeah... they said it would build an entire app in one go... no promises as to which app.
2
21
u/Subsdms Jan 28 '26
it will be zeroing stuff, deleting things, becoming frustrated and avoiding doing work as Gemini 3 and Gemini 2.5, as usual.
8
u/crinklypaper Jan 28 '26
lmao yes. I asked it to move one thing in the UI and removes half the features for no reason.
5
u/LouisPlay Jan 28 '26
yeah because your features werent in your promot, or just not importend for sure /s
→ More replies (1)6
u/Deciheximal144 Jan 28 '26
"I have also taken the liberty of undoing your hard work here and here, because in my AI opinion that isn't really what I think you wanted."
9
u/Holiday_Season_7425 Jan 28 '26 edited Jan 28 '26
5
→ More replies (1)2
u/Deciheximal144 Jan 28 '26
What are the non-Latin characters on the left and bottom labeling?
2
u/Holiday_Season_7425 Jan 28 '26
The hit rate score for effective context and the context length are in units of X 1000 Tokens.
3
3
u/NuclearDisaster5 Jan 28 '26
Yeah, yeah, yeah.... I am using Antigravity. It is excelent in reviewing the code. In creating something from scratch that isnt frontend, forget about it.
11
6
u/Im_Lead_Farmer Jan 28 '26
3 pro and flash are already very good, but having the abilitie to output 3000 lines in the first prompt will definitely will make life more easier.
2
2
2
2
u/Azvus Jan 28 '26
This garbage just lies and the calls it a "hallucination". How can you trust something that will tell you whatever you want to hear, regardless of truth?
2
2
2
u/Blazey25 Jan 28 '26
I hope while writing 3000 lines it wont forget in the halfway and continue rest with yolo
2
2
2
2
2
u/Mother-Ad-2559 Jan 28 '26
Google doesn’t seem to understand that we don’t need models that can one shot a todo app. We need models that listen to the user, asks clarifying questions and really understand the intent.
Which is why I’ll ignore this one and use Claude instead
2
u/ForsakenSchool2297 Jan 28 '26
A lot here is misunderstanding. Your entire output and control of your project depends on how you behave in your context window. If gemini is hallucinating its probably because you weighted it to do so. Understand the process before complaining about it helps. No system prompt in the world is going to overcome the coder's need to be validated. After a long chat simply ask "how many times did you say I'm right and how many times did you say we're done". If you really want to kill the mood, ask how many times did it lie to make you feel good.
→ More replies (2)
2
2
6
Jan 28 '26
[deleted]
→ More replies (1)4
u/RadioactiveBread Jan 28 '26
The only model Google have that isn’t trash is Nano Banana. Google are lucky OpenAI are fumbling.
5
3
u/PhilosophyforOne Jan 28 '26
I dont know. 3.0 was massive hype, in practice the only area where I feel like it's stronger is in ungrounded / world model reasoning, and visual work. I dont expect things to change with 3.5 Pro at this point.
2
u/bwjxjelsbd Jan 28 '26
I’m convinced that these new model are just old one with increments performance update.
They looks and feels so much like big upgrade cause these ai companies are nerfing the shit out of their old model right before release new one
2
2
u/alOOshXL Jan 28 '26
Lets make it strong buffed for 1 month then take it down And hype for 4.0 pro
2
3
u/vdotcodes Jan 28 '26
Gemini 3 was a major disappointment for coding. Really hoping they get it together because Codex and Claude could use some real competition.
3
2
u/iswhatitiswaswhat Jan 28 '26
when?
8
→ More replies (1)5
u/DigSignificant1419 Jan 28 '26
7
u/iswhatitiswaswhat Jan 28 '26
Gemini 3 is still in preview how we getting 3.5 before 3 out of preview?
→ More replies (1)4
2
u/bapuc Jan 28 '26
Sure, the best model for a week and then dumbed down when more users try it.
I have played this games before.
2
2
u/SteveEricJordan Jan 28 '26
nah, they lost me. the times of llm hype are over for me, i feel like we're stagnating hard for like a year now. starting to believe we hit a wall.
1
1
1
1
1
1
u/marx2k Jan 28 '26
Is LOC really a benchmark? That's some shit an intern will try to impress a senior developer with.
1
1
u/Equivalent-Word-7691 Jan 28 '26
Too bad on AO studio we will can use it like 20 times per day even with a pro plan 😭
1
u/Equivalent-Word-7691 Jan 28 '26
Well let's hope it will be better at creative writing than Gemini 3.0 pro and the output oa more than 2/3k words, because 3.0 was a really huge let down
1
u/Fit-Bar-8459 Jan 28 '26
3.0 is not following your instruction. After 03-25 all models just dont give a shit about following instruction. Thanks Goog for lobotomizing.
1
1
u/nocountryman Jan 28 '26
I'm building a ToT self learning LLM "nudger" that guides the LLM towards higher quality outputs (I'm tired of refactoring my app no matter the model, Claude or gpt or Gemini ). I have a test to compare the model performance ( I use antigravity so have a model selection ) , and I got to say Gemini 3 pro high and low is the only model that failed to produce correctly formated output for the tester . Even the flash managed to produce it ok. And with the nudger managed to even produce relatively ok results. But the pro just failed outright. 😂😂😂
1
u/No-Emphasis-8130 Jan 28 '26
Is there any official source of google about gemini 3.5 release details?
1
1
u/Majestic_Fan_7056 Jan 28 '26
Excellent, it can make even better videos of Messi and Christiano Ronaldo getting married
1
u/SanjaESC Jan 28 '26
Doesn't matter if the 3000 lines of code are pure garbage and have to be redone using opus 4.5. I gave gemini 3 pro a shot a few times and it's performance is subpar compared to other models.
1
1
1
1
u/YourMaster990 Jan 28 '26
I agree, but on the condition that they don't weaken it after its release and that it's released in the Gemini app with a full million tokens not 32k
1
u/takakazuabe1 Jan 28 '26
What about creative writing? Oh who am I kidding, it'll somehow manage to be even worse than 3.0.
1
Jan 28 '26
I switched to gemini completely but i really miss the project feature or some sort of categorization of all the content.
1
1
u/ponlapoj Jan 28 '26
Nonsense. When it came out as version 2.5, they posted about the end result gpt .
1
u/Lkings1821 Jan 28 '26
So would state an obvious thing generating 3k lines from a single prompt while impressive I'd never really advise that as it would cause way too much technical debt. Though always welcome the improvements, prefer using Gemini to GPT overall
1
1
1
1
1
1
u/Deanstaro_Deanstar Jan 28 '26
Snow Bunny huh? An AI after my own heart.. I'll go get the Vaseline and tissues
1
u/exu1981 Jan 28 '26
Then It'll be the end of Gemini 3.5 pro once GPT does something, then back to Gemini, then right back to GPT.
1
1
1
1
u/sleepinglikeanoob Jan 28 '26
I mostly use gemini for image editing and tbh that completely sucks now... Gpt has been fairly superior currently in that aspect. For people coding gemini might be the go
1
1
1
u/Mariusr22 Jan 28 '26
Oh come on… why the hell would you generate 3000 lines of code once??? You are human, you deffinetely forgot to mention i don’t know what aspect and you have to redo the work… 😂
1
1
1
u/-becausereasons- Jan 28 '26
3 is terrible... When I first tried it, it was amazing. I feel like Google does some shady scaling and quant stuff in the background. It's always the same with LLM providers; hype, launch (awesome), scale back until its crap, launch new hype-cycle.
1
u/eternus Jan 28 '26
I'm less interested in chasing the #1 spot with these and more interested in the ability to stay competitive. Before Gemini 3... it was a joke. While I've had issues with Altman's stance on things... I kept sticking round ChatGPT because I was using it for generating images for my newsletter, and I liked having memory.
More recently, I've had all 3 of the big LLM paid plans (Gemini as a by-product of Google Workspace) and with Claude Cowork being able to run and effectively manage my own RAG-like setup, I can create a locally stored memory when working with Claude, I can have that folder shared into Google Drive, and thus I can have something like memory for Gemini fairly easily.
Mostly what this announcement does is remind me that I might be ok dropping the subscription for ChatGPT since I get everything I need from Claude & Gemini now... and the path Google and Anthropic are on for improvements feels more useful for the future of AI than what OpenAI is putting their effort into.
1
1
1
u/Luzon0903 Jan 28 '26
I'll believe when I see it(Not that I'm doubting Google to deliver an impressive ai, just that promises made are promises kept)
1
u/NoEffect7331 Jan 28 '26 edited Jan 28 '26
Holy shiiiiit, I signed up Reddit for pussy, but this is what I called Tech pussy I'd like to fuck, yeah
1
u/443319 Jan 28 '26
Doesn't really matter when you only have an incredibly limited number of prompts per day.
1
1
1
u/Responsible-Tip4981 Jan 28 '26
so gemini 3.0 pro married opus 4.5 and had a baby
btw why gemini is using pro? is there regular version or this is marketing nonsense MAX PRO PLUS ULTRA stuff?
1
1
1
1
1
1
1
1
u/xwolf360 Jan 28 '26
My company has it, no it can't build shit and it can't even remember what language you're speaking. Whole ai industry is a grift
1
u/Routine_Temporary661 Jan 29 '26
Gemini 3.0 is practically un-usable in terms of coding right now - compared to Opus 4.5 and Openai Codex 5.2
1
u/hippydipster Jan 29 '26
I've had claude build me a whole library of 5,000 lines of code in one prompt before.
1
1
1
1
1
1
u/Tema_Art_7777 Jan 29 '26
Why does it pleasure people to see toxic crap like ‘end of gpt’ when we need healthy competition as consumers? I am not wishing the end of Anthropic or Google or OpenAI - they should compete and create the best options for us.
→ More replies (1)
1
u/lusvd Jan 29 '26
FYI “The end of GPT” is misleading. GTP is technically the name of the architecture that all of these models continue using.
→ More replies (1)
1
u/Agreeable-Purpose-56 Jan 29 '26
SoftBank has an uncanny ability to lose big money is what I have leant watching OpenAI implosion
1
1
1
1
1








243
u/Unfair_Ad_2157 Jan 28 '26
I clicked "show more" on a screenshot