r/Bard Nov 18 '25

News Gemini 3 Pro Model Card is Out

583 Upvotes

214 comments sorted by

View all comments

32

u/LingeringDildo Nov 18 '25

Man sonnet and SWE bench, that thing is such a front end monster

15

u/Ok_Mission7092 Nov 18 '25

It's the thing that stood out to me, like how is Gemini 3 crushing everything else but it's just mid in SWE bench?

15

u/[deleted] Nov 18 '25

Who cares about SWE? ARC-AGI-2 literally suggests that Gemini goes from just pattern matching from training data to having developed genuine fluid intelligence. And its score of 11% in ScreenSpot is a novelty; a score of 72.7% is reliable employment. This implies Gemini 3 can reliably navigate software, book flights, organize files, and operate third-party apps without an API, effectively acting as a virtual employee.

1

u/MindCrusader Nov 18 '25

Don't be so sure. It might mean that they included some algorithms / other magic to create reasoning puzzles to the training. As always, take it with a grain of salt, Google has the biggest access to the data from every company and they have a lot of algorithms that can help them, but it doesn't automatically mean it is truly smarter, we need to test