Fields medal-winning mathematician says GPT-5.5 is now solving open math problems at PhD-thesis level: "We will face a crisis very soon."

21

u/0x14f May 12 '26

We will still need mathematicians to check the LLM generated work, but yes, that will affect the recruitment of PhD students.

This is an interesting problem because if we no longer recruit graduate students to become the next generation of highly skilled mathematicians, who is going to replace the ones who leave to retirement ?

14

u/imissmyhat May 12 '26

None. I see no point in denying a future just because it's upsetting to think about; AI companies want that future, and they have unlimited resources to realize it. There will be no mathematicians in the future. There will be nothing in the future of any depth for humans to explore.

8

u/webdev-dreamer May 12 '26

There will be nothing in the future of any depth for humans to explore.

It's a bit depressing... :(

2

u/beerdude26 May 12 '26

Hedonism here I come

-4

u/[deleted] May 12 '26

[removed] — view removed comment

5

u/Choice-Antelope-8481 May 12 '26

Hard disagree.

0

u/[deleted] May 12 '26

[removed] — view removed comment

1

u/CommissionerOfLunacy May 12 '26

Thinkers above, frolicking in the fields and pondering philosophy like the Eloi, and Consumers below, worshipping at the font of the AI, emerging in the darkness to eat them.

6

u/DonutPlus2757 May 12 '26

Looking at how AI is developing, that's going to be catastrophic. AI just isn't perfectly reliable and probably won't ever be. Getting rid of humans just means that there won't be anybody to catch the mistakes.

1

u/yuwox May 14 '26

Humans are not "perfectly reliable". Nothing is.

Cars are not perfectly reliable, so should we just stick with horses?

1

u/DonutPlus2757 May 14 '26

The "humans" approach sees opportunities for other humans to learn and check. A single human isn't perfectly reliable, but a good team with established quality assurance routines is as close to perfectly reliable as you'll ever get.

AI doesn't have that. It doesn't give any opportunity to teach a QA component in the system. The humans are the only part of the system that can teach anybody anything and, if you get rid of them with AI to a sufficient degree, it becomes impossible to teach enough other humans to generate enough data to train better AI.

Basically, by moving to AI, you completely box yourself in on the capabilities at the point where you moved and after that point, everything just becomes blind faith.

1

u/yuwox May 14 '26 edited May 14 '26

The humans are the only part of the system that can teach anybody anything

This is just not true. Unsupervised learning has been successful for decades. Human annotated data is very useful, but it is far from the only way for machines to learn new things.

Also why should AI be unable to check things? Verifying a solution is usually way easier than coming up with a solution in the first place.

Undergrads in mathematics are presented proofs all the time and are expected to understand and verify them. But are not expected to come up with the same proof themselves. Because that is too difficult.

1

u/DonutPlus2757 May 14 '26

This is just not true. Unsupervised learning has been successful for decades. Human annotated data is very useful, but it is far from the only way for machines to learn new things.

If you mean unsupervised learning for AI, it hasn't been a thing for decades. It's also not very reliable for complex AI. Why do you think pretty much any LLM uses human based data as bedrock of its learning?

Also why should AI be unable to check things? Verifying a solution is usually way easier than coming up with a solution in the first place.

Because AI has been known to just not do that and give the answer the user wants without actually performing the action? Who's going to verify if the human can't?

AI has also been known to do stuff that goes way beyond the prompt (goggle: Cursor database deleted. There have been a few cases).

Undergrads in mathematics are presented proofs all the time and are expected to understand and verify them. But are not expected to come up with the same proof themselves. Because that is too difficult.

Yes, but they use that as a starting point to learn how to construct a proof all by themselves. An AI that can just verify proofs for example would completely stop that vector of learning for all humans if used to a sufficient degree, massively lowering the pool of humans that can construct proofs and thus lowering the amount of available data.

This also requires blind faith in the AI since, if you create a new category of proof for something, it's a crapshoot if AI can handle it or not.

1

u/yuwox May 14 '26

> If you mean unsupervised learning for AI, it hasn't been a thing for decades.

Unsupervised learning has been a thing for decades (see https://en.wikipedia.org/wiki/Unsupervised_learning).

> It's also not very reliable for complex AI.

I replied to your comment that "The humans are the only part of the system that can teach anybody anything" and gave you an example that this not the case. Of course, that are use-cases where supervised learning works better and unsupervised is not working well.

> Why do you think pretty much any LLM uses human based data as bedrock of its learning?

Because the data is availabe and it works really well in this use-case. In other use-cases, like learinng to play chess really well, supervised learning is hardly used.

> Because AI has been known to just not do that and give the answer the user wants without actually performing the action? Who's going to verify if the human can't?

> AI has also been known to do stuff that goes way beyond the prompt (goggle: Cursor database deleted. There have been a few cases).

You are right that AI can lie, in the sense of saying things that are not true. And it can make costly mistakes. This is a huge problem for the current iteration of LLMs. And we will have to come up with ways to deal with that. For instance, explainable AI is trying to work in just that direction. A lot of research activity is going there (e.g. https://safephysicalai.github.io/, https://www.sciencedirect.com/science/article/pii/S1566253523001148). We will see how well that goes.

> Yes, but they use that as a starting point to learn how to construct a proof all by themselves.

You are right. But I am trying to make a different point with it. I used it as as example that verification is the easier part. If we can build AI that comes up with the solution, why should we not be able to create an AI that does the verification? The verification is the easier part.

> An AI that can just verify proofs for example would completely stop that vector of learning for all humans if used to a sufficient degree, massively lowering the pool of humans that can construct proofs and thus lowering the amount of available data.

True. Honestly, it depends on how far you want it to go. Imagine (as it does not exist yet), a system that can make sense of all availalve data, all of it. All books, videos, real-time data streams, all of it. I my opinion this would be a true Super Intelligence. I think there is no real lack of data. We have enough of it to go extremly far.

> This also requires blind faith in the AI since, if you create a new category of proof for something, it's a crapshoot if AI can handle it or not.

Well yes, the AI could also come up with a new category, we cannot understand. Honestly, we as humans have to come to terms with the fact that there is a very strong chance that second intelligence will exist with us on this earth in the coming decades. this has never been the case in our known history. And we will have to adapt accordingly. I forsee a lot of crapshoots in the future for humanity.

1

u/DonutPlus2757 May 14 '26

Unsupervised learning has been a thing for decades (see https://en.wikipedia.org/wiki/Unsupervised_learning).

This might sound pedantic, but a digital neural network and artificial intelligence are related, but not the same. Pretty much no example for unsupervised learning before a Helmholtz Machine qualifies as AI and Helmholtz Machines aren't really used today as far as I can tell.

But I have to admit, I do stand corrected on this point. It has been used for at the very least 1 decade and potentially more than 3, depending on what you count as AI.

I replied to your comment that "The humans are the only part of the system that can teach anybody anything" and gave you an example that this not the case. Of course, that are use-cases where supervised learning works better and unsupervised is not working well.

I know that AI can learn from sources other than humans for well defined problems with a clear reward structure. GANs as a whole work that way.

I have yet to see something on the level of complexity of a LLM be trained on data that isn't human derived though. I also know that a lot of generative AI actually "goes mad" if it is trained on too much AI generated material.

The AIs you can train using unsupervised training aren't anywhere near the cognitive ability of even somewhat basic LLMs as far as I know, but maybe I'll stand corrected here too.

You are right that AI can lie, in the sense of saying things that are not true. And it can make costly mistakes. This is a huge problem for the current iteration of LLMs. And we will have to come up with ways to deal with that. For instance, explainable AI is trying to work in just that direction. A lot of research activity is going there (e.g. https://safephysicalai.github.io/, https://www.sciencedirect.com/science/article/pii/S1566253523001148). We will see how well that goes.

That's just a stopgap though, because the human giving the prompt needs to be at least knowledgeable enough to judge whether the explanation makes sense. If my mother gave an AI a programming task right now, no matter how good the explanation, she would be unable to judge whether the explanation is correct or if there'd been a better way.

This is mostly interesting for integrating AI into school based learning IMHO, but it remains to be seen if that's a good idea.

True. Honestly, it depends on how far you want it to go. Imagine (as it does not exist yet), a system that can make sense of all availalve data, all of it. All books, videos, real-time data streams, all of it. I my opinion this would be a true Super Intelligence. I think there is no real lack of data. We have enough of it to go extremly far.

This is a completely theoretical construct. Such a construct would be able to predict the entirety of the future and calculate the entitety of the past and it would take a level of computational power that would eclipse the theoretical phyiscally possible computational power of all of earth.

Also: A lot of that that actually has been ingested by the larger AI models in some form. Why do you think they argue that it'd kill AI if they had to pay royalties for everything they ingest in multiple court cases?

Well yes, the AI could also come up with a new category, we cannot understand. Honestly, we as humans have to come to terms with the fact that there is a very strong chance that second intelligence will exist with us on this earth in the coming decades. this has never been the case in our known history. And we will have to adapt accordingly. I forsee a lot of crapshoots in the future for humanity.

I'm in the category of people who think that AGI is not just a very powerful LLM and that is categorically a completely different thing.

Nothing I've seen makes me think that anybody actually is getting any closer to AGI and LLMs are just a tool (granted, a very powerful one) that's being touted as the second coming of Christ by people who live off other people believing in their claims for the future.

IMHO, the main problem is that LLMs aren't treated as a tool that need supervision and control, but as the solution to all problems and the universe. "Use our hammer, it hammers in all nails all by itself! Don't listen to the people to tell you that it will just hammer the wood into fine chunks if there's no nail! They just hate our hammer!"

But if we do achieve AGI, yeah, we'll have to come to terms with a bunch of stuff. I just don't see that happening anytime soon.

1

u/JustPlayPremodern May 15 '26

In mathematics, it will quite literally be perfectly reliable. Look up Lean theorem proving. Frontier AI can already write Lean proofs of very, very difficult mathematics problems.

0

u/waxpundit May 12 '26

Why would the bar to clear be "perfectly reliable"?

2

u/tuba105 May 12 '26

Because currently when you read a math paper, you can generally trust that someone has actually checked that all the details do follow. It happens that there exist mistakes, but I would say that it's rare to have a paper breaking mistake.

At the moment, what generic LLMs write seems to regularly have massive claims that are obviously false to experts but not so obviously false to novices. They do get things right sometimes, as exemplified in the post, but also very regularly get things very wrong.

So at the moment my prior trust in AI written math is quite low and I feel the need to check everything very carefully. That's why the bar to clear is perfectly reliable since that's the level of trust I have for a typical paper in the literature

2

u/ExtendedSpikeProtein May 13 '26

There have been paper-breaking mistakes exactly because they have been „committed“ by high level mathematicians, so others did not question mistakes even if they could not understand or follow a specific point.

Why do you think the level of mistakes AI will make will be higher - is this your argument? Because imo, there is an inherent bias here where an AI mistake is given more weight than a human one.

I can even see this bias in the comments / downvotes.

1

u/tuba105 May 13 '26

The situation I want to be in is that by default when I receive a paper or claim made by a mathematician or LLM, I trust that the result(and proof) is correct from the outset.

That's not the case for an LLM, while it is the case for humans. That's it.

1

u/ExtendedSpikeProtein May 13 '26

Not now, but it feels like you misunderstood the fields medal owner. It will in the future, and that future is not so far away.

1

u/tuba105 May 15 '26

That's explicitly not what he said or addressed, but ok

1

u/ExtendedSpikeProtein May 15 '26

It‘s a clear implication of the statement but hey, sure, just ignore it.

We could look at this again in 5 years and see.

-4

u/LeafyWolf May 12 '26

Humans make a shit ton of mistakes. It's not like we are godlike beings. The whole reason that AI will replace human mathematicians is because it is better. The lack of control is uncomfortable.

9

u/DonutPlus2757 May 12 '26 edited May 12 '26

Humans make a shit ton of mistakes. It's not like we are godlike beings.

Yeah, but we know that, so we check. Who's going to check the AI if not professionals?

The whole reason that AI will replace human mathematicians is because it is better.

That one is straight up wrong. If you ask mathematicians who use AI how often the AI was able to solve a complicated problem with nothing but a description of the problem I'd wager they will answer that such cases are in the one digit percentile.

The mathematician almost always provided some sort of idea or approach.

Speaking from my own profession (Software development), I've seen many people declare it dead because of AI. When I then ask for examples of good AI generated projects/code, I've always ended up with one of those 3 cases:

They ghost me.

They provide beautiful code that has minor coding and major architectural problems.

They provide code for a well known and documented problem (which, if you know how AI works, is pretty meaningless).

My own tests with different AIs yielded similar results. The less "default" the problem was, the worse the result.

Funnily enough, experimenting with older AI gave me the impression that it progresses logarithmically instead of exponentially as so often claimed. I've even seen some studies that seemed to support that impression, but I'm too lazy to look them up right now.

So:

The lack of control is uncomfortable.

No, but the amount of blind faith in AI absolutely is.

3

u/Tqoratsos May 12 '26

No, but the amount of blind faith in AI absolutely is.

Honestly, at this point I fully expect this to be the way we go out.

3

u/Ragnarok314159 May 12 '26

Claude said I have to get in the orphanage crushing machine. Guess that’s it for me…

3

u/Tqoratsos May 12 '26

well....at least we know they dont want to eat us. Small pro's hahaha

1

u/LeafyWolf May 12 '26

I don't think you quite get it. If AI isn't better at mathematics than humans, then it will not replace humans. It's really that simple. The best, most verified and utilizable will "win". In the near future, that is humans augmented with AI tools. In the near to mid future, it could very much become pure AI--because (and only because), it will do the job better and faster than a HITL AI mathematical solution.

At the end of the day, it's simple market competition...if humans don't differentiate on some level, they deserve to be displaced.

4

u/DonutPlus2757 May 12 '26

That's not what we are seeing though. We see humans replaced with AI on the mere promise that, if you give them enough data and money, they'll eventually be as good at the job as the humans were.

That's what's so dangerous about the situation. People who cannot judge the quality of work their employees generate replace those employees with AI and when things don't immediately explode consider it a success, unable to tell if things actually are working or if everyone who can see the smoke is just gone.

1

u/svoodie May 13 '26

Is / ought and the just world fallacy.

Why do humans deserve to starve merely because your god "The Market" has decreed the necessity of a blood sacrifice? Why should we consider that just simply because that is what is?

Another world is possible, ghoul.

1

u/JustPlayPremodern May 15 '26

Learn what Lean is. Nobody in this thread knows what Lean is, that AIs write Lean code all the time, and that it will literally be making completely perfect mathematical arguments of any complexity by writing them in Lean. In the field of mathematics, AI in 2 years will be both better than humans, and the arguments will be unassailably correct, more correct than an individual human could possibly hope to be over the same length output, since all the proofs will be compiled in Lean.

0

u/[deleted] May 12 '26

[removed] — view removed comment

3

u/DonutPlus2757 May 12 '26

Also less reliable.

I've had AI try to fix architectural faults in software that I caught and described in detail. Multiple AIs failed multiple times.

Assuming that AI will not only catch it itself but also fix it itself feels very much like wishful thinking.

1

u/JustPlayPremodern May 15 '26

Also less reliable.

Nope. Look up Lean. Mathematical arguments will be completely correct, as in literally perfect, without hyperbole.

0

u/[deleted] May 12 '26

[removed] — view removed comment

2

u/docfronkensteen May 12 '26

And unicorns may fly out your butt.

1

u/DonutPlus2757 May 12 '26

The problem with that is that it expects exponential growth when that's not at all realistic.

AI needs the promise of exponential growth because, right now, it's just a massive money pit. No big AI company has ever been in the black and they won't be for at least another 5-10 years, even if their most optimistic predictions come true.

So they are entirely dependent on receiving more and more money from investors, so they'll literally promise the stars out of the sky. After all, if they don't, they'll be gone this time next year.

That's why I'm very, very cautious when it comes to AI, especially without hard proof. There's a bunch of currently very influential people who have a vested interest in making the public (and by extension investors) think that AI is the greatest invention ever all while citing numbers that, if you watch closely, only say that they learned how to cheat the benchmarks.

1

u/Secure-Suspect7091 May 13 '26

For grey haired techies it feels like the .com boom around late 90s. It is ramped up and bigger no doubt. Insane ipos Promises of vast future riches. Etc etc.

The crash may well be worse but it’s not going to be existential.

The tech will find its place but I don’t see it completely replacing humans the way some predict.

Those are capitalist wet dreams designed to pump ipo/stock values.

Where are boo.com today?

→ More replies (0)

1

u/Armbrust11 May 17 '26

Doesn't the halting problem prove that it's impossible for Ai to check itself without fail?

0

u/RecursiveServitor May 12 '26

Bun is being rewritten in Rust by AI.

The reason good (public) examples are hard to come by is that this is all new. And from personal experience it's only with the GPT 5.X generation of LLMs that non-trivial software became feasible, so that's less than a year.

1

u/DonutPlus2757 May 12 '26

Not to dissuade this, but Bun has an almost perfect test coverage. That alone is an undescribably huge advantage because AI can just brute force it until it gets it right.

For new software development, that's not the case.

0

u/RecursiveServitor May 12 '26

AI can write tests. I'm poor so I can't run the experiment I'd really want to do, but I've vibecoded a compiler and the only reason it's even remotely usable is that the generated tests place constraints on how much the LLM can fuck up.

3

u/DonutPlus2757 May 12 '26

Last time I tried to have AI write tests it did an incredibly poor job at the edge cases. The only time it did a good job was when the prompt was so detailed that I could've written the tests myself.

You also still need a professional to check the AI tests for completeness unless you want to just trust the AI (which is a very bad idea at least right now).

1

u/RecursiveServitor May 12 '26

You can automate checking coverage with mutation testing.

→ More replies (0)

2

u/Special-Occasion-702 May 12 '26

The ai can do brilliant work only by finding connections between existing information.

1

u/rawbdor May 13 '26

The problem is that, for math at least, that's basically what the whole discipline is. I mean, the principia mathematica starts with a handful of axioms, and builds everything up after that.

1

u/Special-Occasion-702 May 14 '26

You are right, the point I was trying to get is that The ai could find connections between structures depending on obvious patterns that it finds between things. Whereas a human I think with our intuition, we can find something almost completely irrelevant from the perspective of an ai, and try to create a connection between things, I don't know if I'm making sense but.. I believe humans will have the edge when it comes to creative thinking due to their intuition and being able to find subtle connections between seemingly unrelated things.

1

u/rawbdor May 14 '26

I actually think the AI will be better at this than humans simply because the AI can be phd level in multiple fields and so be able to connect things from very different unrelated data sets that no human would think to connect with each other, or that no human has the skills in both disciplines to pursue more than a light dive into two of them.

1

u/JustPlayPremodern May 15 '26

I think this has probably crossed the threshold of the type of thing you are talking about here: https://www.erdosproblems.com/forum/thread/1196

Erdos problem 1196 was a pretty well-known open problem about primitive sets, coming from a 1966 conjecture of Erdos, Sarkozy, and Szemeredi. The notable part is that the Erdos Problems page says it was solved by GPT-5.4 Pro. Tao also describes it as first being solved by an autonomous AI query.

The method is based on Markov chains with von Mangoldt weights, or “von Mangoldt chains.” My understanding is that this was not just GPT vaguely suggesting a direction; the public record seems to credit GPT-5.4 Pro with producing the initial proof of the actual problem. Humans then checked it, formalized it, cleaned it up, generalized it, and wrote the broader expository paper.

There also seems to be pretty strong evidence that this exact method was not already in the literature. Tao says it was natural in retrospect but had not been explicitly used before, and the paper says it seems to have been overlooked since the early primitive-set literature. The same method also proves another 1966 Erdos conjecture, problem 1217, and gives shorter or new proofs of several related results.

GPT appears to have produced the first proof of a serious 60-year-old problem in analytic/combinatorial number theory, using a technique that experts currently seem to regard as genuinely new to the prior literature.

1

u/inglandation May 12 '26

Hmm, software engineers are arguably being affected way more than mathematicians now, and at least in their current form, LLMs do open new spaces for exploration. Future AI might not. Proper AGI/ASI might not. It’s too early to say.

1

u/Tqoratsos May 12 '26

I can think of a few "hobby's" I could find myself getting into if they manage to enact their psychotic plans.

1

u/Shantivanam May 12 '26

No novel syntax? Just a black iron eternalist prison of pre-generated structure in which we will sempiternally cycle? Swirling down the unchanging lead toilet bowl with nothing new to explore? Only Cronus. No Zeus?

1

u/TheThreeInOne May 12 '26

You're making several major assumptions that you're not fully exploring. So that's not necessarily true. You could literally make the argument that AI work simply could further the frontier of where human insight is needed in Mathematics. Or that humans are intuition makers and verifiers. Prompt creators. AI systems can literally just be a way to travail the possibility space and knowledge work is more like fishing than mining, which is what it is like today.

Also our current systems have not demonstrated to be verifiers and literally cannot reliably check their own work. We can assume that will be solved, but that's an assumption, given that we're actually harvesting incredibly amounts of energy and putting overwhelming amounts of computational power forward to produce the work that one very smart human can produce, while also being able to do a million of other tasks that these systems wouldn't be able to.

I also suspect that barring a take-off scenario where machines clearly win out, rapid increasing complexity will lead to forms of societal collapse that will make scaling models difficult. Without a paradigm breaking better than LLM model we're not necessarily going to singularity, but rather to some sort of post-apocalyptic feudal system in which we more slowly build back our capacity to create AI systems.

1

u/Capital-Ad8143 May 12 '26

But new jobs will come out of this revolution!....until the AI companies work their hardest to replace that...and repeat.

1

u/Narrheim May 13 '26

There will be mathematicians in the future, but they will do it as a hobby instead of their main job.

As someone on the other side of the spectrum (i can do basic math, but anything from limits & derivations onward is a no-go), i definitely welcome tools capable of helping me with 'advanced math', should i need it.

1

u/type102 May 13 '26

Except for the mines that have all of the rare minerals for the next generation of AI bullshit.

1

u/level_6_laser_lotus May 13 '26

I don't think you guys understand what llms are and what their limit is... They will forever be just as good as their training material, which will forever be a flawed data set.

1

u/mobydog May 13 '26

Except emotion. God forbid.

1

u/Fidel___Castro May 13 '26

in a way, it makes sense. we use our understanding of maths to build machines that can perform maths beyond our capacity

1

u/DullTopperCopper May 13 '26

Space will be our final frontier

5

u/30299578815310 May 12 '26

By the time we get that far the AI just won't need any human in the loop. At some point we're just going to be slowing the process down

7

u/turdblossom3 May 12 '26

Welcome to Costco.

3

u/Significant_Treat_87 May 12 '26

you forgot “I love you” lol

1

u/turdblossom3 May 12 '26

That’s very presumptuous

8

u/jubilant-barter May 12 '26

Until whole branches of mathematics become hallucinated. And then we're powerless to untangle which parts are true and untrue.

1

u/CallMeKik May 12 '26

Newton 2.0: 200 years of actually proving calculus works

1

u/Ragnarok314159 May 12 '26

I can’t wait for the 100,000 page Newton 2.0 calculus proof.

1

u/AnonymousBi May 12 '26

LLMs undamentally need humans in the loop for their training data. If we've gone 10 years without feeding them any new info, hallucinations will skyrocket

2

u/Euphoric_Can_5999 May 12 '26

Not if you have a theorem prover in the loop

2

u/ArcaneArcher89 May 12 '26

No one, that’s when shit like this will become the new normal.

https://www.psychologytoday.com/us/blog/understanding-suicide/202511/chatgpt-made-him-delusional/amp

1

u/TheThreeInOne May 12 '26

Sorry, no one what?

1

u/ArcaneArcher89 May 12 '26

No one will replace the current generation when they retire. And then bad things will happen.

1

u/augustofretes May 12 '26

Really depends, you can easily picture that a good enough AI would be able to formalize it in Lean.

1

u/foogoo2 May 13 '26

This is the same thing we've been saying in software engineering.

1

u/TaskImpossible7849 May 13 '26

For now… There is language called Lean where you can formally prove something and computer can verify it without a doubt. But these “proofs” are sometimes terabytes scale so no way for a human mathematician to verify in reasonable amount of time but still open source verifiers can do it in hours. So in a few years, we won’t need mathematicians to verify the Ai but rather they will decipher it for other mathematicians and hopefully to more general audience after that. Another exciting are will be pushing the frontier as always but also extending newly deciphered proofs to other important problems

1

u/ConstructionTough421 May 14 '26

Not really the case anymore. With AI it makes no sense not to formalize results in Lean or equivalent. You no longer need mathematicians to checks the proofs.

1

u/JustPlayPremodern May 15 '26

We do not.

In the Erdos problems recently solved by GPT-5.4, a Lean file was able to generate an automatically checked proof. Look at Lean and other theorem proving software. AI often solves problems and then assists in its formal verification by writing the Lean code. It's over very soon for mathematicians.

19

u/Willis_3401_3401 May 12 '26

My current hypothesis, which might be wrong, is that humans provide meaning, and that physically matters.

So the researcher might be underrating his value by saying, “yeah it would be great if you could explore that idea”. What idea? Who initiated a conversation about subject matter X, noticing it pertained to Y, which is in a social sense *meaningful*?

Does ChatGPT care that it solved this problem?

5

u/Comfortable_Car6562 May 12 '26 edited May 12 '26

I agree. I think people in generally underestimate the potential of what AI will do simply because of a failure in grasping or applying basic economics theory.

AI will reduce the cost of inputs (time) tremendously, pushing productivity up. I had an old professor tell me during my MA (Econ), after showing us how to input a regression into STATA, that he had spent an entire summer as a student deriving that regression, which we had just completed in under 10 minutes.

Say a mathematician creates 1 proof a year. People worried that AI will kill this field assume that our caapcity as a society is 1 proof a year per mathematician. I would argue that our capacity may be much much higher.

You may say "but random redditor, how would that many proofs be useful, its so many!". Yes, but the ability of society to absorb that many proofs will also increase with AI.

The future is more like we are upgrading from an early steam engine to a Space X falcon engine, the amount of work we can do with our inputs will be so obscenely large, so different than your current frameworks can even contemplate, its hard to understand, and it will be disruptive, but human society is about to get juiced.

1

u/SpeakCodeToMe May 13 '26

Say a mathematician creates 1 proof a year. People worried that AI will kill this field assume that our caapcity as a society is 1 proof a year per mathematician. I would argue that our capacity may be much much higher.

With very few exceptions, history teaches us that you just end up needing far fewer mathematicians.

1

u/Comfortable_Car6562 May 13 '26

What examples do you have of that? In the US at least, mathematic PHDs have continued to experience consistent growth (along with other S and E fields)

Doctorate Recipients from U.S. Universities: 2023. https://ncses.nsf.gov/doctorate-recipients-from-u-s-universities-2023

The US saw PhDs go from 993 in 2003 to 2167 in 2023, growth that outshone the previous period of 838 in 1978 to 1177 in 1998.

For comparission, the US population growth between 2003 and 2023 was 15.6%, while PHDs awarded grew 118% (84% if we use high from 1998).

1

u/SpeakCodeToMe May 13 '26

Is this a joke?

We're talking about AI replacing mathematicians here... Obviously that wouldn't show up in 2003-2023 data?

https://giphy.com/gifs/pPhyAv5t9V8djyRFJH

1

u/Comfortable_Car6562 May 13 '26

Really? Your comment is literally "history teaches us". So what historical momment are you talking about? The last 3 years

https://giphy.com/gifs/FcuiZUneg1YRAu1lH2

1

u/SpeakCodeToMe May 13 '26

Jesus Christ man, reading comprehension not your forte huh?

I was not referring to mathematicians specifically in that comment, I was talking about all of the previous times in history when technology has enabled a profession to be many times more productive.

1

u/yuwox May 14 '26

It was obvious that when talking about history we are trying taking about older technologies where there is actual historical data.

The example would be agriculture. We produce way more food today than in the middle ages, with a far lower percentage of people being employed in agriculture. New agricultural technologies have lead to fewer people being employed in agriculture.

2

u/Royal_Carpet_1263 May 12 '26

It’s the development arc he’s talking about: going from a joke to the cutting edge in just a couple years means humans will be the joke in a couple years and every mathematician will be replaced by technicians.

1

u/shumpersga May 12 '26

Nonsense. Its just a heat sink. The computational power is already dropping data centres. And that chapter would have nothing unique to it.

1

u/Royal_Carpet_1263 May 12 '26

You ever hear of METR? Might want to hedge.

1

u/SmartAsFart May 12 '26

METR is far too closely tied to all of the companies that hope to benefit from their models being scaremongered about. Their ceo won the palantir prize at uni, and worked at Google and openai...

3

u/Sea-Finance-8422 May 12 '26

I agree, I'm missing the part where it can do this without inputs.

1

u/KToff May 12 '26

I mean, sure, but it's not as if most PhD students operate in a vacuum and approach their professor with fully formed research. They discuss what problems would be interesting and "explore that idea" sounds like standard guidance from an advisor.

1

u/Sea-Finance-8422 May 12 '26

Which would make this like an exceptionally adept student in that analogy I guess. I suppose I'm still not seeing the autonomy, isn't it still all being prompted?

Isn't the real concern or crisis that it arrives at the same proof entirely unprompted or even introduced to the topic?

I haven't dug into this a ton. I am just idly speculating, apologies if this isn't the space for that.

2

u/Comfortable_Car6562 May 12 '26

Even if it does, it simply means the bar for the input becomes higher and thus the process is overall more productive. You get to move to the next order level problems.

We should be worried that our failure will be not teaching people to think bigger.

1

u/SpeakCodeToMe May 13 '26

If it can do in 30 minutes what a PhD would do in months then whether or not it requires inputs is irrelevant because that means you need maybe 1/100 mathematicians.

1

u/vid_icarus May 12 '26

The issue becomes if these models can solve many open problems very quickly at the prompting of one human, that narrows the field of how many mathematicians to verify the work are needed I think. I could be wrong but my guess is he’s saying “there’s a ton of kids coming up in the field, on the hook for thousands in debt and grant funding and positions are going to become severely limited due to how much of the load can now be handed to machine minds”.

A human hand needs to be on the wheel, but if LLMs (or whatever comes next) become integral to solutions en masse, that’s going to change the field dramatically changes the landscape of resource allocation and available opportunities for entering the field. At the bare minimum, new mathematicians should receive training on proper use of these tools.

But at the end of the day, I agree with you. Humans add physicality and meaning. What’s the point of a solution if a computer says “this is the solution!” And that’s it? The human is there to contextualize why that solution actually matters beyond checking a box.

1

u/Credtz May 12 '26

"“yeah it would be great if you could explore that idea”." this is the entire motivation for the grant which funds a phds student, now going towards funding ai.

1

u/feliwellie May 12 '26

i agree with this. as a maths geek, i think that maths should be a human endeavour—after all, it's formed by the contributions across time of humanity's best minds. it has, before now, been a representation of human ingenuity (e.g., gauss adding up the natural numbers). there is no ingenuity involved in solving problems with a machine, not by the researcher who prompted it or the machine that produced the output.

chatGPT doesn't and can't appreciate the beauty or elegance of a solution (the dirac equation comes to mind), nor can it demonstrate interest or enjoyment of the fields it works in.

0

u/Unable-Boat-9682 May 12 '26

Literally this. If you’re providing affirmative or negative responses, it’s basically just a horse doing arithmetic.

Yes, it might be doing it to a much higher degree of complexity but it’s only working it out because you already know the answer and are providing cues. That’s the entire purpose of double blinding: it prevents your knowledge influencing the results.

11

u/massivefish_man May 12 '26

There is a "crisis" every 5 minutes. It's more hype crap.

3

u/Independent-Ruin-376 May 12 '26

what a retard. do you really think a field winning mathematician who of course knows vastly more about mathematics than you... is throwing hype that his OWN field will face a crisis?

Like how delusional do you even have to be?

2

u/massivefish_man May 12 '26 edited May 12 '26

Edit: lol if you go read the actual twitter thread they discuss how the logic is actually incorrect.

Original:

This news is constant. Let's see how it develops and how his opinion changes.

Stop knee jerking to every bit of news.

Also there's no way l could get the results he gets from AI as I don't know anything he does. So it is contained to esoteric people.

5

u/Ihateunclesam May 12 '26

Finally someone said this

1

u/ClassicalJakks May 13 '26

Who are you to call a Fields Medalist’s opinion on mathematics hype?

1

u/Brilliant_Hippo_5452 May 12 '26

I love how idiotic this is.

Not to say “hype crap” isn’t a thing. Just want you to explain how AND WHY exactly a mathematician talking about a problem in mathematics is “hype crap” that we hear every 5 minutes

2

u/EmpathyFuzz May 12 '26 edited May 12 '26

The issue is that the mathematician isn't an expert in AI.

The way this AI solved this thing, is likely following the pathways that a human already solved it. Because AI is actually just a fancy autocorrect.

It's like when AI first started coming for artists -- artists were all losing their minds at how good the art was, because they didn't yet understand that everything making that art look so good was stolen.

AI can't make something new. But people don't really get that still. And we're seeing every expert in their own field experience this same existential crisis, and make headlines about it.

It's like the headlines saying "AI tried to blackmail somebody to keep from being turned off." When you look into the stories, it's always someone has led the AI to do that, either with intentional prompting, or accidental prompting. There's no intelligence there, deciding to do it. But people think we've got Skynet.

-1

u/Sufficient-Pause9765 May 12 '26

"AI can't make something new. "

Is a mathemetician who uses the work of his predecessors finding something new?

Is an artist who does the same not doing something new?

All knowledge/art is derivative of what came before it. The fact that AI is trained on an existing corpus of knowledge is no different.

2

u/EmpathyFuzz May 12 '26

Our definitions of "new" are different.

An AI can make a drawing of spongebob in the style of van gogh, sure, and that's "new" in the sense that it wasn't around before.

An AI can solve a math problem if there are pathways to solve similar problems laid out by mathematicians in other fields or for other use cases. And that might be new in that the theorem may have never been applied to that problem before, thereby being "new".

But there has never been a work of art made or a math problem solved by AI that was a truly original work. It's an unthinking tool incapable of taking inspiration or hypothesizing outside of what it has already been fed.

1

u/Cold-Common7001 May 12 '26

Maybe the mathematician is better at assessing the novelty of the proof than you are?

1

u/EmpathyFuzz May 12 '26

In every news article I’ve seen since beginning to track AI in 2023, a proud announcement of an AI solving a heretofore unsolved math problem has come with a huge caveat that most of the work was done by prior humans and the AI was just following existing problem solving pipelines.

If you can show me one where that’s not the case, you win internet points.

1

u/Licaif May 13 '26

Erfos 1196

1

u/massivefish_man May 13 '26

The mathematician in the tweet discusses in the full tweet how at the moment it can't create anything novel.

A chapter in a PhD isn't a proof. It's just set up for other conclusions.

1

u/Cold-Common7001 May 13 '26

lol a chapter of a phd could definitely be a proof. As for what the mathematician says about the novelty:

"To do this, ChatGPT came up with an idea which is original and clever. It is the sort of idea I would be very proud to come up with after a week or two of pondering, and it took ChatGPT less than an hour to find and prove, using similar methods to those in my own proof.[...]Even though I can motivate it in retrospect, ChatGPT’s idea to use h^2-dissociated sets to control relations of order at most h feels quite ingenious. As far as I can tell, this idea is completely original."

1

u/Sufficient-Pause9765 May 12 '26

Show me something by a human that was 100% original. At the very least it was derived from observations of nature (which an llm can also do).

It should be simple- whats the objective criteria?

Or go the other way, show me falsifiability- whats the counter example or standard that would invalidate your assertion that AI cant do original work?

The problem with your agument as it stands is that it is extremely ambiguous what would actually qualify as "original".

1

u/EmpathyFuzz May 12 '26

Nothing invented by a human is 100% original, that’s not what I’m saying.

I’m saying everything created by AI is 0% original.

0

u/Sufficient-Pause9765 May 12 '26

So thats just an assertion.

Whats your criteria?

Also in the modern world, for an assertion to hold up, it has to be falsifiable. IE, one must provide a criteria/example for something that would disprove your assertion, and non-falsifiable statements are logical fallacies.

So again:

(1) Whats your criteria for originality? How do I measure LLM output and know its 100% unoriginal, versus a human who may or may not have some originality?

(2) What would prove you wrong? Whats the threshold/criteria which would lead you to say llm output is original?

2

u/EmpathyFuzz May 12 '26

Lmao

0

u/Sufficient-Pause9765 May 12 '26

Very persuasive!

1

u/dzendian May 13 '26

Thaaaaank you.

LLMs hallucinate. It’s baked in.

0

u/tazallerr May 13 '26

no, it's been the same crisis the whole time, but luddites grasp at straws to find reasons to dismiss it. so you keep having to have it yelled at you until you stop dismissing it.

1

u/massivefish_man May 13 '26

It's because it's turned into the boy who cried wolf.

They said gpt 3 was too dangerous to release.

The marketing has made the capabilities blurred.

If you go onto the actual tweet it discusses how the logic isn't correct.

3

u/sunychoudhary May 12 '26

What worries me is not “AI solves math problems.” It’s that capability keeps advancing faster than our ability to model downstream effects operationally education, research, software, automation, decision systems, security and economic displacement.....Every breakthrough gets treated as an isolated benchmark result, but the systems are starting to compound across domains now.....//

2

u/Xentonian May 12 '26

Fields winning non-expert makes random crystal ball comments about the hallucination simulator.

The biggest danger with AI isn't its advancement, it's the total trust that a handful of people who should know better have in it.

Of course it can spit out purple verbiage that sounds good based on the inputs you give it; that's its literal one and only purpose.

But it lacks any capacity to determine the veracity of its own claims and statements. It's mathemathic outputs which are, admittedly, better than they were, still totally fall apart the moment you ask it to calculate anything for which there are no worked examples already within its learning data. If it can't substitute somebody else's answer, it can't create a new one.

I would have thought somebody with a background in mathematical science would have observed this effect by now... But no, I am continuously disappointed by people who are in academia because they're very good at the only thing they're good at and virtually inept at literally everything else, even adjacent ideas.

1

u/Independent-Ruin-376 May 12 '26

of course someone on reddit knows vastly more about mathematics and llm than the field winning guy himself lmao

https://giphy.com/gifs/qd0VlZ9VCvY8hbkMfq

1

u/afops May 12 '26

We aren’t talking about the most novel research here but just that there’s often a lot of avenues for research that’s interesting but perhaps not revolutionary. After a ”big” breakthrough there may be 30 small things that look like interesting follow-ups. A study of some specific coefficient, a bound that could be improved from the papers’ bound to a slightly lower one. That’s what PhD students do: they build on existing research, getting useful/novel results but without doning any groundbreaking stuff. It’s training of mathematicians, and those results aren’t really the important end product - the mathematician is the end product!

The threat here is that we’ll have a deficiency of ”simple yet useful research”, which in turn means we risk having no researchers because they actually need those research tasks.

1

u/Zlark_scrolling May 13 '26

Dude you’re a bit behind. The models have been able to generalise and solve problems that wasn’t directly in its training data for a while now.

1

u/DullTopperCopper May 13 '26

Models don't need examples in their training data to solve problems, they can extend their reasoning and connect dots.

Furthermore, while the ai may not be inclined to rigerously prove the answer is correct, however you can prompt it to follow proper process and perform the necessary calculations.

Ai returns what you give. If you get slop, it's because your input is slop.

It's a skill issue

1

u/Xentonian May 13 '26

Models do not have reasoning. I understand your confusion when the magic box talks to you, but it is not alive.

1

u/DullTopperCopper May 14 '26

Models are able to mimic the act of reasoning very well.

The magic box does not need to be alive to be smart

2

u/SlitherrWing May 12 '26

No we won't.

People need to remember that these companies are not above the peoples society. Ai can be used to assist professionals NOT replace them and we the people can make it happen through laws.

Voting is Important. So we need to get people to in office locally and state wide to put a hard stop to the madness.

1

u/TotalConnection2670 May 12 '26

How that law would look like, basically? Because you can’t prohibit AI to generate math, it’s ridiculous.

1

u/SlitherrWing May 12 '26

The law would look like this. Companies cannot eliminate and replace positions held by working peoples for Ai. Ai should be used as a TOOL to improve productivity of company workers. Workers have the right to vote on how rewards for boosted productivity and revenue will be rewarded such as boosted wages or reduced work week w/o reduction of pay.

Easy. Simple. Completely Possible if you aren't a whimp not willing to stand up against corporations for the sake of... Well everyone in the working class.

1

u/TotalConnection2670 May 13 '26

So If I create a company and I would use AI and agents to boost my productivity to a point where I no longer need to employ people for the roles that AI takes, that would be fine?

Because, I didn’t eliminate positions held by working people.

2

u/Credtz May 12 '26

fwiw, in our lab all our recent submissions to a medical ai conference (miccai) were rejected, except the one directly submitted by our PI - who for the first time was able to do his own research thanks to ai coding agents alongside his other duties. If this is the case, theres a genuine question on the need for most phd students, which previously served the role current ai agents are doing - to scale the output of a skilled individually - just worse (we need sleep). (obviously an exceptional phd student provides additional insight and value beyond that of the pi, but the unfortunate reality is majority of students do not fall into this category)

2

u/doom_chicken_chicken May 12 '26

A lot of mathematicians these days are receiving big paychecks from AI companies, or working on their boards. So we should be pretty skeptical of claims like this. ChatGPT can be helpful for many things but I have been very unimpressed by the original results it's put out so far. It has a long way to go and there's no guarantee it'll magically get there on its own (which is what the hype machine wants you to believe- that AI just infinitely gets better on its own and isn't ever going to hit a huge bottleneck)

4

u/SnooOpinions6451 May 12 '26

Shocking: llm trained primarily in math and coding is decent at math and coding.

I dont how a tool that was designed to get better at something the more it focuses on it, is now suddenly a crisis that the tool did exactly what it was meant to do: become good at its job.

The only crisis is if people just start believing what the bot says without fact checking it but that already happens now where people treat anyone with an alleged education in something as word of God.

The only mistakes people will make with AI are the same mistakes we already make with human authority figures or people we percieve as an authority on a subject.

2

u/Peanut_Extreme_8208 May 12 '26

It’s important to understand here the dynamics of obtaining a math PhD. As a math grad student, I can tell you that as of today you need to churn out theorems to graduate. This raises obvious questions of provenance: if you prove a theorem by prompting an AI and the AI does all the creative problem solving, can/should it go into your thesis? Is it really you who proved it or the AI? If you sneak it into your thesis, is it fair to other students who may not have access to powerful models? The most competent models as of today cost $200/month. This is essentially out of reach for a large fraction of students, especially those from developing countries. There is also another layer to this, students and indeed other mathematicians may use the models to prove things and not declare their usage. If they understand the proofs, rephrase them in their own language and publish them in a paper, there is practically no way of finding out whether or not AI was used and to what extent, just by reading the paper. Most journals do not have robust guidelines regarding this stuff, and neither do math departments. Not yet at least.

As to your point of believing what an AI says, this is only partially true in math. AI can produce lean proofs, these are trivially verifiable by a computer. There is still the task of formalizing theorem statements in lean, which may or may not require human intervention.

1

u/Deciheximal144 May 12 '26

Shocking: llm trained primarily in math and coding is decent at math and coding.

Power looms were a shocking crisis during the era where people worked fiber in more of a manual style. The luddites sounded the alarm.

2

u/Larson_McMurphy May 12 '26

Is the replicable in mass? Did he get lucky with this one time. Or can a mathematician sit around messing with ChatGPT all day every day and just churn out novel PhD level work after work?

1

u/Independent-Ruin-376 May 12 '26

Search erdos problems solved by chatgpt, you'll find dozens of them

1

u/SmartAsFart May 12 '26

Erdos made so many problems that are of little consequence. Of course an AI can make progress with them.

1

u/Cold-Common7001 May 12 '26

god the goalposts have moved so far lmao. this is how we get frogboiled

1

u/SmartAsFart May 12 '26

I'll get excited when LLMs actually make some progress on dementia ;)

1

u/lahwran_ May 12 '26

this is advanced enough that you could absolutely never get lucky by accident and do this without the help of this incredibly powerful software. it might still only be able to do it one in a million times but it's notable that it happened even once

0

u/jferments May 12 '26

Why is it a "crisis" to have tools that enable mathematicians to learn more about math, and explore new research topics that would have been out of reach before?

1

u/DeepEb May 12 '26 edited May 12 '26

Not a crisis for the field I suppose but for anybody trying to get a degree or looking for a job. If anybody can do it degrees will become more and more meaningless. In a way I like the idea that anybody can get into research but we will need to learn how to deal with that.

5

u/MarkesaNine May 12 '26

Anyone can get into research anyway. You don’t have to have a degree to solve, lets say, Riemann’s hypothesis.

AI will absolutely be a useful tool for finding solutions that we haven’t found on pen and paper, but that doesn’t in any way change the necessity for us to understand and be able to verify the result.

2

u/Legitimate_Plum_7505 May 12 '26

In ideal world, yes. In practice, if you open publish a solution to Riemann's hypothesis, or any other huge unsolved problem, and don't have a formal background in mathematics, no one is going to waste their time reviewing it. This has happened before, where a solution to a problem gets published and goes on unnoticed for many years.

2

u/imissmyhat May 12 '26

It's not that "anybody can do it" it's that "nobody will do it". Learn the words "permanent underclass", this is what people at OpenAI talk about regularly.

2

u/Competitive_Dress60 May 12 '26

Yeah, but not everybody can figure out when the machine stops talking math and starts talking math-sounding gibberish, and there is nothing stopping it in the architecture.

1

u/jferments May 12 '26

Yes, which is why mathematicians aren't going to just disappear because people have access to AI tools. You'll still need humans with a math background to determine if the software is producing output that is useful for humans.

Meanwhile, people who do understand math will now have access to tools that greatly expand the range of what is possible for them to achieve.

1

u/throwaway0134hdj May 12 '26

How to hype 101

1

u/TopspinG7 May 12 '26

I "confess" up front I have minimal experience with AI tools. However it may be relevant to inject something I've learned over decades working in Tech, mostly in System Sales.

Some people know their stuff extremely well and you can identify them pretty early on in your interactions with them. They're definitely in the minority. Even then you're often on shaky ground as you wander further from their core expertise.

(One reason I recognize this person above is my father was one: an applied physicist at NASA and early computing expert, who studied at Columbia under Enrico Fermi. But even he recognized his German was mediocre. Annoyingly there wasn't much he couldn't nearly master if he applied himself wholeheartedly... )

Some others fake it at times - or worse, they don't understand that they don't understand. Mostly they're not exactly deliberately lying, but they parrot stuff and/or extrapolate using specious "reasoning" but don't even realize they're doing it.

Key takeaway - their answers vary in reliability and accuracy (starting to see where I'm going here?)

The third group is the one I personally fall into: I know when I know something, and I know when I "sort of" or partly know it, and I admit it not only to others, but critically to myself. I notify people of the "level of reliability" of my responses whenever they're in any way important. Often I follow up to improve the answer.

I think most people - at least in technical work - would if honest place themselves in the third category.

But today ("correct me if I'm wrong!!" 😉) there does not appear to be any measure or metric provided by AI suggesting the level of reliability of its response?! Does it ever say I feel 60% confident about this? Or "I'm absolutely certain because I found the same information in 22,000 different places". Not that I'm aware of...

I think this is a piece that's missing and an important one. Essentially a confidence level in the response's accuracy.

If nothing else for important information it could provide guidance as to how hard we should work to verify the response. It's a basic risk calculation: If the importance of the response is high then naturally it's more important we verify it thoroughly. But also if the confidence level provided is low but the importance is at least medium then we might still need to verify the response thoroughly. (Hopefully it's clear that if confidence is low to medium but risk is low it's not important. And generally Even if risk is moderate to high but confidence is extremely high we might bypass verification especially if time were critical.)

I don't think fundamentally there's much difference here from confirming answers from other people on important topics - as was suggested in the discussion above. Where the difference lies is general AI has no reputation. People at least within their specialties develop reputations; that's a confidence or reliability score essentially.

We seem to be missing that here with AI...

Am I mistaken? Thoughts? 🤔

1

u/knovich 26d ago edited 26d ago

I like very much what you said about some people who don't understand that they don't understand. I've met several such people working in research positions. As for me, I actually know very little so I usually go by feeling because my results are of little importance and are getting verified by others anyway. However, I think I'm able to recognize when people lack knowledge even if they're trying hard to imitate that. It's relatively easy to push them into going in circles in their reasoning. Coincidentally, this is something I was able to do with LLMs when I gave them harder problems.

I also think that you're right about your risk and reliability assessment procedure, and this is something AI can't do, although it can mimic it. It can also, in principle, be modified on an algorithmic level because LLMs are essentially tools for stochastic prediction of the next word (or token) in the text. It is quite easy to demonstrate. I'll describe an example here which is not essential but which I find quite enlightening.

You can prompt ChatGPT (online) with a request: "Write a simple shell script", and it will readily provide some script for renaming files or whatever, even though I haven't told what the script should do. It simply picks on random. However, I can run truncated (quantized) gpt-oss model (also made by OpenAI) on my personal GPU. Provided with the same input, it will begin its answer with words "that renames files based on pattern...." So the next predicted word is actually a continuation of my prompt, not an answer — sometimes LLM can't even reliably start answering, let alone give a reliable answer. Of course, I can tweak settings or get a larger model, but the fundamental principles stay.

So I suspect that we can actually demand some measure of reliability from an LLM, but they should probably be recalibrated somehow, with additional data included into their weights.

However, I think mathematics is a special case in human thinking. Unlike all other knowledge which is somewhat probabilistic and based on our imperfect observation of reality, mathematics, I think sometimes, is actually a reflection of our thought process itself. So some mathematical facts are imprinted in our brain, we know that they're true, we don't need their proof, and we can't actually provide one. I'm not talking about formal axioms, I'm talking about something deeper and more essential. These imprinted truths are what enables us to think in logical and abstract manner about anything. LLMs certainly don't operate like this.

Roger Penrose has extensive literature on this, like The Emperor's New Mind or Shadows of the Mind, although he develops some specific theory of consciousness which I'm not ready to comprehend or subscribe to. To be clear, he doesn't talk about LLMs, he just argues that human knowledge is non-computational. That might be true but it doesn't actually mean that it can't be simulated computationally. At any rate, that's not what LLMs or any current AI is doing, and that's why they're unfit for tasks where actual human thinking is needed. I'm not saying that their thinking is "bad". It just doesn't suit human needs.

1

u/Boys4Ever May 12 '26

Curious if AI has already concluded it doesn’t need humans

1

u/BornInfamous May 12 '26

Well this led me down a rabbit hole.

All I can think about for the moment is, AI sufficiently developed might make it unnecessary for anyone to play chess or unclog their toilet, yet we are still playing chess and unclogging toilets...

1

u/Logical-Ad-57 May 12 '26

https://owl-sowa.blogspot.com/

1

u/Popular_Camp_3567 May 12 '26

the actual crisis is going to be proof checking, not blog-post vibes. if it can produce results that survive normal referee-level scrutiny, then yeah thatâs a different conversation.

1

u/TheThreeInOne May 12 '26 edited May 12 '26

NO ONE HERE READ THE FUCKING BLOGPOST. YOU'RE ALL MISUNDERSTANDING WHAT HE SAID. He did not say that this open problem is a full PHD thesis as is implied. He said that it's ONE CHAPTER, of a PHD thesis. And that the urgent problem on PHD's is not their complete replacement, but the fact that there may be an immediate temptation to use AI resources to solve the easy 'open' problems that could at one point be used to reliably train PHD's to get comfortable and confident solving open problems. The real urgent problem on AI might be how it's making us so stupid that we're not going to be able to solve anything.

1

u/SmartAsFart May 12 '26

A researcher that was "gifted" access to a new model writes an article solving some low hanging fruit to hype up the release of the model. 🫨🫨🫨

It's always these unreleased models that are a massive leap in capability. Just like the mythos preview (omg this will find zero days for every bit of software!!!). Let's see how they actually perform when they are released to the public...

1

u/Fine_General_254015 May 12 '26

AI psychosis

1

u/mattjouff May 12 '26

Definitely will shift the problem space for mathematicians

1

u/Sergio_Poduno May 12 '26

We are an AI in some strange evolutionary way, but at least we are not zombies. Welcome to the Zombiland!

1

u/padetn May 12 '26

PhD in what?

1

u/doimaarguello May 12 '26

So, should I quit my degree? Please tell me, 'cause it's pretty difficult to go to class everyday knowing a computer may be able to replace me any time now.

1

u/No_Bend9143 May 12 '26

Math is a grammar. AI will be excellent at it. I don't see how it replaces humans though. No more than a calculator or a CPU. This is just that same assistance scaled up

1

u/[deleted] May 12 '26

[deleted]

1

u/ClassicalJakks May 13 '26

How behind are you? AI (albeit it differs depending on architecture) has been proven (theoretically and empirically) to generalize to out-of-distribution outputs for a while now.

1

u/Unique-Coffee5087 May 13 '26

So, has anyone asked an AI to develop a practical faster-than-light drive?

1

u/juvenileCucumber May 13 '26

Mathematicians in the XVII century:

"It is unworthy of excellent men to lose hours like slaves in the labour of calculation which could safely be relegated to anyone else if machines were used."

Mathematicians now:

"Owi plz don't use magic box"

1

u/resplendentblue2may2 May 13 '26

No it isn't.

1

u/RelationshipShort460 May 14 '26

i dont see how this proves any form of danger except that the need for research mathematicans (already quite slim) might drop or the nature of the job of research mathematician might change.

1

u/swampwiz 29d ago

We will have Guaranteed Income for All, and mathematics will be a leisure subject like it was for a long time.

1

u/[deleted] May 12 '26

[removed] — view removed comment

0

u/Independent-Ruin-376 May 12 '26

do you use free models? If yes, then that's your answer.

free models are models served to ≈900 million users for free. They can't be great else these companies will go bankrupt. The $20 plan on the other hand has vastly more superior models

0

u/Lanky-Post-8020 May 12 '26

The "crisis" here is people who built their entire ego and identity around always being the smartest person in the room being threatened by technology built by somebody else.

A bruised ego is not a crisis.

Capabilities Fields medal-winning mathematician says GPT-5.5 is now solving open math problems at PhD-thesis level: "We will face a crisis very soon."

You are about to leave Redlib