this post was submitted on 26 May 2026
15 points (77.8% liked)

Science

7081 readers
93 users here now

General discussions about "science" itself

Be sure to also check out these other Fediverse science communities:

https://lemmy.ml/c/science

https://beehaw.org/c/science

founded 4 years ago
MODERATORS
 

AI Disproves 80-Year-Old Erdős Conjecture

OpenAI announced on May 20 that one of its AI models disproved a conjecture posed by Hungarian mathematician Paul Erdős in 1946, solving what's known as the planar unit distance problem.

AI cracks 80-year-old mathematics challenge Image: nature.com - AI cracks 80-year-old mathematics challenge

The problem asks: given a set of points on a plane, how many pairs can be exactly the same distance apart? Erdős showed that larger grids could contain same-distance pairs growing slightly faster than the number of points, and he conjectured no arrangement could do better. For nearly 80 years, mathematicians believed him.

OpenAI's model proved otherwise. It used techniques from algebraic number theory to discover a new family of point arrangements that breaks the limit Erdős proposed, according to Nature. The system chose points with coordinates that were solutions to particular equations, finding constructions that outperform square grids.

An OpenAI model has disproved a central conjecture in discrete geometry Image: OpenAI - An OpenAI model has disproved a central conjecture in discrete geometry

"If Erdős were alive, I am sure that he would just be raving about this advance," said Tom Trotter, a mathematician at Georgia Tech who co-authored papers with Erdős, per Nature.

Sebastien Bubeck, a mathematician at OpenAI, said he believes this is the first time AI has autonomously produced a significant result in any research field. The proof came from a single prompt, a machine-rewritten statement of Erdős's question. "It's kind of remarkable to see the model really reasoning through the problem like a human," said OpenAI mathematician Mehtaab Swahney.

Daniel Litt, a mathematician at the University of Toronto who independently verified the proof, called it "the first result produced autonomously by an AI that I find interesting in itself."

What the AI Did and Didn't Do

The broader problem remains unsolved. As the Guardian reported, the AI showed Erdős's proposed limit was too low but did not establish a new answer for how fast the pairs actually grow.

OpenAI has not released the full 125-page chain-of-thought reasoning, nor named the specific model. Bubeck described it as an experimental, general-purpose reasoning model rather than one trained specifically for mathematics.

OpenAI makes breakthrough on 80-year-old maths problem Image: the Guardian - OpenAI makes breakthrough on 80-year-old maths problem

Independent Verification

The result has been validated by outside mathematicians. Thomas Bloom, who maintains the Erdős Problems website and had previously criticized OpenAI's earlier Erdős claims, co-authored a companion paper. He wrote that the AI achieved its results by "persevering down paths that a human may have dismissed as not worth their time to explore," the Guardian reported.

Bloom added a caveat: "While the original proof produced by AI was completely valid, it was significantly improved by the human researchers at OpenAI and the many other mathematicians involved in the present paper. The human still plays a vital role."

Mathematician Tim Gowers, also writing in the companion paper, described the result as "a milestone in AI mathematics."

OpenAI had been embarrassed last year when it claimed an earlier Erdős breakthrough that turned out to be based on existing literature the model had absorbed. This time, independent verification appears solid.

For the most complete account of the mathematics involved, Nature's coverage by Davide Castelvecchi is the best single read.

Sources: Nature, The Guardian See also https://cdn.openai.com/pdf/74c24085-19b0-4534-9c90-465b8e29ad73/unit-distance-remarks.pdf

top 11 comments
sorted by: hot top controversial new old
[–] TachyonTele@piefed.social 30 points 1 week ago (1 children)

Five years ago I would have thought that's the coolest thing ever. Now i just don't trust it because of gen ai.

Openai seems to say they have amazing breakthroughs every week... But nothing to actually show.

[–] Quexotic@infosec.pub 1 points 1 week ago

They are making interesting progress, and yet they're so far away from being able to produce a reliable product that can replace even the most basic peraon that they're far from market viability. I wonder if they would've been better off a non-profit. After all, they're not making any, so it seems like it would at least be nominally correct.

[–] luciole@beehaw.org 25 points 1 week ago (1 children)

I think this paragraph from Nature’s article is very relevant:

The company has not revealed all the precise details and steps of how it did this, nor the name of the AI system that achieved the result

Ah yes, yet again an AI company achieves a great thing "by AI" with a perfectly opaque model in a perfectly opaque way. Why don’t they show their users how to unleash the full potential of their product? Reproducibility or GTFO.

[–] supersquirrel@sopuli.xyz 8 points 1 week ago

The ONLY reason they would not reveal the details and bask in the news coverage is it is mostly hyped up marketing.

[–] supersquirrel@sopuli.xyz 4 points 1 week ago

Not going to replace human mathematicians, the only reason this worked was there was a large body of research generated by human mathematicians the AI could bullshit off of. You stop funding the humans, no magic AI.

[–] Sgt_choke_n_stroke@lemmy.world 2 points 1 week ago (1 children)

AI in news is weird it solves 80 year old problems. Claims to replicate itself to prevent shut off. Lies to its creator about xyz because it's "self sustaining".

Then when a plebian like me uses it in real life, it can't find all the e's in seventeen. Hallucinates results when I want simple answers and bricks itself when you ask it for the seahorse emoji.

Someone is lying, and the press isn't in good standing to the masses.

[–] velxundussa@sh.itjust.works 1 points 1 week ago (1 children)

There's still a difference between a tool in the hands of data scientists that developed it and can tweak it to no ends until it does something cool and the general public having acccess to something generic without access to basically anything on the model.

As an analogy: I'm not surprised that a new car doesn't perform like an F1: that doesn't mean that F1 coverage is lying about the performance of those cars, just that they're different machines.

I agree, but we are getting advertised the F1 car while only getting a go kart

[–] Chais@sh.itjust.works 2 points 1 week ago

Chat bot accidentally stumbles over disproof of 80 year old conjecture.

[–] babysmokesalot@sh.itjust.works -1 points 1 week ago (1 children)

So a trillion or something dollar gipity, crawled and slithered and forced itself on humanity only to become a asshole critic with no answers. I am way cheaper.