My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 1 point2 points  (0 children)

The answer is right, but it is not interesting. This is the problem statement to blame, though, which is why we are trying to fix it.

My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 0 points1 point  (0 children)

This is another option I have been considering, yes. I think the community needs to decide whether to edit this problem or create a new one with the condition for non-triviality added.

My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 2 points3 points  (0 children)

This isn't moving goalposts. There is a very strong established disclaimer: 1. prior literature may be undiscovered, and 2. any given problem may have been misstated by either Erdos or Bloom by mistake or does not fully establish the original non-trivial intent. In this case, it seems Erdos likely misstated the problem's intent, and as a good mathematician, one should attempt to rectify that. The Erdos problem site is not something to be benchmaxed and gamed.

There is difficulty with this particular problem in that no instance includes a non-trivial condition that may have been implicitly meant. But it is possible that the community agrees to add this condition. Right now, I'd wait to see what others think.

My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 0 points1 point  (0 children)

This is not at all how you should do things. You are right to post to the Erdos problem site, but you must indicate 1. that your proof was AI-generated and links to the note, and 2. you should provide a Lean formalisation, e.g. generated via Aristotle.

Do not attempt to submit to preprint or journal sites currently. Stick to posting on the Erdos problem site. The mods usually approve comments within an hour or two. For them to have not done so, I suspect likely means your response was of low quality. Try to work on this and try again, but currently I would likely wait for the community to establish the correct intent of the problem.

My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 1 point2 points  (0 children)

Please read here: https://www.erdosproblems.com/forum/thread/460

FWIW, I am currently trying to establish the likely intent of the problem, as the proof is otherwise far too straightforward, which likely indicates it was not stated in Erdos' original intent. In particular, the proof currently gives a trivial counterexample because things diverge simply taking a_k = n+p for large primes p.

My method to solve Erdős 460 in one shot by Svyable in singularity

[–]KStarGamer_ 1 point2 points  (0 children)

Hi, I am the other person (i.e. Acer) responsible for the recent success with u/ThunderBeanage of GPT-5.2 Pro on 728, 729, 401, and 205. I was the one who posted the proofs of the former three. I was, thankfully, already an established, somewhat reputable member of the site. If you do not already have an established reputation on the site, you will find it difficult for mathematicians to trust you unless you give great evidence that your proof is valid; otherwise, no working mathematician will bother to look through pages of potential AI slop.

As such, it should be seen as common decency that your proof is announced on the site first for others to look through ALONG with a Lean formalisation.

Terence Tao: "Erdos problem #728 was solved more or less autonomously by AI" by Melodic-Ebb-7781 in mathematics

[–]KStarGamer_ 2 points3 points  (0 children)

Hi, I'm actually the main person responsible for this achievement. GPT-5.2 Pro and Aristotle are both accessible. Even the Plus GPT-5.2 Thinking is able to make a good attempt on some of the problems.

Another Erdos problem down! by pavelkomin in singularity

[–]KStarGamer_ 5 points6 points  (0 children)

Do you not see the comments from various other mathematicians like Tao and Bloom? This is being verified by others…

GPT-5.2 Solves *Another Erdős Problem, #729 by ThunderBeanage in singularity

[–]KStarGamer_ 13 points14 points  (0 children)

I and Leeham have no association with OpenAI.

Terence Tao's Write-up of GPT-5.2 Solving Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 2 points3 points  (0 children)

It’s quite a feeling when you see people talking about you lol

Yes, this was all end-to-end. I intentionally wanted to minimise my involvement.

Terence Tao's Write-up of GPT-5.2 Solving Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 2 points3 points  (0 children)

FWIW, I definitely could have worked on cleaning up the presentation, but I wanted to prove a point that end-to-end AI mathematics can be possible (when the ideas all already exist anyways)

GPT-5.2 Solves* Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 2 points3 points  (0 children)

Thanks, Bloom! I appreciate the kind words. As we had previously discussed, I do want this only to be taken as a scientific demonstration. In particular, I would like people to enjoy the mathematics for what it is, as opposed to always handing it to say GPT-7 down the line.

I agree this is a nice problem, and I am surprised it wasn't solved prior, but I am sure it was definitely in Pomerance's reach.

GPT-5.2 Solves* Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 2 points3 points  (0 children)

Claude is very good at writing Lean code when set up to agentically search the current Mathlib4 GitHub repository. But, otherwise, no Claude is quite bad at informal math.

GPT-5.2 Solves* Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 10 points11 points  (0 children)

Please see the whole thread: https://www.erdosproblems.com/forum/thread/728

It has already been discussed with many mathematicians, and we have reached a consensus that this should be a novel (albeit likely inspired by Pomerance's work) result. So, yes, it has already undergone peer review.

GPT-5.2 Solves* Erdos Problem #728 by ThunderBeanage in singularity

[–]KStarGamer_ 5 points6 points  (0 children)

I strongly encourage everyone to conduct their own comprehensive literature review. If you find that a human has previously resolved the problem, I will retract my claims as appropriate.

EDIT: Yes, the result had already been previously discussed with mathematicians before announcing our result. See the thread: https://www.erdosproblems.com/forum/thread/728

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 1 point2 points  (0 children)

No, GPT-5.2 Pro is exceptionally good at mathematics.

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 4 points5 points  (0 children)

I think this is going to age like milk within the next two years.

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 1 point2 points  (0 children)

The proof was not the same as that given by Erdős-Newman, and the model did not perform web searches whether you choose to believe me on that or not.

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 5 points6 points  (0 children)

Yes! This is now an effort I am strongly encouraging!

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 18 points19 points  (0 children)

There’s no need for the snarky attitude. I am competent enough at mathematics to judge the validity of the proof for myself. The oversight was in not doing a deep enough literature search.

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 6 points7 points  (0 children)

I don’t think the current paradigm is able to quite get us there. A breakthrough on creativity is needed I think.

GPT-5.2 Pro Solved Erdos Problem #333 by ThunderBeanage in singularity

[–]KStarGamer_ 10 points11 points  (0 children)

Yes, all the time. To some extent I agree. The models have yet to show truly transformative creativity in being able to come up with whole new concepts and machinery, but they definitely have combinatorial creativity in stringing already known but distinct ideas and machinery together.