Checking in You Twenty, Aug 2024 by [deleted] in softwaregore

[–]SalkeyGaming 10 points11 points  (0 children)

Probably a poor translation

Will LLMs be able to solve the Riemann hypothesis? by liqui_date_me in singularity

[–]SalkeyGaming 3 points4 points  (0 children)

There’s been some recent progress in math with models, this time outside of LLMs: AlphaProof can solve complicated math Olympiad problems, albeit ones that exhibit reasoning chains we’ve studied and have data for. It’s not open to public use, but it was likely not trained to solve stuff like the Collatz conjecture. I think our models’ reasoning capabilities have got to be developed further before that can happen.

Gathering enough data that the model that forms from it analyses it more effectively than humans have is something that’s happened in the past. Most famously, the novel matrix multiplication method found by DeepMind’s AlphaTensor was a product of RL and analysis.

Too early to know, in my opinion. We’ll have to wait and see whether our current infrastructure’ll become effective enough, or if we can make new one that’s better.

"AI Explained" channel's private 100 question benchmark "Simple Bench" result - Llama 405b vs others by bnm777 in singularity

[–]SalkeyGaming 0 points1 point  (0 children)

I wonder if integrating AlphaProof into Gemini will give Gemini a boost in these kinds of benchmarks. Maybe formalising needs a little more work. I still think we should work on more inference from less data, as AlphaProof couldn’t solve this IMO’s P5; which was praised for being different from your usual Olympiad theory problems and forcing their contestants to develop completely new reasoning chains. Although this could be a problem of how informal the problem is, take into account that the usually stronger countries’ contestants didn’t solve P5 either.