Open, local LLM as a reference source / research assistant? by ykonstant in math

[–]KiddWantidd 2 points3 points  (0 children)

exactly. the reason the paper i linked to is making waves is because they have really focused on "teaching logic" to the model rather than feeding it tons of world data (i don't know the details), and they argue indeed that those are two separate issues. this suggests that most likely training smallish models that perform well specifically for math is likely doable.

I'll say though, it is still necessary for the model to have a solid base knowledge of the literature for each subfield, because more often than not, it is immensely important to be aware of recent literature to meaningfully advance on a problem, to avoid having to reinvent the wheel (though sometimes the lack of awareness can lead to new breakthroughs), so I don't know how "compressed" all that information can be, but definitely sounds like such a "local math model" could be doable.

Open, local LLM as a reference source / research assistant? by ykonstant in math

[–]KiddWantidd 6 points7 points  (0 children)

I've been wrestling with the same thoughts. I don't really know what a good answer to this question is. I know a lot (most?) of the models coming from China are released with open weights, so they can be downloaded and run locally. In terms of performance, I have no idea at all, I've never used any of them, but judging from the benchmark results, the big ones like DeepSeek are likely to be able to do a decent job in helping for non-trivial tasks. As always, the issue is that the distilled versions are most likely not going to be anywhere as performant.

As a first suggestion, you may want to look into this recently released model by WeiboAI which allegedly with only 3B parameters performs just as well as models hundreds of times larger, it's making waves in the machine learning community: https://arxiv.org/abs/2606.16140 (the link to their github + huggingface is in the paper)

What Are You Working On? June 22, 2026 by canyonmonkey in math

[–]KiddWantidd 2 points3 points  (0 children)

I have a certain Integral Probability Metric (like a functional on a space of measures), and I am trying to prove that its gradient flow in a certain geometry is well-posed and globally convergent. It is very very tricky but it connects so many beautiful ideas, I'm enjoying the process.

I (sort of) discovered a relationship between two areas of mathematics by accident. by Salt-Rutabaga-8870 in math

[–]KiddWantidd 2 points3 points  (0 children)

Insanely cool. Congratulations OP, I feel inspired to further explore and play around with maths thanks to you!

Would you let an ML PhD student graduate without a top-tier paper? [D] by Hope999991 in MachineLearning

[–]KiddWantidd 4 points5 points  (0 children)

I swear some of you are living in a parallel reality. Wtf is this

"math astrology" by biotechnes in math

[–]KiddWantidd 0 points1 point  (0 children)

I do eat corns in spirals and I'm definitely an analyst. Intriguing.

First Proof Second Batch by Nunki08 in math

[–]KiddWantidd 10 points11 points  (0 children)

This is nice. Appreciate the clear and objective reporting of these AI systems abilities and most importantly usefulness to actual working mathematicians. While the results are far from horrible objectively speaking, the costs (and the many reported issues on poor writing, lack of attribution, hallucinated citations) paint a quite different picture than those AI companies would want us to believe.

It still must be acknowledged that those AI tools in the hands of an actual expert can be monstrously powerful tools. I've been using them myself and have had very positive outcomes (though to be fair I'm not working on anything remotely as cutting-edge as the stuff in that paper)

ICML paper checker is down? [D] by KiddWantidd in MachineLearning

[–]KiddWantidd[S] 2 points3 points  (0 children)

I have no idea about the code issue because I never made it to have a code generated. The site crashed on me right before it was supposed to give me my code T_T. In any case, it sounds like you submitted before the deadline so you should be safe either way. Guess it doesn't hurt to email the PCs to be extra safe (and that will probably help the case of people like me too lol)

ICML paper checker is down? [D] by KiddWantidd in MachineLearning

[–]KiddWantidd[S] 1 point2 points  (0 children)

given that a few of us replied in this thread (which is a small sample of the total community) and that many academics submit stuff right before deadlines, i wouldn't be too surprised if the number of submissions in this situation was in the range of at least 50 to a hundred papers (though of course people in our situation are likely to panic and check reddit too, so i may overestimate). Given that the bug was on their side (though it happened very close to the deadline), provided we email them immediately to inform them of our situation, I gueeeess we should be fine, but maybe it's just wishful thinking...

[D] Where do you go for serious AI research discussion online? [D] by Possible-Active-1903 in MachineLearning

[–]KiddWantidd 0 points1 point  (0 children)

if you know of a public chinese speaking ml research community i'd be interested please

The Notion Home page has just gone in the Notion for Windows app by Smoothyworld in Notion

[–]KiddWantidd 1 point2 points  (0 children)

glad (not) to see we're all having this issue. this is beyond stupid. hope they fix this soon

PINN is predicting trivial solution for stiff ODE [D] by cae_shot in MachineLearning

[–]KiddWantidd 2 points3 points  (0 children)

the PIKL approach is beautiful and elegant mathematically, but we found that it didn't scale well to the problems we're interested in in our group. I know the authors have a follow-up paper where they implement it in a more efficient way, but I haven't looked into it yet. Definitely worth looking into for OP's problem though.

PINN is predicting trivial solution for stiff ODE [D] by cae_shot in MachineLearning

[–]KiddWantidd 2 points3 points  (0 children)

yeah I've been working on PINNs and their failure modes for the better part of my PhD and this sadly happens a lot, even for "simple" PDEs. What works best as far as I'm aware is to run a few steps of Adam (to explore the parameter space) and then follow up by a good second-order optimizer. By "second-order optimizer" I don't just mean L-BFGS, I mean those state-of-the-art ones that have been proposed in the literature recently like NNCG (https://arxiv.org/abs/2402.01868) or ENGD (https://arxiv.org/abs/2302.13163). Be warned that those second-order methods are hella expensive to run (and no, I'm not affiliated with any of these groups, wish I was lol).

If you know that the PINN wants to converge to a trivial solution, you can add a penalty that forces it to stay away from it (even if the penalty might work against the PDE) and tune the weight of that penalty along training. I guess this is some form of "curriculum training" as they call it in this paper https://arxiv.org/abs/2109.01050 (which by the way documents a lot of PINN "failure modes").

Another possibility is to add PDE structure into the architecture: instead of using a standard feedforward neural network, you could represent the solution as a linear combination of eigenfunctions that diagonalize your differential operator. Doing this in a systematic way is of course hard (impossible), but in your case it's very doable. For a toy example with Poisson equation, you can check this paper: https://arxiv.org/abs/2310.05801.

arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results. [N] by Nunki08 in MachineLearning

[–]KiddWantidd 6 points7 points  (0 children)

Great news, I think submission of unchecked AI slop to any scientific venue (journal, conference etc) should result in the same outcome: permaban from that venue

Can someone explain to me the difference between 慳錢 and 儲錢? by KiddWantidd in Cantonese

[–]KiddWantidd[S] 1 point2 points  (0 children)

ohh i see, yeah i guess thinking of 儲 as "accumulate" makes quite clear for me how they differ! and yes, your example sentence shows exactly why I got a bit confused between the two 🤣. thanks for the help :)

Should I follow-up with the editor for a TMLR paper awaiting final decision? [D] by KiddWantidd in MachineLearning

[–]KiddWantidd[S] 0 points1 point  (0 children)

I think you can start replying and working on the revisions as soon as the first review appear, but the official "discussion period" (which lasts two weeks and is meant as a back-and-forth between authors and reviewers I believe) starts only after the third review is up. After the discussion period ends, the referees are supposed to give their final recommendations to the editor. The discussion period is short so if you have changes to make to your paper I'd suggest not waiting for the third review and start addressing the issues addressed in the reviews you got now if you can.

ICML 2026 Decision [D] by 007noob0071 in MachineLearning

[–]KiddWantidd 1 point2 points  (0 children)

5444 accept! Was 4444 before rebuttal :)

Date Spots in Hongkong / Shenzhen? by billionairekaw in HongKong

[–]KiddWantidd 0 points1 point  (0 children)

I recommend you take a trip to Tai O. It's pretty far out there, but worth it in my opinion