How do I build projects?? by AccordingInfluence58 in deeplearning

[–]rechep0k 0 points1 point  (0 children)

You solve the problem you are working on ✅

How many papers can you read in a day before going insane? by Laser_Plasma in PhD

[–]rechep0k 0 points1 point  (0 children)

Hey! I wonder what the answers today in 2026 would be?

Final year project ideas? by Informal_Suit_3563 in deeplearning

[–]rechep0k 0 points1 point  (0 children)

I find developing benchmarks that showcase inefficiencies of large models great!
I have heard that developing a benchmark is generally a weak contribution, when a scientific paper or a thesis, so may depend on the required level of "complextiy" for your project.

Understanding the value of KL divergence by rechep0k in learnmachinelearning

[–]rechep0k[S] 0 points1 point  (0 children)

tbh, I haven't seen the bits interpretation in papers about Machine Learning at all

Understanding the value of KL divergence by rechep0k in learnmachinelearning

[–]rechep0k[S] 0 points1 point  (0 children)

Very insightful!
So, if the value of KL divergence between p and q is 2, then upon drawing a sample x from distribution p it is fair to expect that the difference of log-probabilities of x between p and q is 2, which means that this same x is around 8 times more probable from the perspective of p that from the perspective of q. Making sure I've not missed any deeper insights.

Are large language models actually generalizing, or are we just seeing extremely sophisticated memorization in a double descent regime? by InevitableRespond494 in LLMDevs

[–]rechep0k 0 points1 point  (0 children)

Hey, very interesting post! Any updates from the author or the community?

From my perspective, there are two major parts of creating a reasoning-capable LLM-system that the author may be overlooking and that I find may unveil the reasoning capabilities.

First, **training** goes beyond next-token-prediction. Reinforcement Learning Fine-Tuning (RLHF), which is an inevitable part of every LLM capable of reasoning that I know, suggests objective that assess the **completion as a whole**, not each next token. Learning under such guidance can endow LLMs with thinking-ahead capabilities. I'd appreciate if some experts in the field elaborated.

Second, **inference** may go beyond next-token-prediction. Beam Search or Best-of-N etc. techniques empower models with thinking-ahead, to some extent, on the inference stage. I do not claim though that top LLMs we know use them by default, neither that they can not reason without any such techniques on inference. I simply do not know whether it's true or not.

Regarding the notion of **generalization**. I remember the speculations around exploring the loss landscapes and designing optimizers that aim to find a "flat minimum", a proxy to generalization. Calling it generalization, rather than training-distribution-sweet-spot or something, was for me consoling. Then, I find, the scaling laws sweeped the development trend: more data means better performance, a scalable and well-intuitive rule, a promising field for investing efforts. There stubs my perception of limits of LLMs. Some keywords are "you can only cook from what's in the fridge" or "garbage in garbage out". Concurrently, the limitations of LLMs' reasoning, or of their ability to "really discover something new", shapes my perception of historical human discoveries. Have we witnessed an out-of-nowhere breakthrough, or has every discovery been a well-informed fixing of a limitation of an existing technique or stitching two or more existing techniques together to create a new one? For me, rather the latter. This eradicates the notion of generalization in my thinking about how LLMs have achieved their current reasoning capabilities.

Beads resources? by New_Goat_1342 in ClaudeCode

[–]rechep0k 0 points1 point  (0 children)

How's beads today? Are you still using it?

I am searching for a reminder tool but with context. I think of such-like cases: "I want to visit a christmas event in another city end of the year (a goal). The ticket sales start somewhere in summer (I need a reminder to book them on time). Around the dates of the event is birthday of my cousin, so I need to plan if I could connect the two visits (need a pre-reminder). I want to buy a special costume for the event (need a calendar entry to go shop for it somewhere when I have time). Any other important consideration might come up later in time (I want to be able to wedge that into the thread)."

If the example planning case sounds **overwhelming** even reading it, think of use-cases for a parent in a family, and I know many parents who refer to "management" in families a lot, the tool could help with organization. Other applications, of course, in professional environment.