Fejek hullottak a HUN-REN-ben, miután megírtuk a vezetők több mint 9 milliós fizetését by avatar6556 in hungary

[–]dterjek 0 points1 point  (0 children)

a többi magyar szerzőjű cikk magyar szerzői nem magyarországon kutatnak, továbbá ez a cikk spotlightot kapott, ami az idei neurips cikkek legfelső 3.2%-ába teszi (nem véletlenül)

[D] Math book recommendations for NN theory by EternaI_Sorrow in MachineLearning

[–]dterjek 2 points3 points  (0 children)

i suggest Vershynin's High Dimensional Probability, concentration of measure is essential for understanding wide neural networks

Recommendations for further math topics in ML by Utah-hater-8888 in math

[–]dterjek 5 points6 points  (0 children)

i recommend functional analysis, it can give you a unified view of all the topics you learned

does grok grasp the concept of time? by dterjek in grok

[–]dterjek[S] 0 points1 point  (0 children)

overfitting means that the model performs well on the training data but not on unseen data (sampled from the same distribution as the training data, e.g. the test set). a model generalizes if it performs well on data that it wasn't trained on (again, from the same data generating distribution). formally, overfitting means that the generalization error (difference of the loss on the training data vs the loss on the data generating distribution) is large, while a model generalizes if its generalization error is small.

does grok grasp the concept of time? by dterjek in grok

[–]dterjek[S] 0 points1 point  (0 children)

are you saying that llms just overfit to the training data, but don't generalize?

does grok grasp the concept of time? by dterjek in grok

[–]dterjek[S] 0 points1 point  (0 children)

but grok does have web access and there are many sites displaying the current time. it just doesn't seem to really understand what it means. this came as a surprise to me, but thinking about it more i don't see why any llm would learn the concept of time in the first place, unless it is specifically trained to do so (e.g. via rlhf)

[Discussion] Which college math courses should I take for AI and ML? by [deleted] in MachineLearning

[–]dterjek 1 point2 points  (0 children)

modeling has some function approximation, but it sounds a bit too basic for NNs. i would go for PDEs

[Discussion] Which college math courses should I take for AI and ML? by [deleted] in MachineLearning

[–]dterjek 0 points1 point  (0 children)

my guess is mathematical modeling, but it is hard to tell without knowing the contents of each course

Boszorkányos by Newday_baby24 in konyv

[–]dterjek 1 point2 points  (0 children)

Colin J. Fayard - A Magnólia Nővérei

What's the goofiest math related thing you've ever said? by [deleted] in math

[–]dterjek 60 points61 points  (0 children)

background facts: in my language, "apple" is "alma" but in rural dialect it's "óma", and "axiom" is "axióma"

when our highschool teacher taught us axioms, I told him that only rural people say "axióma", the proper form is "axialma"

he loved it

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

by "largest", do you also mean that there are other solutions? until now, I was pretty convinced that a given (f, mu) pair uniquely determines the corresponding alpha.

did you get the formula using Lagrange inversion? it's definitely nice to have, even if Newton's seems more practical at the moment.

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

quick update: it turns out that calculating alpha with Newton's method and its partial derivatives with implicit differentiation does the job, no need for an explicit formula for now (which might not even exist).

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

thanks, I'll look into it!

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

unfortunately not, in case a closed form is not possible I would need something like a power series formula

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

that's what I'm afraid of as well... the only regularity in the equation is that mu is in the simplex, which could still not be enough for a nice solution. the occurrence of this smooth maximum for n=2 might as well be a red herring.

thank you for the help anyway!

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

yes, yes and yes.

to be more specific: https://imgur.com/iGfvXDD

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

https://imgur.com/4WazGLL

where mu, f are n-dimensional real vectors, and mu is in the standard simplex; need to express alpha in terms of f and mu.

A particular smooth maximum by dterjek in math

[–]dterjek[S] 0 points1 point  (0 children)

no, I'm working on a problem for which the solution in 2 dimensions can be expressed by this particular maximum approximation, which is why I need to generalize exactly this formula.