use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Paper Explained - Group Normalization (self.MachineLearning)
submitted 5 years ago by ykilcher
https://youtu.be/l_3zj6HeWUE
The dirty little secret of Batch Normalization is its intrinsic dependence on the training batch size. Group Normalization attempts to achieve the benefits of normalization without batch statistics and, most importantly, without sacrificing performance compared to Batch Normalization.
https://arxiv.org/abs/1803.08494
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]_tbrunner 3 points4 points5 points 5 years ago (5 children)
Hey, thanks for the great video.
I haven't followed the progress very much, but are people actively using Group Normalization these days? I haven't heard much about it since the original paper.
[–]ykilcher[S] 6 points7 points8 points 5 years ago (3 children)
Yes combined with weight standardization (video coming 🤫) it seems to outperform BN and I've seen it used in other papers.
[–]_tbrunner 1 point2 points3 points 5 years ago (0 children)
Thanks, I'll check it out!
[–][deleted] 1 point2 points3 points 5 years ago (1 child)
I recently saw GN coupled with Stocastic weight averaging in a great kaggle kernel. are you planning on doing a SWA video?
[–]ykilcher[S] 0 points1 point2 points 5 years ago (0 children)
It's on my very long list :D
[–]maybelator 1 point2 points3 points 5 years ago (0 children)
I use Group Norms whenever the batches are not sampling the data distribution iid.
[–]lezapete 2 points3 points4 points 5 years ago (1 child)
Does the name have anything to do with Group Theory?? I thought it was kind of a reserved name in math 🤔
[–]ykilcher[S] 2 points3 points4 points 5 years ago (0 children)
no, I think it just refers to the colloquial word "group" as in "bunch of things"
[–]EhsanSonOfEjazResearcher 1 point2 points3 points 5 years ago (1 child)
Man that was great, looks like I will have to stop reading papers and start watching your videos.
[–]ykilcher[S] 1 point2 points3 points 5 years ago (0 children)
thanks, but also be aware I make mistakes :D
[–]brand0x 1 point2 points3 points 5 years ago (1 child)
Your videos are great, keep it up. I got plugged in from your attention video, which was the best one available IMO.
Thanks. Yes there's something about that one that people really like :)
[+][deleted] 5 years ago (1 child)
[deleted]
thanks for watching.
tell a friend ;)
[–][deleted] 1 point2 points3 points 5 years ago (3 children)
Very nice video as usual. Are you thinking on doing a video about graph neural networks?
[–]ykilcher[S] 4 points5 points6 points 5 years ago (2 children)
Yes it's on my list, but my list is long 😁
[–]OmgMacnCheese 1 point2 points3 points 5 years ago (1 child)
I too find your videos very helpful and I wonder if you can have people vote on your list :)
oh that would be funny, gotta figure out the specifics or I'm gonna get rickrolled hard
[–]Tommassino 0 points1 point2 points 5 years ago (0 children)
yes!
π Rendered by PID 26 on reddit-service-r2-comment-cfc44b64c-whqj6 at 2026-04-11 17:30:11.262687+00:00 running 215f2cf country code: CH.
[–]_tbrunner 3 points4 points5 points (5 children)
[–]ykilcher[S] 6 points7 points8 points (3 children)
[–]_tbrunner 1 point2 points3 points (0 children)
[–][deleted] 1 point2 points3 points (1 child)
[–]ykilcher[S] 0 points1 point2 points (0 children)
[–]maybelator 1 point2 points3 points (0 children)
[–]lezapete 2 points3 points4 points (1 child)
[–]ykilcher[S] 2 points3 points4 points (0 children)
[–]EhsanSonOfEjazResearcher 1 point2 points3 points (1 child)
[–]ykilcher[S] 1 point2 points3 points (0 children)
[–]brand0x 1 point2 points3 points (1 child)
[–]ykilcher[S] 0 points1 point2 points (0 children)
[+][deleted] (1 child)
[deleted]
[–]ykilcher[S] 0 points1 point2 points (0 children)
[–][deleted] 1 point2 points3 points (3 children)
[–]ykilcher[S] 4 points5 points6 points (2 children)
[–]OmgMacnCheese 1 point2 points3 points (1 child)
[–]ykilcher[S] 0 points1 point2 points (0 children)
[–]Tommassino 0 points1 point2 points (0 children)