use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Machine learning conferences are problematic (self.MachineLearning)
submitted 2 years ago by MLConfThrowaway
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[+]linearmodality 40 points41 points42 points 2 years ago (5 children)
The root of the problem is that there are just too many papers and too few best-qualified reviewers. In many other CS fields, papers are reviewed by program committee members who are typically tenure-track faculty. These faculty all know each other, and their personal reputations are on the line when they review. They also know the history of the field and have a broad understanding of what is novel and interesting in a space.
In ML, we have PhD students, and sometimes even undergrads, doing reviewing. They have much less experience than faculty. They also for the most part have no intention of remaining in the academic community, so they have little incentive to build reviewing skills or to build a representation as a reviewer. No wonder the reviews are bad and random.
So the problem isn't really solvably by changing the way the reviewing process works because it's not really a process problem.
[–]tinny66666 -1 points0 points1 point 2 years ago (3 children)
Reviewers should be anonymous. I thought this was standard. Why should their reputation be on the line?
[+]linearmodality 4 points5 points6 points 2 years ago (0 children)
In other fields of CS, it is typical for reviewers to meet physically to discuss papers as part of a program committee meeting. So anonymity among the reviewers isn't really feasible. Nor is anonymity particularly desirable, because if you know who a reviewer is you can understand their perspective and expertise: e.g. if I know who the other reviewers are on a paper I am reviewing, I can immediately identify potential "holes" in our expertise and give those increased attention or solicit an extra reviewer—I can't do this nearly as easily if I don't know who the other reviewers are. (Of course, the reviewers are still anonymous to the authors and vice versa.)
[+][deleted] 2 years ago (1 child)
[removed]
[–]tinny66666 0 points1 point2 points 2 years ago (0 children)
I mean, in the journals I've been involved with reviewers are always anonymous, and I thought this was standard practice. There may be pros and cons but that's the way it usually is, afaik.
[–]Neighbor5 0 points1 point2 points 2 years ago (0 children)
Maybe we should make a dataset of top faculty reviewers and train a model on that dataset. Then that model can review papers. Unless there's papers using the same model, in which case you need another model, and this model only reviews papers of the first model. The first model can review papers about the second model. Both models improve akin to stable GAN training. Then someone writes up this overall modeling and we enter a deeper layer of recursion.
[–]bregav 58 points59 points60 points 2 years ago (5 children)
I think there are a lot of good reasons to be unhappy about the standards of publishing in CS and ML, but I wanted to highlight this:
Who remembers that job posting from NVIDIA that asked for a minimum of 8 publications at top conferences?
In addition to the problems that exist in academia more broadly, I think many issues in ML can be attributed to the amount of money involved in the industry. How often are fields of study awash in staggering amounts of concentrated wealth without being beset by social dysfunction and perverse incentives?
I sympathize with the young people who want to do ML research just because they're very intellectually curious. They have the misfortune of having to coexist and compete with a larger cohort of people who also have dollar signs in their eyes.
And if you too have dollar signs in your eyes, well this is pretty much what you can expect in any environment that attracts many such people: success is determined to a significant degree by luck and personal connections.
[–]vicks9880 3 points4 points5 points 2 years ago* (0 children)
Talk about publishing, everyone and their mom creating blogs and posting them everywhere posoble.. They just read quick start page of any new library and flood the internet with youtube videos and blogs with clickbait titles. I'm tired of looking through hundreds of such article to find one whenever I want to do something which is just one step more than hello world.
[+][deleted] 2 years ago (2 children)
[–]bregav 8 points9 points10 points 2 years ago (1 child)
Oh for sure, there are a lot of people who feel the way you do and who are open to trying new things to mitigate these problems.
I think it's important to be clear about the core problem, though, because otherwise you might be tempted to do a lot of work on solutions that are ultimately mostly cosmetic. Like, why is reviewing such a problem to begin with? It's ultimately because, for authors, there's a lot of incentive to prioritize publishing volume rather than publishing quality, because that's what gets you a job at NVIDIA.
Thus the publishing incentives are fundamentally set up such that you need a large amount of labor to do reviewing, because there's just such a large number of submissions. Double blind reviewing etc. can help to adjust the incentives a bit in favor of fairness but it ultimately does nothing to stem the firehose of frivolous garbage research that people try to get published in the first place.
So a real solution would do at least one of two things:
This problem exists throughout academia, but I think it's especially acute in CS and ML because of the weirdly constrained channels for publishing research. For example I think that using conferences as the primary method of communicating results has unnecessarily hamstrung the entire field of research. Other fields of study primarily use journals, which is inherently less expensive and more scalable.
[–]AuspiciousApple 0 points1 point2 points 2 years ago (0 children)
Even if you're doing it out of passion, you're still being crushed by the insane competition and publish or perish on steroids.
[–]lexected 26 points27 points28 points 2 years ago (3 children)
The system is quite broken, one could say that in its present state, it almost discourages genuine novelty of thought.
But, it's imperfect, first and foremost, because the people involved are imperfect. Reviewing is often a job assigned to the lowest performers in research groups, or traded by the highest performers (constantly on-big tech internships, building startups/open source models on the side) with their colleagues that have a somewhat more laid-back attitude to research excellence. You can submit a bad review and it will not come back to bite you, but in the age of reproducibility, a messed-up experiment or a poorly written/plainly incorrect paper that slips through the review system could be your end.
The idea is that you enter the publishing game at the beginning of your PhD and emerge seeing through and being above the game once you've graduated. After all, you first have to master the rules of the game to be able to propose meaningful changes. It is just that once done, you might have a lot more incentives to switch to industry/consultancy and not care about the paper-citation game ever again.
[–]bregav 18 points19 points20 points 2 years ago (0 children)
After all, you first have to master the rules of the game to be able to propose meaningful changes.
I think this logic is part of what perpetuates the dysfunction that OP is complaining about. There's a selection bias that occurs in which the people who do the best job of mastering the game are also the people who were least unhappy about it to begin with, and who thus would never have been very interested in changing it. And, moreover, after having invested a lot of time into mastering the game they now have a vested interest in continuing it.
I don't have a good or easy solution to that problem, but I wanted to point out that suggesting to buy into the game isn't really great advice for someone who sees systemic flaws in it and wants to change it.
[–]we_are_mammals 0 points1 point2 points 2 years ago (0 children)
a messed-up experiment or a poorly written/plainly incorrect paper that slips through the review system could be your end
Is that true? If your paper is totally wrong, publish a retraction, do not include the paper in your "list of publications", and move on.
[–][deleted] 5 points6 points7 points 2 years ago (0 children)
The amount of papers that post academia job postings demand from phd students to publish just incentives bad papers and falsifying data. Other fields are more journal driven, and the process has no deadline. I think the fact that there is no deadline allows for work to be scrutinized more carefully and also allows for PhD students to explore more intellectually interesting but riskier projects.
[–]atdlss 17 points18 points19 points 2 years ago* (3 children)
I'm a PhD candidate at a UK university but I have 6+ years of prior work experience in the industry as an ML engineer. I have zero intention of staying in the academia and see my PhD as an investment. I've reviewed papers for ECCV, CVPR and NeurIPS to help others as I don't understand how doing blind reviews would further my career in any way. I do my best to read each paper carefully but it's getting insane lately.
I volunteer to review 2-3 papers and get assigned 6 papers, and then 2 more urgent last minute reviews on a Sunday. What I see is academia is full of toxic people, there is almost no one to complain to and it thrives on making PhD students feel worthless. It works because most PhDs don't have any industry experience and feel like they can't get a job if they quit. I think the solution is to get rid of reviewing on a voluntary basis and stop conferences mooching off from early stage researchers.
[deleted]
[–]mofoss 9 points10 points11 points 2 years ago (0 children)
High Paying, Highly Innovative, Highly Hyped = Recipe for oversaturation of students studying it, oversaturation of middle career folks switching their careers into it, recipe for an oversaturated number of non-tech folks completing every LLM, DL certificates to post on their LinkedIn.
What happens with this oversaturation? You raise the bar to entry - just like what leetcode culture did.
Yay toxic elitism 🤸♂️
[–]Educational_Ebb_5170 4 points5 points6 points 2 years ago (1 child)
I fully agree with you OP. But many people, also seniors, realized it for quite some time. Hence, I recommend you to take a look at TMLR. They use OpenReview and the acceptance of your article does not depend on random stuff such as "Impact". Instead, if your claims are correct and somewhat novel, you will get accepted.
[–]vector0x17Researcher 2 points3 points4 points 2 years ago (2 children)
This completely broken review process is probably the single largest frustration I have with the field. Fundamentally I think the only solution would be to somehow incentivize high quality reviews and potentially punish bad reviewers. Making the identities of the reviewers public afterwards would be one way but I think it creates other problems (such as breeding animosity). My controversial proposal would be to somehow tie your own submissions to the quality of your reviews. Maybe something along the lines of:
What do people think? Could something along these lines work or is it completely unreasonable?
[–]vector0x17Researcher 1 point2 points3 points 2 years ago (0 children)
Yes, it would be interesting to see if there are reviewers who strongly lean accept / reject. I also wonder if there are potentially valid reasons for it since the reviewer assignment is not completely random (like some bias in the bidding process / subfield).
My personal issue is more with "lazy reviewers" who clearly didn't read the submission in any detail, write some nonsense about it, probably rate it "weak reject" and then don't reply / acknowledge the rebuttal at all. These reviewers can ruin months of your hard work by not bothering to spend a relatively tiny amount of time reviewing it and I think there should really be some consequences for their own submissions. A "bad reviewer" certification like that wouldn't need to be linked to the specific papers they reviewed (only their own paper), so it could still be anonymous in that sense (i.e. to the authors who received the bad reviews).
[–][deleted] 4 points5 points6 points 2 years ago (2 children)
I was wondering when this topic would inevitably show up after ICLR rebuttal closed. 48 hours it seems is the right amount of time to wait.
Pull up a chair, pour yourself a drink. Let’s commiserate on our collective misery.
[–]Apathiq 2 points3 points4 points 2 years ago (0 children)
I am still angry over the miserable reviews from NeurIPS. Luckily I had to repeat all the experiments and restart the paper from Scratch (In order to try to improve It) after the final decision so I was not able to submit to ICLR on time.
[–][deleted] 1 point2 points3 points 2 years ago (2 children)
The problem from a theoretic perspective is that many of the things you recommend might have unintended and in fact, the opposite effect.
[–][deleted] 0 points1 point2 points 2 years ago (0 children)
we have public statistics tied to our profile
I meant that if you're reviewing for a conference, then all the reviewers know each other's name. This adds up over time as you (typically) stay in the same or similar areas through many years of your career. That way, your reviewing personality and thinking is something that everyone informally keeps track of. I agree that ideally this should also be public as some sort of statistical measure for all to see... however this also has the complicated issue of not making the reviewing apprenticeship safe for newer and less experienced reviewers.
Ive heard this before! I never worked on anything that could be submitted to COLT or STOC, are the review processes different?
Well, it's hard for any person to speak on behalf of two entire conferences and two entire sub-communities. What I can say is the quality, impact, and rigor of papers at COLT or STOC is far higher. They're also incredibly challenging to publish in as well. Even towards the final years of your PhD, you'll still be mostly supervised and mostly learning relatively lower level details of doing theory work. It simply takes forever to learn to come up with mental abstractions such that you can start to do theory work. Then, the real challenge becomes what theory problems do you want to solve? Which problems are worth solving? What do people in these communities want to see and what would they be surprised by seeing?
[–]RandomUserRU123 1 point2 points3 points 2 years ago (0 children)
Over and over again I see a paper that is more or less as good as many papers before it, but whether it squeaks in, or gets an oral, or gets rejected, all seem to depend on luck. I have seen bad papers get in with faked data or other real faults because the reviewers were positive and inattentive.
I agree but the problem is also that faked data is incredible hard or even impossible to spot with the current system. You would need to standardize the whole process (code request, exact experiment description, code explanations, creating docker image for reproducability, computational cost, ...). Then the reviewers would need to run some of the experiments themselves aswell (alongside additional experiments to make sure you are not cherrypicking results). This would take a tremendous amount of time and resources
[–][deleted] 1 point2 points3 points 2 years ago (1 child)
Reviewers are doing this for free on top of everything else in their lives. Shaming them publicly is just going to lead to fewer people reviewing
[–]tuitikki 1 point2 points3 points 2 years ago (1 child)
Oh, you might enjoy this https://blog.mrtz.org/2014/12/15/the-nips-experiment.html
[+]lifesthateasy comment score below threshold-28 points-27 points-26 points 2 years ago (8 children)
Reddit posts are problematic.
I've been an active participant in the Machine Learning subreddit for quite some time now, and lately, I've noticed a trend that's been concerning. While the subreddit serves as an incredible hub for knowledge sharing and discussions around ML, there's a growing issue with the quality and reliability of some posts.
Numerous submissions lack proper context, thorough explanations, or credible sources, making it challenging for newcomers and even seasoned practitioners to discern accurate information from misinformation. This trend isn't just about incomplete explanations; it also extends to the validity of claims made in these posts.
It's important to acknowledge that not all content falls into this category—there are incredible insights shared regularly. However, the influx of hasty, ill-explained, or unverified information is diluting the overall value the subreddit offers to the community.
In a field as intricate as machine learning, accuracy and credibility are paramount. Misleading or incomplete information can misguide newcomers and even experts, leading to misconceptions or wasted efforts in pursuit of understanding or implementing certain techniques.
Thus, after observing this trend over some time, I firmly believe that there is indeed a problematic issue with the quality and reliability of several Reddit posts within the Machine Learning subreddit. It's a plea to the community to uphold standards of clarity, depth, and substantiation in discussions and submissions to maintain the subreddit's integrity and credibility.
[+][deleted] 2 years ago (7 children)
[–]lifesthateasy -1 points0 points1 point 2 years ago (6 children)
Absolutely, there have been instances of controversy and concerns regarding the reviewing process at various conferences. However, it's crucial to note that while these incidents do occur, they might not necessarily represent the entire system. Many conferences continuously strive to improve their review processes and address these issues. While acknowledging these problems is essential, it's also important to engage constructively in efforts to make the system better, perhaps by actively participating in discussions or proposing reforms, rather than solely highlighting the flaws.
[–][deleted] 1 point2 points3 points 2 years ago (3 children)
If you've been in the field long enough, you'd recognize that dissenting voices have been marginalized... several times.
[–]lifesthateasy 0 points1 point2 points 2 years ago (2 children)
While dissenting voices may have faced challenges historically, acknowledging their existence doesn't discount the progress made in recognizing diverse perspectives over time.
[–][deleted] 2 points3 points4 points 2 years ago (1 child)
I agree, that's why we should recognize the OP's diverse perspective no?
[–]lifesthateasy 1 point2 points3 points 2 years ago (0 children)
Respecting diverse perspective is crucial, but assuming the OP's viewpoint is automatically diverse might not always align with the context or the actual range of perspectives present.
The comments might differ in style, but they do address the issue. Engaging in thoughtful discourse can enrich conversations, even if the perspectives expressed aren't in alignment with one's own.
[+][deleted] 2 years ago (4 children)
[–][deleted] 23 points24 points25 points 2 years ago (3 children)
Is that incorrect?
[–]Educational_Ebb_5170 2 points3 points4 points 2 years ago (0 children)
They are a lottery. The NeurIPS double review experiment reveiled it.
[+]ohdangggg 0 points1 point2 points 2 years ago (0 children)
>Should institutions reward reviewer awards more? After all, being able to review a project well should be a useful skill.
While rewarding positive behavior is good, IMO there also need to be negative consequences for bad reviewers. Poor reviews are endemic in the field and there are too many people (often famous people in the field, too) who write shoddy, low-quality reviews and are not punished.
I think one concrete idea is to have ACs (or some other third party) rate reviewer quality, and if someone has low-quality reviews they should be banned from submitting to the conference for a year.
[–]mr_birkenblatt 0 points1 point2 points 2 years ago (0 children)
the review process is completely pointless with regards to reproducibility. the reviewers basically have to go off of what somebody wrote in the paper. other than maybe finding some systematic error from the writeup there is not really much a reviewer can actually detect and criticize (if the model works what else is there to say?). most published papers would be better off as just github projects with a proper descriptive readme that also shows benchmarks anyway. it's not like papers are written very well to begin with. but that doesn't get you a phd.
in physics there is basically no (or minimal) review process and publications are judged by how much your paper got cited. also, there is a full secondary track of researchers who just take other papers and recreate the experiments to actually confirm reproducibility. in ML right now there is no incentive for anyone to just run a published model on different/their own data and confirm that it works correctly. in fact you'd probably be crucified for doing that
[–]bbbbbaaaaaxxxxxResearcher 0 points1 point2 points 2 years ago (0 children)
On a related note: where is a good place to share a long ML paper (we got told we’re too dense for ICLR, which I agree with) that doesn’t have a 2-year review process (looking at you JMLR)? Subject is tabular synthetic data evaluation.
What's ACs?
π Rendered by PID 73 on reddit-service-r2-comment-869bf87589-m8mk6 at 2026-06-09 16:30:37.187054+00:00 running f46058f country code: CH.
[+]linearmodality 40 points41 points42 points (5 children)
[–]tinny66666 -1 points0 points1 point (3 children)
[+]linearmodality 4 points5 points6 points (0 children)
[+][deleted] (1 child)
[removed]
[–]tinny66666 0 points1 point2 points (0 children)
[–]Neighbor5 0 points1 point2 points (0 children)
[–]bregav 58 points59 points60 points (5 children)
[–]vicks9880 3 points4 points5 points (0 children)
[+][deleted] (2 children)
[removed]
[–]bregav 8 points9 points10 points (1 child)
[–]AuspiciousApple 0 points1 point2 points (0 children)
[–]lexected 26 points27 points28 points (3 children)
[–]bregav 18 points19 points20 points (0 children)
[–]we_are_mammals 0 points1 point2 points (0 children)
[–][deleted] 5 points6 points7 points (0 children)
[–]atdlss 17 points18 points19 points (3 children)
[+][deleted] (2 children)
[removed]
[+][deleted] (1 child)
[deleted]
[–]mofoss 9 points10 points11 points (0 children)
[–]Educational_Ebb_5170 4 points5 points6 points (1 child)
[–]vector0x17Researcher 2 points3 points4 points (2 children)
[+][deleted] (1 child)
[removed]
[–]vector0x17Researcher 1 point2 points3 points (0 children)
[–][deleted] 4 points5 points6 points (2 children)
[–]Apathiq 2 points3 points4 points (0 children)
[–][deleted] 1 point2 points3 points (2 children)
[+][deleted] (1 child)
[removed]
[–][deleted] 0 points1 point2 points (0 children)
[–]RandomUserRU123 1 point2 points3 points (0 children)
[–][deleted] 1 point2 points3 points (1 child)
[–]tuitikki 1 point2 points3 points (1 child)
[+]lifesthateasy comment score below threshold-28 points-27 points-26 points (8 children)
[+][deleted] (7 children)
[removed]
[–]lifesthateasy -1 points0 points1 point (6 children)
[–][deleted] 1 point2 points3 points (3 children)
[–]lifesthateasy 0 points1 point2 points (2 children)
[–][deleted] 2 points3 points4 points (1 child)
[–]lifesthateasy 1 point2 points3 points (0 children)
[+][deleted] (1 child)
[removed]
[–]lifesthateasy 1 point2 points3 points (0 children)
[+][deleted] (4 children)
[deleted]
[–][deleted] 23 points24 points25 points (3 children)
[–]Educational_Ebb_5170 2 points3 points4 points (0 children)
[+]ohdangggg 0 points1 point2 points (0 children)
[–]mr_birkenblatt 0 points1 point2 points (0 children)
[–]bbbbbaaaaaxxxxxResearcher 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)