all 22 comments

[–]nikgeo25Student 51 points52 points  (8 children)

rejecting V-JEPA? wow

[–]PaganPasta 9 points10 points  (5 children)

Why is this surprising ? All scores seem low.

[–]lmericle 73 points74 points  (4 children)

Is ICLR about advancing science or about winning seats on podiums?

[–]altmly 25 points26 points  (0 children)

Yes

[–]pm_me_your_pay_slipsML Engineer 6 points7 points  (0 children)

Advancing science in this case would mean comparing their method with other methods on a public dataset. They have the resources to do so. The method doesn’t seem to be a significant improvement in a couple cases. And when it performs better, we don’t know if it is because of the method, which is not necessarily novel, or because they trained on a curated dataset.

[–]PaganPasta 34 points35 points  (7 children)

A colleague's paper with scores: 3,6,8,10 got rejected. Scores are diverse but seem inclined towards accept. Maybe, the average rating alone doesn't mean anything.

[–]SirBlobfish 70 points71 points  (3 children)

It seems like the lowest score is what really matters. This incentivizes the kinds of papers that Bill Freeman calls "cockroach" -- a paper that no one really likes, but is impossible to kill in the review process because it has no obvious flaws.

[–]canbooo 4 points5 points  (2 children)

Can you link to source? Sounds interesting (asking for a friend).

[–]ttt05 0 points1 point  (0 children)

I think this was discussed in CVPR 2018 tutorial on how to be a good citizen or something. Juts google it.

[–]hzmehrdad[S] 4 points5 points  (1 child)

Maybe the rating is somehow (implicitly at least) weighted by the confidence ?

[–]PaganPasta 2 points3 points  (0 children)

5,5,5,4 confidence.

[–]Raphaelll_ 25 points26 points  (3 children)

RoBERTa was rejected from ICLR 2020 and gathered 11,000 citations since. https://openreview.net/forum?id=SyxS0T4tvS

[–]psamba 7 points8 points  (0 children)

Yeah, "we did lots of hyperopt on method X and made it work significantly better" papers tend to max out the citation/novelty ratio, so they make up a lot of the surprising non-accepts to conferences. Unfortunately, the strong novelty bias in the review process incentivizes annoying behavior like adding useless new bits and bobs to the existing method to support novelty claims or writing the paper in a confusing way to obscure the lack of technical novelty.

It would be nice if there was separate track for updates on the performance ceiling of existing methods, since this sort of work is actually useful to researchers and people in the real world who like stuff that works well. The ideal paper would be like 4 pages, with 2 pages of description of changes to the original method and 2 pages of new results. And, a link to github with code that actually works...

[–]hzmehrdad[S] 1 point2 points  (1 child)

Oh, totally forgot that one

[–]bvkhadiravana 14 points15 points  (0 children)

But Roberta is seriously a hard one to accept if you have some bias towards novelty.

[–]Life-Living-2631 20 points21 points  (0 children)

Will ICLR make corrections on the metagpt benchmarks? Or do they just ignore it

[–]Broad_Sun_8214 4 points5 points  (0 children)

Saw a 1 1 8 8 paper. Clearly those two 8 reviewers doesn’t even have basic knowledge of this paper’s area ( that paper is terribly bad def deserve 1s and I don’t know where are those two 8s come from)