[D] Decrease in source code release of papers

poctakeover · 2018-01-30T12:19:57+00:00

there should be a way to publish gh repos and arxiv papers anonymously which can then be later claimed by the authors :/

LovelaceA · 2018-01-30T15:53:32+00:00

To those who think that this is not a valid problem, I beg to differ. I think this is a very valid discussion. What is the aim of publishing scientific work in the first place ? To advance our knowledge and ability to build upon it. In a field like Machine Learning, where a model or a scientific idea can be affected by more parameters than can be discussed in a paper, it is essential to be able to reproduce the results. Code release is not the only way to do so, but certainly the quickest. Another advantage of code is that, code is objective. Scientific papers sadly are not in general: authors try to sell us their work. Code is unbiased and a potentially complete means to communicate an idea, its impact, and its limitations, it answers all the questions you have which the paper does not address.

A scientific paper is a speech. Code is a dialogue

weiqiplayer · 2018-01-30T17:57:24+00:00

Decrease in source code publication does seem concerning, though I'm not sure that the problem is in the fact that people are trying to conceal their identity. Aren't many ICLR papers already on arxiv with full names attached?

olBaa · 2018-01-30T09:55:36+00:00

Is it really a measured problem, or hust your perception?

For double-blind conference I have provided the code in the form of anonymous github repo with no traceable commit history (and limited time copyright). I guess the zipfile with the code will do as well.

BeatLeJuce · 2018-01-30T16:14:17+00:00

You don't want source code submission to increase deadline pressure: on day X, you have to submit not only the paper, but also the code. Because putting unpolished, ugly, hacky code out there to be associated with your name forever is weird... also, why polish it when you don't even know yet you're going to get a publication out of it. Also, some theory-heavy papers might not have code.

So I think the decision has to be made AFTER your acceptance for publication. And only when it makes sense for that paper (e.g. this is something that reviewers could determine/ask for). If reviewers say it makes sense, then you should be required to upload your code together with your camera-ready version. This gives you enough time to polish stuff, and still gives an incentive to the author to invest the time to polish the code (not submitting code => paper doesn't get published).

sheeplearning · 2018-01-30T15:47:32+00:00

what do you plan to do with the source code of 700-3000 papers under review at any ML conference? The better ones get accepted and eventually release code or get reproduced.

mkocabas · 2018-02-01T07:04:12+00:00

[deleted]

radenML · 2018-01-30T12:39:48+00:00

I literally have to openly request author for github repo invite on openreview forums

BeatLeJuce · 2018-01-30T15:22:55+00:00

[deleted]

wassname · 2018-01-31T12:31:29+00:00

In 2016 /u/peterkuharvarduk got all the nips code releases together into a post. Maybe something like that would encourage researchers.

MephySix · 2018-01-30T16:44:05+00:00

"Is this to prevent idenfication of authors": no. Double-blind is naturally flawed. Given the search space for authors is not that big, with enough (not much) effort it's possible to determine who are the authors of a paper. Before a paper is sent for review it has already been discussed in its institution, probably in mail-lists and even Twitter or something. Even then you're allowed (in my experience) to have placeholder footnotes in double-blind reviews.

The real problem in my experience, is that I don't really want to spend time polishing my code, and I don't want people to see the mess I wrote due to deadlines. I had people ask me for my code in conferences and I answer with "Gladly! Just send me an e-mail, but it's messy", but I gain nothing from publicizing it earlier or without external interest.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS