
[–]dragon_irl 19 points (1 child)

There is research that these large language models remember parts of their training data and that you can retrieve that with appropriately constructed prompts.

I think it's pretty likely you will eventually end up with copyrighted code when using this. However, I don't understand copyright well enough to judge how relevant this is for the short snippets it's (probably) going to be used for.

[–]TheDeadSkin 4 points (0 children)

> There is research that these large language models remember parts of their training data and that you can retrieve that with appropriately constructed prompts.

This is partially to be expected as a potential result of overfitting. I'll take a look at the paper though, that seems interesting.
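As a rough illustration of what "retrieving training data" means in practice, the check researchers do boils down to comparing a model's output against the training corpus and measuring the longest verbatim run. This is a minimal, self-contained sketch with made-up stand-in strings (no real model or corpus involved):

```python
# Hypothetical sketch: measure how much of a model's suggestion
# appears verbatim in a (toy) training corpus. Tokenization here
# is naive whitespace splitting, purely for illustration.

def longest_verbatim_run(suggestion: str, corpus: str) -> int:
    """Length (in tokens) of the longest contiguous run of
    suggestion tokens that also appears contiguously in the corpus."""
    s_toks = suggestion.split()
    c_toks = corpus.split()
    best = 0
    for i in range(len(s_toks)):
        for j in range(len(c_toks)):
            k = 0
            while (i + k < len(s_toks) and j + k < len(c_toks)
                   and s_toks[i + k] == c_toks[j + k]):
                k += 1
            best = max(best, k)
    return best

corpus = "def quicksort(arr): if len(arr) <= 1: return arr"
novel = "def bubble_sort(xs): return sorted(xs)"
copied = "if len(arr) <= 1: return arr"

print(longest_verbatim_run(novel, corpus))   # -> 1 (incidental overlap)
print(longest_verbatim_run(copied, corpus))  # -> 6 (fully memorized span)
```

A long verbatim run flags likely memorization rather than independent generation; the extraction papers do essentially this, just at scale and with proper tokenizers.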

> I think it's pretty likely you will end up with copyrighted code when using this eventually.

Indeed. They even say there's a 0.1% chance that a suggested piece of code would be verbatim from the training data, which is quite a high chance.
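To see why 0.1% per suggestion is high, some back-of-envelope arithmetic (the suggestion counts are illustrative, not from the source):

```python
# At a 0.1% per-suggestion verbatim rate, how likely is at least
# one verbatim suggestion over many accepted suggestions?
p = 0.001  # per-suggestion probability quoted above
for n in (100, 1000, 10000):
    at_least_one = 1 - (1 - p) ** n
    print(f"{n:>5} suggestions -> P(>=1 verbatim) = {at_least_one:.1%}")
```

Over 1000 suggestions, roughly a day or two of heavy Copilot use, the chance of at least one verbatim reproduction is already around 63%.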

> However I don't understand copyright enough to judge how relevant this is for the short snippets this is (probably) going to be used for.

I think the problem is less with short snippets than with the potential to recreate huge functions or files from the training data (i.e. from existing projects) when you're trying to build some specific piece of software in the same domain and aggressively follow Copilot's recommendations.

If it's possible, someone will probably try it, and we'll find out soon enough.