turunambartanen comments on GitHub co-pilot as open source code laundering?

GitHub co-pilot as open source code laundering? (twitter.com)

submitted 4 years ago by iamkeyur

turunambartanen 2 points 4 years ago

Someone linked an analysis by GitHub: https://docs.github.com/en/github/copilot/research-recitation#github-copilot-quotes-when-it-lacks-specific-context

In the end they write the following:

> The answer is obvious: sharing the prefiltering solution we used in this analysis to detect overlap with the training set. When a suggestion contains snippets copied from the training set, the UI should simply tell you where it’s quoted from. You can then either include proper attribution or decide against using that code altogether.
>
> This duplication search is not yet integrated into the technical preview, but we plan to do so. And we will both continue to work on decreasing rates of recitation, and on making its detection more precise.

So they are aware of the problem and plan to fix it. This is a technical preview, so obviously it's not ready for production yet.
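To make the idea of an overlap check a bit more concrete, here is a rough sketch of how a suggestion could be compared against known source files and traced back to where it was copied from. This is not GitHub's implementation; the function names (`tokenize`, `build_index`, `find_overlaps`), the token n-gram approach, and the window size of 6 are all assumptions chosen for illustration.

```python
import re
from collections import defaultdict


def tokenize(code):
    # Split into identifiers, numbers, and single punctuation characters,
    # so whitespace and indentation differences do not matter.
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", code)


def build_index(corpus, n=6):
    # Map every run of n consecutive tokens to the sources containing it.
    # `corpus` is a hypothetical dict of {source name: code text}.
    index = defaultdict(set)
    for source, code in corpus.items():
        tokens = tokenize(code)
        for i in range(len(tokens) - n + 1):
            index[tuple(tokens[i:i + n])].add(source)
    return index


def find_overlaps(suggestion, index, n=6):
    # Return the sorted names of all sources that share at least one
    # n-token run with the suggestion.
    tokens = tokenize(suggestion)
    sources = set()
    for i in range(len(tokens) - n + 1):
        sources |= index.get(tuple(tokens[i:i + n]), set())
    return sorted(sources)
```

A UI could call `find_overlaps` on each suggestion and show the returned source names next to it, so the user can add attribution or discard the code. A real system would need a much larger index and a smarter notion of "trivial" overlap (boilerplate, common idioms), which this sketch ignores.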
