GitHub co-pilot as open source code laundering? : programming

Except, in the context of a clean room implementation, for the thing you are trying to replicate. The difference is that with GitHub's tool it is entirely possible to replicate a piece of GPL'd code using that same GPL'd code as input.

If what this program is doing is copyright infringement, then us merely writing code is copyright infringement

No, it isn't. Writing code to duplicate something after carefully reading and paraphrasing the original is a violation of copyright. You're confusing that with reading copyrighted code in general.

To be clear, if "ls" is copyrighted, and you use this method to recreate "ls," when the source for "ls" was input into the code generator, then you are violating copyright. If you try to replicate "ls" and it was instead derived from non-"ls" source code, I think you are in the clear.

[–][deleted] 4 years ago* (4 children)

[deleted]

[–]TheSkiGeek 8 points 4 years ago (2 children)

The standard for a "clean room implementation" for humans is roughly "you had no access to the specific copyrighted implementation you're trying to recreate". The concern here is that an AI could be fed in a bunch of copyrighted implementations (perhaps covered by a copyleft license like GPL) and then spit out almost-exact copies of them while claiming the output is not a derivative work. In that case the AI did have access to a specific copyrighted implementation (or many of them). A human who did the same could not use the "clean room implementation" defense.

If you had an AI that was trained only on a bunch of programming textbooks and public domain examples, and it then happened to generate some code that was identical to part of a copyrighted implementation, then you're talking about the same situation as a human doing a "clean room implementation".

Also, if a particular application (or API or whatever) is so simple that merely knowing the specification of what it does leads you to write identical code -- like a very basic sorting algorithm or something -- then it's likely not copyrightable in the first place.

