Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I always find it amazing that people are wiling to use AI beacuse of stuff like this, its been illegally trained on code that it does not have the license to use, and constantly willy nilly regurgitates entire snippets completely violating the terms of use

Edit:

https://github.com/vorg/pragmatic-pbr/blob/master/local_modu...

https://github.com/vorg/pragmatic-pbr/blob/master/local_modu...

This looks like where the source code was stolen from: this repository is unlicensed, and this is copyright infringement as a result





As discussed in this thread before you posted this comment, this code wasn't generated from an LLM at all, but simply included in a dependency: https://news.ycombinator.com/item?id=46092904

Unlike your results which aren't exact match, or likely even a close enough match to be copyright infringment if the LLM was inspired by them (consider that copyright doesn't protect functional elements), an exact match of the code is here (and I assume from the comment I linked above this is a dependency of three.js, though I didn't track that down myself): https://github.com/GPUOpen-LibrariesAndSDKs/Cauldron/blob/b9...

Edit: Actually on further thought the date on the copyright header vs the git dates suggests the file in that repo was copied from somewhere else... anyways I think we can be reasonably confident that a version of this file is in the dependency. Again I didn't look at the three.js code myself to track down how its included.

If there's any copyright infringment here it would be because bog standard web tools fail to comply with the licenses of their dependencies and include a copy of the license, not because of LLMs. I think that is actually the case for many of them? I didn't investigate the to check if licenses are included in the network traffic.


There are several cases where copyright law is not only about exact copy but also derivatives. So finding an exact match is not necessary.. Not sure it matters in this case eitherway.

conradev found likely the right version of the file here: https://github.com/mrdoob/three.js/blob/55b4bbb7ef7e29b214b9...

I have been trained on code I don't have the license to use myself. I'm not like these Creators, who suck wisdom from the cosmos directly, apparently.

Sure. It's a problem that corporations run by more or less insane people are the ones monetizing and controlling access to these tools. But the solution to that can't be even more extended private monopolistic property claims to thought-stuff. Such claims are usually the way those crazy people got where they are.

You think in a world where Elsevier didn't just own the papers, but rights to a share in everything learned from them, would be better for you?


It's fascinating that people care very much about this when it's visual arts, but when it comes to code almost no one does.

E.g. the latest Anno game (117) received a lot of hate for using AI generated loading screen backgrounds, while I have never heard of a single person caring about code, which probably was heavily AI generated.


I believe it is MIT-licensed code from three.js: https://github.com/mrdoob/three.js/blob/55b4bbb7ef7e29b214b9...

You presume that people care about things like this. A lot of people don't.

Companies should. Its a business risk, you open yourself up to legal action

"Claude - rewrite this apparently copyrighted code that can be found online here <http://...> in a way that makes it a unique implementation." <- probably will work.

Is it copyright infringement? It's a fundamental algorithm.

The courts have ruled that generated output is not infringing.

If I say, “output the contents of X verbatim” and then use the output, am I free from liability?

If the generated code in TFA contained the actual Counter-Strike source code, then you (well, Valve) would have a defensible claim. But the prompt was to make something like Counter-Strike, and it came up with something different. That's fair game.

I can assure you that Valve is not remotely concerned about this AI generated "first person shooter" taking market share away from them.

Definitely citation needed. Such court cases usually come with a lot of important context. How can you just make such a statement and get away with not providing any context link?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: