OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

L4sBot@lemmy.world · 2 years ago

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

stappern@lemmy.one · 2 years ago

So did I so what? Is my brain property of Warner now?

OkToBeTakei@lemm.ee · edit-2 2 years ago

deleted by creator

Asuka@sh.itjust.works · 2 years ago

If I read Harry Potter and wrote a novel of my own, no doubt ideas from it could consciously or subconsciously influence it and be incorporated into it. Hey is that any different from what an LLM does?

stappern@lemmy.one · 2 years ago

Not what happened in this case tho.

TropicalDingdong@lemmy.world · edit-2 6 months ago

Removed by mod

OkToBeTakei@lemm.ee · edit-2 2 years ago

deleted by creator

wmassingham@lemmy.world · edit-2 2 years ago

They can own it, actually. If you use the characters of Bugs Bunny, etc., or the setting (do they have a canonical setting?) then Warner does own the rights to the material you’re using.

For example, see how the original Winnie the Pooh material just entered public domain, but the subsequent Disney versions have not. You can use the original stuff (see the recent horror movie for an example of legal use) but not the later material like Tigger or Pooh in a red shirt.

Now if your work is satire or parody, then you can argue that it’s fair use. But generally, most companies don’t care about fan fiction because it doesn’t compete with their sales. If you publish your Harry Potter fan fiction on Livejournal, it wouldn’t be worth the money to pay the lawyers to take it down. But if you publish your Larry Cotter and the Wizard’s Rock story on Amazon, they’ll take it down because now it’s a competing product.

joxese3341@sh.itjust.works · edit-2 1 year ago

deleted by creator

Sethayy@sh.itjust.works · 2 years ago

I think its more like writing a loony toons fanfic based only on pirated material

stappern@lemmy.one · 2 years ago

How are you gonna prove that I watched it on tv or torrented?

Sethayy@sh.itjust.works · 1 year ago

Can’t but theyre pretty open on how they trained the model, so like almost admitted guilt (though they werent hosting the pirated content, its still out there and would be trained on). Cause unless they trained it on a paid Netflix account, there’s no way to get it legally.

Idk where this lands legally, but I’d assume not in their favour

newIdentity@sh.itjust.works · 2 years ago

Your brain isn’t an AI model

OR IS IT?

TwilightVulpine@lemmy.world · 2 years ago

You joke but AI advocates seem to forget that people have fundamentally different rights than tools and objects. A photocopier doesn’t get the right to “memorize” and “learn” from a text that a human being does. As much as people may argue that AIs work different, AIs are still not people.

And if they ever become people, the situation will be much more complicated than whether they can imitate some writer. But we aren’t there yet, even their advocates just uses them as tools.

kmkz_ninja@lemmy.world · 2 years ago

How do you see that as a difference? Tools are extensions of ourselves.

Restricting the use of LLMs is only restricting people.

TwilightVulpine@lemmy.world · 2 years ago

When we get to the realm of automation and AI, calling tools just an “extension of ourselves” doesn’t make sense.

Especially not when the people being “extended” by Machine Learning models did not want to be “extended” to begin with.

CoderKat@lemm.ee · edit-2 2 years ago

It’s honestly a good question. It’s perfectly legal for you to memorize a copyrighted work. In some contexts, you can recite it, too (particularly the perilous fair use). And even if you don’t recite a copyrighted work directly, you are most certainly allowed to learn to write from reading copyrighted books, then try to come up with your own writing based off what you’ve read. You’ll probably try your best to avoid copying anyone, but you might still make mistakes, simply by forgetting that some idea isn’t your own.

But can AI? If we want to view AI as basically an artificial brain, then shouldn’t it be able to do what humans can do? Though at the same time, it’s not actually a brain nor is it a human. Humans are pretty limited in what they can remember, whereas an AI could be virtually boundless.

If we’re looking at intent, the AI companies certainly aren’t trying to recreate copyrighted works. They’ve actively tried to stop it as we can see. And LLMs don’t directly store the copyrighted works, either. They’re basically just storing super hard to understand sets of weights, which are a challenge even for experienced researchers to explain. They’re not denying that they read copyrighted works (like all of us do), but arguably they aren’t trying to write copyrighted works.

SubArcticTundra@lemmy.ml · 2 years ago

No, because you paid for a single viewing of that content with your cinema ticket. And frankly, I think that the price of a cinema ticket (= a single viewing, which it was) should be what OpenAI should be made to pay.

stappern@lemmy.one · 2 years ago

I didn’t. I torrented it.