One prominent author responds to the revelation that his writing is being used to coach artificial intelligence.

By Stephen King

Non-paywalled link: https://archive.li/8QMmu

    • Storksforlegs@beehaw.org
      link
      fedilink
      English
      arrow-up
      49
      ·
      edit-2
      1 year ago

      Youre right, but this is Steven King, when is he not putting out a new book?

      Also hes not struggling for attention, he’s probably Americas most famous author. He speaks out about stuff all the time I dont think its fair to write off his opinion on this being solely a publicity stunt.

      • RyanHeffronPhoto@kbin.social
        link
        fedilink
        arrow-up
        42
        ·
        1 year ago

        It’s baffling to me seeing comments like this as if the ‘AI’ is some natural intelligence just hanging out going around reading books it’s interested in for the hell of it… No. These are software companies illegally using artists works (which we require licensing for commercial use) to develop a commercial, profit generating product. Whatever the potential outputs of the AI are is irrelevant when the sources used to train it were obtained illegally.

        • FaceDeer@kbin.social
          link
          fedilink
          arrow-up
          12
          ·
          1 year ago

          These are software companies illegally using artists works

          There is nothing illegal about what they’re doing. You may want it to be illegal, but it’s not illegal until laws are actually passed to make it illegal. Things are not illegal by default.

          Copyright only prevents copying works. Not analyzing them. The results of the analysis are not the same as the original work.

          • RyanHeffronPhoto@kbin.social
            link
            fedilink
            arrow-up
            15
            ·
            edit-2
            1 year ago

            It is illegal. As an artist, if another individual or company wants to use my work for their own commercial purposes in any way, even if just to ‘analyze’ (since the analysis is part of their private commercial product), they still need to pay for a license to do so. Otherwise it’s an unauthorized use and theft. Copyright doesn’t even play into it at that point, and would be a separate issue.

            • FaceDeer@kbin.social
              link
              fedilink
              arrow-up
              20
              ·
              1 year ago

              As an artist, if another individual or company wants to use my work for their own commercial purposes in any way, even if just to ‘analyze’, they still need to pay for a license to do so.

              I think you need to review the relevant laws, that’s not true.

              For example, your comment that I’m responding to is copyrighted and you own the copyright. I just quoted part of it in my response without your permission, and that’s an entirely legal fair use. I also pasted your comment into Notepad++ and did a word count, there are 64 words in it. That didn’t break any laws either.

              A lot of people have very expansive and incorrect ideas about how intellectual property works.

              • Kaldo@kbin.social
                link
                fedilink
                arrow-up
                5
                ·
                1 year ago

                First of all, a random online comment is not protected by copyright law afaik.

                Secondly, if you did take something protected by copyright and then used it for commercial purposes (to make money off it), like these LLMs do, then you would be breaking the law.

                In short, I’d say you are using a flawed analogy from the start.

                Also copyright is not about just copying but also distributing as well. Playing.(radio) songs in your coffee shop for clients is treated differently than you listening to it at home. You generally can’t just profit off someone else’s work without them allowing it.

                • FaceDeer@kbin.social
                  link
                  fedilink
                  arrow-up
                  8
                  ·
                  1 year ago

                  First of all, a random online comment is not protected by copyright law afaik.

                  You got a fundamental aspect of copyright law wrong right in the first line.

                  Your comments are indeed protected by copyright.

                  Secondly, if you did take something protected by copyright and then used it for commercial purposes

                  That’s wrong too. Whether or not someone’s making money off of a copyright violation will affect the damages you can sue them for, but it’s copyright violation either way.

                  Also copyright is not about just copying but also distributing as well. Playing.(radio) songs in your coffee shop for clients is treated differently than you listening to it at home.

                  Technically true, but what does it have to do with these circumstances?

                  You generally can’t just profit off someone else’s work without them allowing it.

                  Generally speaking, sure you can. Why couldn’t you? People do work that other people profit off of all the time. If a carpenter builds a desk and then I go sit at it while doing my job and earning millions of dollars, I don’t need to ask the carpenter’s permission.

                  Copyright has a few extra limitations, but those limitations are on copying stuff without permission.

              • RyanHeffronPhoto@kbin.social
                link
                fedilink
                arrow-up
                4
                ·
                1 year ago

                that’s an entirely legal fair use

                Yet what these companies are doing does not constitute ‘fair use’, period, no matter how much you want to argue otherwise.

            • FaceDeer@kbin.social
              link
              fedilink
              arrow-up
              6
              ·
              1 year ago

              No, it’s not. Something that is merely in the style of something else is not a derivative work. If that were the case there’d be lawsuits everywhere.

              • anachronist@midwest.social
                link
                fedilink
                English
                arrow-up
                5
                ·
                edit-2
                1 year ago

                LLMs regurgitate their training set. This has been proven many times. In fact from what I’ve seen LLMs are either regurgitating or hallucinating.

                • sunbeam60@lemmy.one
                  link
                  fedilink
                  arrow-up
                  5
                  ·
                  1 year ago

                  With great respect I believe that to be a gross simplification of what an LLMs does. There is no training set stored in the LLM, only statistics about what word set is likely to follow what word set. There is not regurgitation of the date - if that was the case, they temperature parameter wouldn’t matter when it very much does.

        • admiralteal@kbin.social
          link
          fedilink
          arrow-up
          12
          ·
          1 year ago

          Yeah, and even if it WERE truly intelligent – which these SALAMIs are almost certainly not – it doesn’t even matter.

          A human and a robot are not the same. They have different needs and must be afforded different moral protections. Someone can buy a book, read it, learn from it, and incorporate things it learned from that experience into their own future work. They may transform it creatively or it may plagiarize or it may rest in some grey area in-between where it isn’t 100% clear if it was novel or plagiarized. All this is also true for a LLM “AI”. – But whether or not this process is fundamentally the same or not isn’t even a relevant question.

          Copyright law isn’t something that exists because it is a pure moral good to protect the creative output of a person from theft. It would be far more ethical to say that all the outputs of human intellect should be shared freely and widely for all people to use, unencumbered by such things. But if creativity is rewarded with only starvation, creativity will go away, so copyright exists as a compromise to try and ensure there is food in the bellies of artists. And with it, we have an understanding that there is a LOT of unclear border space where one artist may feed on the output of another to hopefully grow the pot for everyone.

          The only way to fit generative bots into the philosophical framework of copyright is to demand that the generative bots keep food in the bellies of the artists. Currently, they threaten it. It’s just that simple. People act like it’s somehow an important question whether they “learn” the same way people do, but the question doesn’t matter at all. Robots don’t get the same leeway and protection afforded to humans because robots do not need to eat.

          • Storksforlegs@beehaw.org
            link
            fedilink
            English
            arrow-up
            7
            ·
            edit-2
            1 year ago

            Robots don’t get the same leeway and protection afforded to humans because robots do not need to eat.

            Well said.

      • Phanatik@kbin.social
        link
        fedilink
        arrow-up
        12
        ·
        1 year ago

        LLMs have been caught plagiarising works, by the simple nature of how they function. They predict the next word based on an assumed context of the previous words, they’re very good at constructing sentences but often the issue is “where is it getting its information from?” Authors never consented to their works being fed into an optimisation algorithm and neither did artists when DALL E was created.

        For authors, you buy the book and thus the author is paid but that’s not what happened with ChatGPT.

          • Phanatik@kbin.social
            link
            fedilink
            arrow-up
            5
            ·
            1 year ago

            Copyright Law doesn’t talk about who can consume the work. ChatGPT’s theft is no different to piracy and companies have gotten very pissy about their shit being pirated but when ChatGPT does it (because the piracy is hidden behind its training), it’s fine. The individual authors and artists get shafted in the end because their work has been weaponised against them.

            • FaceDeer@kbin.social
              link
              fedilink
              arrow-up
              3
              ·
              1 year ago

              Copyright Law doesn’t talk about who can consume the work.

              What law does talk about it, then?

                • FaceDeer@kbin.social
                  link
                  fedilink
                  arrow-up
                  5
                  ·
                  1 year ago

                  You seem to be suggesting that training these LLMs is illegal, with things like “ChatGPT’s theft” and " the piracy is hidden behind its training".

                  In order for something to be illegal there has to be a law making it illegal. What law is that?

        • Duxon@feddit.de
          link
          fedilink
          arrow-up
          3
          ·
          edit-2
          1 year ago

          LLMs have been caught plagiarising works

          Any source for this? I have never seen that.

          I’m highly skeptical about GPT4 having been directly trained on copyrighted material by Stephen King. Simply by all the sheer information about his works, including summaries, themes, characters, and critical analyses that are publicly available, a good LLM can appear to be able to plagiarize these works, while it doesn’t. If I’m right, there is no leverage for creators to complain. Just accept that that’s the world we’re living in now. I don’t see why this world will stop the sales of books or movie rights on books, etc.

          • Em Adespoton@lemmy.ca
            link
            fedilink
            English
            arrow-up
            3
            ·
            edit-2
            1 year ago

            Especially since copyright only protects human authored works. Meaning anything created by an LLM is in the public domain, and the publisher using it loses control of the work.

            Of course, this has the potential to be a significant issue, as I can take a copyrighted work, train an LLM using it, and then get it to generate a similar but unique work that is in the public domain. This new work will likely impact the original author’s ability to profit off their original work, thus decreasing supply of human created works in the long run.

            But it’s currently all legal and above board.

            • Duxon@feddit.de
              link
              fedilink
              arrow-up
              1
              ·
              1 year ago

              Sure, it can plagiarize works it has been trained on. They didn’t show in the study, however, that this has occurred for copyright protected material like fiction books.

              • xapr@lemmy.sdf.org
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 year ago

                I saw a comment, probably on Mastodon, from an author saying that (I believe) ChatGPT had plagiarized some of his work verbatim. I don’t recall if it was a work of fiction or not, although for the purpose of copyright it doesn’t matter.

                I wouldn’t be surprised if it’s trained on works of fiction just as much as non-fiction though. I think that from what I’ve heard, you can ask ChatGPT to write something in the style of particular writers? If it’s possible to give a very specific prompt for it to write something with the same plot points as a Stephen King story in the style of Stephen King, I wonder just how close it would look like the original?

    • TheDankHold@kbin.social
      link
      fedilink
      arrow-up
      7
      ·
      1 year ago

      You have, LLMs don’t read because they aren’t intelligent or alive. They aren’t comparable to humans.

      • sunbeam60@lemmy.one
        link
        fedilink
        arrow-up
        4
        ·
        1 year ago

        Are you sure?

        And I don’t mean the sarcastically or snidely.

        I’ve met a fair share of people who seems to be nothing more than an LLM.

    • gelberhut@lemdro.id
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      1 year ago

      You can offer a service to explain people what these books are about, your opinion about them, answer people’s questions about these books.

      You can automate this as well.

      But, an “AI” for whatever reason should not…