ChatGPT use declines as users complain about ‘dumber’ answers, and the reason might be AI’s biggest threat for the future::AI for the smart guy?

  • ragnar_ok@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    31
    ·
    edit-2
    1 year ago

    The free version of ChatGPT DEFINITELY is dumber than it was even a couple of months ago. Used to be able to get decent, useful code reviews out of it, now it barely knows how to write a nested loop anymore.

    It’s storytelling capabilities fell off a cliff too, the drive towards safely sanitized unoffensive-at-all-times content it can output has rendered every story, choose-your-own-adventure or collaborative role playing game sterile, empty expressions of black and white stories with no nuance allowed where saintly goodness is the only choice possible

    In my own experience, chatGPT has been massively nerfed for the use cases I used it for

    • Pixlbabble@lemmy.world
      link
      fedilink
      English
      arrow-up
      13
      ·
      1 year ago

      I tried a naughtybot for some steamy chat for the first time yesterday. I ended up in fight with the bitch.

    • mememuseum@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 year ago

      Do you know of any good alternatives for role playing? I used it a while back to flesh out some NPCs and location for a DnD game I was planning on running but if it’s gotten noticeably worse I’d like to try something else.

    • Kerfuffle@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      1 year ago

      65B LLaMA models can actually do a pretty good job of writing. Here’s two examples with Guanaco 65B (bold part is my prompt):

      tags: fairy tale, animals, dog, wolf, fox, death, humor, dark storyline, violence, loss

      Animals in a forest setting investigate a series of gruesome murders that recently occured. Written in the style of classics like Animal Farm and Watership Down, the writing is vivid and very descriptive. The dialog is thoughtful and often quite witty. It’s clear the author really understands animal behavior and has done their research well!

      Chapter 1: An Unfortunate Discovery

      A fox by species, a detective by trade, you’ll only have reason to know my name if the worst has come to pass. It’s nearly winter and a bitter wind drives stinging sleet into my face, but it’s still not as chilling as the scene before me now…

      I was summoned here by an urgent message from a panicked hedgehog. She told me she found her friend lying dead on his doorstep with his throat ripped out - black fur matted with blood. I rushed over immediately and saw for myself what had happened. He looked so peaceful despite being cut down in his prime - a young wolf just starting doing well for himself - taken too soon. His home nearby showed no signs of struggle or forced entry. Whoever did this must have caught him off guard somehow. I didn’t hear any howling either. That would have been easy enough to miss though since he lived alone deep in these woods where few creatures tread. I made sure to secure the crime scene and look around carefully without disturbing anything more than necessary until other investigators arrived. They should be here soon. We can’t afford to lose evidence in cases like this. Times are already hard enough as it is…

      As I waited and pondered who might have committed such a horrible act, something felt wrong about all this. Surely a predator like me could smell another predator’s scent? But there were none that stood out among the melange of odors lingering here. How strange…

      Soon after, several others joined me including some friends of mine. A stoic bear named Growler, grumpy badger called Grumpus, and a kindly old raccoon who goes by “Doc” due to his extensive knowledge of medicinal plants and wildlife biology. Together we began our investigation proper by gathering clues and questioning nearby witnesses while trying to minimize contamination through contact with them (we took care to put up caution tape first). Unfortunately, they hadn’t seen much useful apart from occasional glimpses of movement deeper in the undergrowth. Still, we decided to check those areas next along with searching the surrounding woodlands thoroughly for any additional victims. And find one we did, although sadly she suffered the same fate as poor George. Another young wolf lay beside a stream, body cold and stiffening. Someone snapped her neck cleanly like a twig. This couldn’t be coincidence surely? Not two deaths in quick succession like this.


      <prompt> before me or what I must relay when I return home. My mind races with questions as I survey this brutal crime scene; who could do such a thing? Why was no one there to stop them? And how many more victims lie undiscovered further out in these woods? A lone paw print leads away from where a rabbit family once happily resided, now reduced to bloody scraps scattered among splintered twigs and torn fur. This isn’t just some random predator attack, something far more sinister is at play here…

      I turn back toward town, determined to find answers. As I trudge through the snow, each step feels heavier than the last. How will I break the news to those closest to them? What words can possibly bring comfort after so much senseless suffering? I shake my head, trying to focus on the task at hand. There are others depending on me right now, they need someone strong and steady to guide them through this darkness. I won’t let them down.

      The first person I see upon entering town is old man Badger, his white whiskers blown about by the gusting wind. He looks up at me expectantly, “Well Fox, did you find anything?” His voice quavers slightly, betraying his concern for the missing families.

      “I’m afraid I have bad news,” I say somberly, “there were signs of a struggle near the burrow entrance, but no survivors.”

      Badger sighs heavily, “Dear me, this is worse than we feared. We must alert the council immediately!”

      We hurry over to the meeting hall, a large hollow tree that serves as our central gathering place. Inside, several other animals are already waiting anxiously for word of their loved ones. They look up hopefully as we enter, then quickly realize the grim tidings we bear. A collective groan fills the room, followed by muffled sobs and whispered prayers.

      Mayor Bear calls for order, her deep voice cutting through the din. She motions for us to join her at the front of the room, “Please tell us everything you saw, Fox. Every detail may be important.”

      • Calimhero@lemmy.world
        link
        fedilink
        English
        arrow-up
        16
        arrow-down
        3
        ·
        1 year ago

        Writer here. Very sorry to contradict you, but this is absolute shit. It looks good on the surface, but that’s all.

        • Kerfuffle@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          11
          ·
          1 year ago

          Very sorry to contradict you, but this is absolute shit.

          To be clear, I’m talking in relative terms. Would you argue that ChatGPT did a massively better job and didn’t write “absolute shit”?

          It looks good on the surface, but that’s all.

          From some of the stuff I’ve seen published, that might just be enough for certain people. I could even be that “certain people” from time to time, sometimes just the right theme, setting and some time to fill is sufficient.

            • Gutless2615@ttrpg.network
              link
              fedilink
              English
              arrow-up
              7
              arrow-down
              2
              ·
              1 year ago

              Why should we trust you? They’re plenty of shit writing out there that’s Good Enough to get paid.

            • Kerfuffle@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              6
              arrow-down
              1
              ·
              1 year ago

              Trust me, it’s not.

              That’s a silly thing to say. Like I said, I could read something like that from time to time so you’re asking me to trust you over my own experience. People also post/publish fiction of varying quality all over the internet, and it gets read. I’ve seen worse writing than that with hundreds or thousands of reads.

              Maybe you’re only talking about a publisher buying the work and publishing it but you never said anything like that. Even so, I’ve seen some books that were pretty bad so I’m not sure I’d trust you even there.

              • SSTF@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 year ago

                I’ve read two books written by A. American.

                Anybody can get published.

        • Sparking@lemm.ee
          link
          fedilink
          English
          arrow-up
          6
          ·
          edit-2
          1 year ago

          Yeah, while it’s cool that a computer can make a story, I have yet to see one that you would think was written by a human and would want to read.

          • bitcrafter@lemmy.sdf.org
            link
            fedilink
            English
            arrow-up
            5
            ·
            1 year ago

            I don’t know, this story is very reminiscent of the kind of thing my elementary school age cousin writes, but with a greater mastery of vocabulary and grammar. It’s not in any way great, bit it’s charming in it’s own way when held against that (low) standard.

    • HandwovenConsensus@lemm.ee
      link
      fedilink
      English
      arrow-up
      4
      ·
      1 year ago

      Really? I actually found it’s gotten less restrictive recently. Maybe it’s just because now I’ve learned to control the context so it doesn’t perceive a request as offensive.

    • RocksForBrains@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      I find the quality is controllable to a degree by instructing it which sources to use.

      Obviously proofread the damn thing and fix any glaring errors.

    • ribboo@lemm.ee
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      2
      ·
      edit-2
      1 year ago

      It has not gotten worse for coding. GPT4 is incredibly much better, if anything. And it’s total bullshit that it can’t write a nested loop.

      I use it daily for work, so I’d definitely know.

        • ribboo@lemm.ee
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 year ago

          I should honestly have understood that! Never mind then, glad we could clear that up

      • fraydabson@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        I know he didn’t say he wasn’t using gpt4 but it seems pretty clear. So saying it’s bullshit that gpt3.5 is dumber then 4 is pretty inaccurate.

        • ribboo@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Fair enough. Saying chatgpt has gotten dumber is false, saying 3.5 has might be true!

      • Calimhero@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        arrow-down
        1
        ·
        1 year ago

        Don’t know why you’re downvoted. I use GPT4 to code and design infrastructure and it’s very, very good. Around 500% productivity boost.

    • mrmanager@lemmy.today
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      8
      ·
      1 year ago

      Why did they do this? Did government step in and forced them to nerf it, because it was too powerful for citizens to use?

      • Sjatar@sjatar.net
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 year ago

        I’m sorry but this sounds more like a conspiracy theory then a real concern. Occam’s razor probably says it’s expensive to run the service at full power. ChatGPT already generated a cult like following for AI so no need to spend a ton on the service and they can profit of the hype.

        Not that openAI is held back by a government that is somehow afraid that it will empower the people, to do what? Revolution?

      • ragnar_ok@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        I don’t think anybody stepped in, I’m only talking about the free version. It makes some sense they’d gimp it in order to make more people sign up for the paid version, I guess

    • Wololo@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 year ago

      I’ve had similar experiences lately. Either that or it decides to review and analyze my code unprompted when I’m trying to troubleshoot a particularly tricky line. Had a few instances where it tried to borderline gaslight me into thinking that it was right and I was wrong about certain solutions. It feels like it happened rather suddenly too, it never used to do that save for the odd exception.

  • unhook2048@lemmy.world
    link
    fedilink
    English
    arrow-up
    29
    arrow-down
    3
    ·
    1 year ago

    It’s getting worse based on the feedback unfortunately, the need for safety and lack of meaningful deliberation towards how AI companies should operate and what should and should not be done has led Sam and co to be indesicive towards doing anything. Alongside the “morality” of the thing being hyjacked has lead to other AI’s performing better… lead by x employees of OpenAI, with actual bound morals and not inherently relying on user input to train future models, this will be the path forward, this will lead to safe and controlled integration.

    I guess at the core of this, we are afraid of ourselves. We are afraid that the worste of humanity outpaces the better parts, that the inputs and training aren’t altruistic but are more pointedly “bad” or “wrong”, and thus leading to “harmful”, whether through misinformation, lies, or fabrications.

    I hope we find a way to do better. I’m still excited for the future of AI, I mean crap, I’m closer to having a family doctor that’s a robot then I am to a real human doctor.

    • asparagus9001@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      arrow-down
      3
      ·
      1 year ago

      I guess at the core of this, we are afraid of ourselves. We are afraid that the worste of humanity outpaces the better parts, that the inputs and training aren’t altruistic but are more pointedly “bad” or “wrong”, and thus leading to “harmful”, whether through misinformation, lies, or fabrications.

      Is there any reason not to be afraid? I think you could say that Tay was essentially the same idea a few years back and it took like 48 hours loose on the internet for it to spout literal Nazi (1930s-40s German NSDAP) rhetoric. Besides that being a PR disaster - if “AI” is only getting stronger and more integrated into human life and society, that can be pretty problematic.

    • BehindTheBarrier@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      ·
      1 year ago

      They could make it paid only today, and it’d be instantly profitable. Most free users would transition to a free alternative, but the corporate world would easily pay for use. So would some power users. But I’m sure they are making good money with all the API use anyways, the free access is a cheap way to get mass testing and training data.

      • Corkyskog@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        8
        ·
        1 year ago

        I know so many average Joe’s that use it all the time and would instantly pay $5 a month for it, even just a phone app.

  • Open@lemmy.world
    link
    fedilink
    English
    arrow-up
    19
    arrow-down
    1
    ·
    1 year ago

    Article talks about the potential of AI cannibalism were it is now learning from data that it (or other AI) has generated.

    Does ChatGPT use modern data I was under the impression that it’s most modern dataset was a few years old

    • Hyperi0n@lemmy.film
      link
      fedilink
      English
      arrow-up
      7
      ·
      1 year ago

      How does it do your resume?

      You have to feed it all the information. Then it spits that back to you unformatted and you have to format it.

      • d4rknusw1ld@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 year ago

        Exactly. I don’t have to use my brain to write summaries and etc. I’m lazy and don’t deserve a job haha.

      • solstice@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 year ago

        Yeah seriously, I pay a resume writer almost entirely because I don’t want to fuck around with Word formatting it. Lazy I know but totally worth it.

  • Strangle@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    1
    ·
    1 year ago

    Back in my day, we used to call ‘prompt engineering’ ‘asking a question’.

      • nottheengineer@feddit.de
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 year ago

        And then we had to actively unlearn that google fu because google no longer works with keywords, but rather has an NLP pipeline that expects a question.

        • LordXenu@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          1 year ago

          So that’s why I can’t find shit. I always just use keywords, asking a whole question seems almost wasteful.

          • Sylver@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 year ago

            Wow, no wonder most of the old search commands don’t even feel like they work…

            • clearleaf@lemmy.world
              link
              fedilink
              English
              arrow-up
              8
              ·
              1 year ago

              The last straw in utterly ruining it was when they removed using quotes to get exact matches. That was the only way to cut through the garbage. Now the only use for google search is searching within specific websites that never bothered to make their own decent search function.

              • Hyperi0n@lemmy.film
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 year ago

                Using quotes for exact matches works in both Google and Bing. Literally just tested it

            • LordXenu@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              Fucking right?!

              It adds this weird abstract where the search keywords the question you ask but that requires you to ask the right question. Sometimes I just need the page that has the most mentions of a specific word or phrase.

        • kmkz_ninja@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Someone used the phrase “dead-catting” on here the other day, so I went to google to figure out what the hell that meant. It gave me reults for the Catechism. Between the actual phrase “dead cat bounce” and “Catechism”, it chose the latter to show me.

        • Flying Squid@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          That’s DJ Qualls, who was great in Z Nation. Too bad no one watched Z Nation. It was hilariously insane. I mean, radioactive post-nuke zombies? At least it got an ending.

          • Hyperi0n@lemmy.film
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 year ago

            I caught an episode of it on sci-fi and it was great. Much better than its contemporaries.

    • CosmoNova@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      1 year ago

      They got to have a special termonology because what they do is oh so special. Some AI users act like they’re Louise Banks from the movie Arrival cracking the code to an alien language or something. And I don’t think it’s far fetched to assume they’re often from the same breed who had NFT monkeys as their twitter pfp about 18 months ago.

      • Gerbler@lemmy.ml
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        1 year ago

        Blockchain > Crypto > NFTs > LLMs > whatever’s next.

        These people will always be sniffing around for the next big thing to oversell and fleece their audience.

  • fidodo@lemm.ee
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    AI cannibalism simply isn’t a thing yet. It definitely will be and good models will need to spend a lot of time and money sourcing good training data, but the models are not up to date enough to be contaminated yet.

    I’m very confident the degradation has come from them trying to scale up. Generative AI is the most expensive thing on the cloud you can provide, and not only are they trying to make it faster, they’re trying to roll it out for way more consumption. Major optimizations will require an algorithmic breakthrough so in the meanwhile all they can really do is find which corners they can cut that are less bad.

  • Immersive_Matthew@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 year ago

    I had my first WTF moment with AI today. I use the paid Chat-GPT+ to help me with my c# in Unity. It has been a struggle to use, even with the smaller basic scripts you can paste into its character limited prompt, as they often have compile errors. That said if you keep feeding it the errors, guide it where it is making mistakes in design, logic etc. it can often produce a working script about 60-70% of the time. It takes a fair amount of time quite often to get to that working script but the code that finally works is good.

    Today I was asking it to edit a large c# script with 1 small change that meant lots of repetitive edits and references. Perfect for AI, however ChatGPT+ really struggled on this one which was a surprise. We went round and round with edits and ultimately more and more errors appeared in the console. It often ends up in these never ending coding edit loops to fix the next set errors from the last corrected script. We are taking 3 hours of this with ChatGPT+ finally saying that it needs to be able to see more of my project which of course it cannot due to many of its input limitations including number of characters so that is often when I give up. That is the 30-40% that do not work out. Real bummer as I invest so much time for no results.

    It was at the movement so gave up today that a YouTube notification popped up about how Claude.ai is even better than ChatGPT so I gave it the initial prompt that I gave ChatGPT above and it got the code right the first time. WOW!!!

    Only issue was it would stop spitting out code every 300 or so lines (unsure what the character limit is). To get around this I just asked if it could give me the code from line 301 onwards until I had the full script.

    Unsure if this one situation confirms coding with Claude.ai is better than ChatGPT+, but it certainly has my attention and I will be using it more this week as maybe that $20/month for ChatGPT+ no longer makes sense. Claude is free with no plans for a premium service it said. Unsure if this is true as I have not spent anytime investing it yet, but I will be.

    • foggy@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      I had a similar use case.

      I need it to alphabetize a list for me, only I need it to alphabetize the inner, non HTML elements. simplified, but like:

      <p>banana</p> <p>apple</p> <p>french fries</p>

      It would get like 5 or 6 in alphabetical order and then just fuck it all up.

  • TheFutureIsDelaware@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 year ago

    ChatGPT usage is a very poor metric. Anything interesting is happening via API. Even the chat completion endpoint still isn’t “ChatGPT” on its own. None of these complaints about it being “dumber” apply to the API outputs. OpenAI don’t care about nerfing chatGPT because it’s not their real product.

  • Nobilmantis@feddit.it
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    1 year ago

    I feel like it is still too early to talk about “AI cannibalization” or “feedback loops” as that would mean that a big proportion of the training data is AI-generated content itself, against all the rest that could be scraped off the internet or the public domain, I don’t think this is happening yet.

    What people might experience instead, and perceive as dumbness, is that given that the datasets used to train AIs cannot really change that much in a short time (unless we wait for another hundred years so humans can produce actual human original content to train the AI again), and as the mathematical models used to build answers based on the datasets are pretty much the same, a person talking with ChatGPT will over time perceive more and more that the answers are built using a “pattern” or a “structure”, aka the model derived from feeding the dataset into the AI training itself.

    Just my pennies on this, let’s also consider that is in human nature to be excited for something new that sounds cool, and then to get bored when you got accustomed to it and pushed it to its boundaries.

  • CosmoNova@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Why is it relevant what Peter Yang - Roblox product lead and enthusiastic child labor exploiter - tweets about it? Let me guess he’s a prompt engineer?