Top physicist says chatbots are just ‘glorified tape recorders’::Leading theoretical physicist Michio Kaku predicts quantum computers are far more important for solving mankind’s problems.

  • jaden@partizle.com
    link
    fedilink
    English
    arrow-up
    93
    arrow-down
    24
    ·
    1 year ago

    A physicist is not gonna know a lot more about language models than your average college grad.

    • JoBo@feddit.uk
      link
      fedilink
      English
      arrow-up
      41
      arrow-down
      18
      ·
      1 year ago

      That’s absolute nonsense. Physicists have to be excellent statisticians and, unlike data scientists, statisticians have to understand where the data is coming from, not just how to spit out simple summaries of enormously complex datasets as if it had any meaning without context.

      And his views are exactly in line with pretty much every expert who doesn’t have a financial stake in hyping the high tech magic 8-ball. On the Dangers of Stochastic Parrots.

      • Jerkface@lemmy.world
        link
        fedilink
        English
        arrow-up
        22
        arrow-down
        14
        ·
        1 year ago

        Okay but LLMs have multiplied my productivity far more than any tape recorder ever could or ever will. The statement is absolute nonsense.

        • JoBo@feddit.uk
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          2
          ·
          edit-2
          1 year ago

          Do you imagine that music did not exist before we had the means to record it? Or that it had no effect on the productivity of musicians?

          Vinyl happened before tape but in the early days of computers, tape was what we used to save data and code. Kids TV programmes used to play computer tapes for you to record at home, distributing the code in an incredibly efficient way.

          • Spzi@lemm.ee
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 year ago

            Kids TV programmes used to play computer tapes for you to record at home, distributing the code in an incredibly efficient way.

            Could you expand on this? Sounds interesting.

            • JoBo@feddit.uk
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              1 year ago

              [New comment instead of editing the old so that you see it]

              I managed to find a video of an old skool game loading. That’s what it sounded like when you loaded a program and it’s exactly what they’d play on the TV so you could create your tape.

              • Spzi@lemm.ee
                link
                fedilink
                English
                arrow-up
                2
                ·
                1 year ago

                Thank you very much for the effort! I also searched for text or video, but found none.

                I understand now what you previously meant, streaming code via TV.

                That’s what it sounded like when you loaded a program and it’s exactly what they’d play on the TV so you could create your tape.

                Now I have a new confusion: Why would they let the speaker play the bits being processed? It surely was technically possible to load a program into memory without sending anything to the speaker. Or wasn’t it, and it was a technical necessity? Or was it an artistic choice?

                • JoBo@feddit.uk
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  1 year ago

                  I assume it was because they used ordinary tape recorders, that people would otherwise use as dictaphones or to play music. I guess there wasn’t a way to transfer the data silently because the technology was designed to play sound? We had to wait for the floppy disk for silent-ish loading. Ish because they click-clacked a lot, but that was moving parts rather than the code itself.

            • JoBo@feddit.uk
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              1 year ago

              They just played the tapes on TV, kinda screechy, computer-y sounds. They’d tell you when to press record on your cassette player before they started. You’d hold it close to the TV speakers until it finished playing, then plug the cassete player in to your computer, and there’d be some simple free game to play. I didn’t believe it would work but it did. I still don’t believe it worked. But it did.

              There must be a clip somewhere on the internet but my search skills are nowhere near good enough to find one.

        • BargsimBoyz@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          5
          ·
          edit-2
          1 year ago

          Your statement and the original one can both be in sync with another.

          Microsoft Word is just a glorified notepad but it still improves my productivity significantly.

          And everyone will have different uses depending on their needs. Chatgpt has done nothing for my productivity/usually adds work as I have to double check all the nonsensical crap it gives me for example and then correct it.

          • LibertyLizard@slrpnk.net
            link
            fedilink
            English
            arrow-up
            8
            arrow-down
            2
            ·
            edit-2
            1 year ago

            I think describing word processors as glorified notepads would also be extremely misleading, to the extent that I would describe that statement as incorrect.

          • fidodo@lemm.ee
            link
            fedilink
            English
            arrow-up
            5
            ·
            1 year ago

            Those are all gross oversimplifications. By the same logic the internet is just a glorified telephone, the computer is a glorified abacus, the telephone is just a glorified messenger pigeon. There are lots of people who don’t understand LLMs and exaggerate its capabilities but dismissing it is also bad.

      • jaden@partizle.com
        link
        fedilink
        English
        arrow-up
        8
        ·
        1 year ago

        I had that paper in mind when I said that. Doesn’t exhibit a very thorough understanding of how these models actually work.

        A common argument is that the human brain very well may work the exact same, ergo the common phrase, “I’m a stochastic parrot and so are you.”

        • JoBo@feddit.uk
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          That’s a Sam Altman line and all it shows is that he does not know how knowledge is acquired, developed, or applied. He has no concept of how the world actually works and has likely never thought deeply about anything in his life beyond how to grift profitably. And he can’t afford to examine his (professed) beliefs because he’s trying to cash out on a doomed fantasy before too many people realise it is doomed.

      • Chickenstalker@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        4
        ·
        1 year ago

        Nope. Biologists also use statistical models and also know where the data is coming from etc etc. They are not experts in AI. This Michio Kaku guy is more like the African American Science Guy to me, more concerned with being a celeb.

        • JoBo@feddit.uk
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          2
          ·
          edit-2
          1 year ago

          Biologists are (often) excellent statisticians too, you’re correct. That’s why the most successful quants are biologists or physicists, despite not having trained in finance.

          They’re not experts in (the badly misnamed) AI. They’re experts in the statistical models AI uses. They know an awful lot more than the likes of Sam Altman and the AI-hypers. Because they’re trained specialists, not techbro grifters.

    • ComradeKhoumrag@infosec.pub
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      1 year ago

      I disagree, physics is the foundational science of all sciences. It is the science with the strongest emphasis on understanding math well enough to derive the equations that actually take form in the real world

  • demesisx@infosec.pub
    link
    fedilink
    English
    arrow-up
    61
    arrow-down
    9
    ·
    1 year ago

    Yes. Glorified tape recorders that can provide assistance and instruction in certain domains that is very useful beyond what a simple tape recorder could ever provide.

      • deranger@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        2
        ·
        1 year ago

        I reckon it’s somewhere in between. I really don’t think it’s going to be the revolution they pitched, or some feared. It’s also not going to be completely dismissed.

        I was very excited when I started to play with various AI tools, then about two weeks in I realized how limited they are and how they need a lot of human input and editing to produce a good output. There’s a ton of hype and it’s had little impact on the regular persons life.

        Biggest application of AI I’ve seen to date? Making presidents talk about weed, etc.

          • deranger@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            2
            ·
            1 year ago

            I use some image generation tools and LLMs.

            I think it’s a safe bet to estimate it will work out to be somewhere in the middle of the two extremes. I don’t think AI/ML is going to be worthless, but I also don’t think it’s going to do all these terrible things people are afraid of. It will find its applications and be another tool we have.

      • Meowoem@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        1 year ago

        I find it fascinating how different sections of society see the tech totally differently, a lot of people seem to think because it can’t do everything it can do nothing. I’ve been fascinated by ai for decades so to have finally cracked language comprehension feels like huge news because it opens so many other doors - especially in human usability of new tools.

        We’re going to see a huge shift in how we use technology, I don’t think it will be long before we’re used to telling the computer what we want it to do - organising pictures, sorting inventory in a game, finding products in shops… Being able to actually tell it ‘i want a plug for my bath’ and not being offered electrical plugs, even being told ‘there are three main types as seen here, you will need to know the size of your plug hole to ensure the correct fit’

        As the technology refines we’ll see it get increasingly reliable for things like legal and medical knowledge, even if it’s just referring people to doctors it could save a huge amount of lives.

        It’s absolutely going to have as much effect on our lives as the internet’s development did, but I think a lot of people forget how significant that really was.

    • whatisallthis@lemm.ee
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      1 year ago

      Well it’s like a super tape recorder that can play back anything anyone has ever said on the internet.

    • kinsnik@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      yeah, a “tape recorder” that adapts to what you ask… if there was a tape recorder before where i could put the docs i written and get recommendations on how to improve my style and organization, i missed it

  • Goodman@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    32
    arrow-down
    3
    ·
    1 year ago

    I wouldn’t call this guy a top physicist… I mean he can say what he wants but you shouldn’t be listening to him. I also love that he immediately starts shilling his quantum computer book right after his statements about AI. And mind you that this guy has some real garbage takes when it comes to quantum computers. Here is a fun review if you are interested https://scottaaronson.blog/?p=7321.

    The bottom line is. You shouldn’t trust this guy on anything he says expect maybe string theory which is actually his specialty. I wish that news outlets would stop asking this guy on he is such a fucking grifter.

    • hoodlem@hoodlem.me
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      1
      ·
      edit-2
      1 year ago

      I wouldn’t call this guy a top physicist… I mean he can say what he wants but you shouldn’t be listening to him.

      Yeah I don’t see how he has any time to be a “top physicist” when it seems he spends all his time on as a commenter on tv shows that are tangentially related to his field. On top of that LLM is not even tangentially related.

  • MooseBoys@lemmy.world
    link
    fedilink
    English
    arrow-up
    21
    arrow-down
    1
    ·
    edit-2
    1 year ago

    Leading theoretical physicist Michio Kaku

    I wouldn’t listen too closely to discount Neil deGrasse Tyson these days, especially in domains in which he has no qualifications whatsoever.

  • A2PKXG@feddit.de
    link
    fedilink
    English
    arrow-up
    19
    ·
    1 year ago

    Just set your expectations right, and chat it’s are great. They aren’t intelligent. They’re pretty dumb. But they can say stuff about a huge variety of domains

  • PixelProf@lemmy.ca
    link
    fedilink
    English
    arrow-up
    17
    arrow-down
    1
    ·
    edit-2
    1 year ago

    I understand that he’s placing these relative to quantum computing, and that he is specifically a scientist who is deeply invested in that realm, it just seems too reductionist from a software perspective, because ultimately yeah - we are indeed limited by the architecture of our physical computing paradigm, but that doesn’t discount the incredible advancements we’ve made in the space.

    Maybe I’m being too hyperbolic over this small article, but does this basically mean any advancements in CS research are basically just glorified (insert elementary mechanical thing here) because they use bits and von Neumann architecture?

    I used to adore Kaku when I was young, but as I got into academics, saw how attached he was to string theory long after it’s expiry date, and seeing how popular he got on pretty wild and speculative fiction, I struggle to take him too seriously in this realm.

    My experience, which comes with years in labs working on creative computation, AI, and NLP, these large language models are impressive and revolutionary, but quite frankly, for dumb reasons. The transformer was a great advancement, but seemingly only if we piled obscene amounts of data on it, previously unspeculated of amounts. Now we can train smaller bots off of the data from these bigger ones, which is neat, but it’s still that mass of data.

    To the general public: Yes, LLMs are overblown. To someone who spent years researching creativity assistance AI and NLPs: These are freaking awesome, and I’m amazed at the capabilities we have now in creating code that can do qualitative analysis and natural language interfacing, but the model is unsustainable unless techniques like Orca come along and shrink down the data requirements. That said, I’m running pretty competent language and image models on 12GB of relatively cheap consumer video card, so we’re progressing fast.

    Edit to Add: And I do agree that we’re going to see wild stuff with quantum computing one day, but that can’t discount the excellent research being done by folks working with existing hardware, and it’s upsetting to hear a scientist bawk at a field like that. And I recognize I led this by speaking down on string theory, but string theory pop science (including Dr. Kaku) caused havoc in people taking physics seriously.

    • Goodman@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      11
      ·
      edit-2
      1 year ago

      He is trying to sell his book on quantum computers which is probably why he brought it up in the first place

      • PixelProf@lemmy.ca
        link
        fedilink
        English
        arrow-up
        7
        ·
        1 year ago

        Oh for sure. And it’s a great realm to research, but pretty dirty to rip apart another field to bolster your own. Then again, string theorist…

    • joe@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      1 year ago

      My opinion is that a good indication that LLMs are groundbreaking is that it takes considerable research to understand why they give the output they give. And that research could be for just one prediction of one word.

      • PixelProf@lemmy.ca
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        1
        ·
        1 year ago

        For me, it’s the next major milestone in what’s been a roughly decade-ish trend of research, and the groundbreaking part is how rapidly it accelerated. We saw a similar boom in 2012-2018, and now it’s just accelerating.

        Before 2011/2012, if your network was too deep, too many layers, it would just breakdown and give pretty random results - it couldn’t learn - so they had to perform relatively simple tasks. Then a few techniques were developed that enabled deep learning, the ability to really stretch the amount of patterns a network could learn if given enough data. Suddenly, things that were jokes in computer science became reality. The move from deep networks to 95% image recognition ability, for example, took about 1 years to halve the error rate, about 5 years to go from about 35-40% incorrect classification to 5%. That’s the same stuff that powered all the hype around AI beating Go champions and professional Starcraft players.

        The Transformer (the T in GPT) came out in 2017, around the peak of the deep learning boom. In 2 years, GPT-2 was released, and while it’s funny to look back on now, it practically revolutionized temporal data coherence and showed that throwing lots of data at this architecture didn’t break it, like previous ones had. Then they kept throwing more and more and more data, and it kept going and improving. With GPT-3 about a year later, like in 2012, we saw an immediate spike in previously impossible challenges being destroyed, and seemingly they haven’t degraded with more data yet. While it’s unsustainable, it’s the same kind of puzzle piece that pushed deep learning into the forefront in 2012, and the same concepts are being applied to different domains like image generation, which has also seen massive boosts thanks in-part to the 2017 research.

        Anyways, small rant, but yeah - it’s hype lies in its historical context, for me. The chat bot is an incredible demonstration of the incredible underlying advancements to data processing that were made in the past decade, and if working out patterns from massive quantities of data is a pointless endeavour I have sad news for all folks with brains.

          • PixelProf@lemmy.ca
            link
            fedilink
            English
            arrow-up
            3
            ·
            edit-2
            1 year ago

            Hmm… Nothing off the top of my head right now. I checked out the Wikipedia page for Deep Learning and it’s not bad, but quite a bit of technical info and jumping around the timeline, though it does go all the way back to the 1920’s with it’s history as jumping off points. Most of what I know came from grad school and having researched creative AI around 2015-2019, and being a bit obsessed with it growing up before and during my undergrad.

            If I were to pitch some key notes, the page details lots of the cool networks that dominated in the 60’s-2000’s, but it’s worth noting that there were lots of competing models besides neural nets at the time. Then 2011, two things happened at right about the same time: The ReLU (a simple way to help preserve data through many layers, increasing complexity) which, while established in the 60’s, only swept everything for deep learning in 2011, and majorly, Nvidia’s cheap graphics cards with parallel processing and CUDA that were found to majorly boost efficiency of running networks.

            I found a few links with some cool perspectives: Nvidia post with some technical details

            Solid and simplified timeline with lots of great details

            It does exclude a few of the big popular culture events, like Watson on Jeopardy in 2011. To me it’s fascinating because Watson’s architecture was an absolute mess by today’s standards, over 100 different algorithms working in conjunction, mixing tons of techniques together to get a pretty specifically tuned question and answer machine. It took 2880 CPU cores to run, and it could win about 70% of the time at Jeopardy. Compare that to today’s GPT, which while ChatGPT requires way more massive amounts of processing power to run, have an otherwise elegant structure and I can run awfully competent ones on a $400 graphics card. I was actually in a gap year waiting to go to my undergrad to study AI and robotics during the Watson craze, so seeing it and then seeing the 2012 big bang was wild.

  • ClemaX@lemm.ee
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    1
    ·
    1 year ago

    Well, one could argue that our brain is a glorified tape recorder.

  • Feathercrown@lemmy.world
    link
    fedilink
    English
    arrow-up
    8
    ·
    1 year ago

    He’s a physicist. That doesn’t make him wise, especially in topics that he doesn’t study. This shouldn’t even be an article.

    • velvetinetouch@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      They could know quite a lot, ML is still a rather shallow field compared to the more established sciences, it’s arguably not even a proper science yet, perhaps closer to alchemy than chemistry. Max Tegmark is a cosmologist and he has learned it well enough for his opinion to count, this guy on the other hand is famous for his bad takes and has apparently gotten a lot wrong about QC even though he wrote a whole book about it.

  • 👍Maximum Derek👍@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    6
    ·
    1 year ago

    I call them glorified spread sheets, but I see the correlation to recorders. LLMs, like most “AIs” before them, are just new ways to do line of best fit analysis.

      • 👍Maximum Derek👍@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1
        ·
        1 year ago

        It is. The tokenization and intent processing are the thing that impress me most. I’ve been joking since the 90’s that the most impressive technological innovation shown on Star Trek TNG was computers that understand the intent of instructions. Now we have that… mostly.

    • Prager_U@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      1 year ago

      To counter the grandiose claims that present-day LLMs are almost AGI, people go too far in the opposite direction. Dismissing it as being only “line of best fit analysis” fails to recognize the power, significance, and difficulty of extracting meaningful insights and capabilities from data.

      Aside from the fact that many modern theories in human cognitive science are actually deeply related to statistical analysis and machine learning (such as embodied cognition, Bayesian predictive coding, and connectionism), referring to it as a “line” of best fit is disingenuous because it downplays the important fact that the relationships found in these data are not lines, but rather highly non-linear high-dimensional manifolds. The development of techniques to efficiently discover these relationships in giant datasets is genuinely a HUGE achievement in humanity’s mastery of the sciences, as they’ve allowed us to create programs for things it would be impossible to write out explicitly as a classical program. In particular, our current ability to create classifiers and generators for unstructured data like images would have been unimaginable a couple of decades ago, yet we’ve already begun to take it for granted.

      So while it’s important to temper expectations that we are a long way from ever seeing anything resembling AGI as it’s typically conceived of, oversimplifying all neural models as being “just” line fitting blinds you to the true power and generality that such a framework of manifold learning through optimization represents - as it relates to information theory, energy and entropy in the brain, engineering applications, and the nature of knowledge itself.

      • 👍Maximum Derek👍@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        edit-2
        1 year ago

        Ok, it’s a best fit line on an n-dimentional matrix querying a graphdb ;)

        My only point is that this isn’t AGI and too many people still fail to recognize that. Now people are becoming disillusioned with it because they’re realizing it isn’t actually creative. It’s still still just a fancy comparison engine. That’s not not world changing, but it’s also not Data from Star Trek

        • Prager_U@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 year ago

          I get that, but what I’m saying is that calling deep learning “just fancy comparison engine” frames the concept in an unnecessarily pessimistic and sneery way. It’s more illuminating to look at the considerable mileage that “just pattern matching” yields, not only for the practical engineering applications, but for the cognitive scientist and theoretician.

          Furthermore, what constitutes being “actually creative”? Consider DeepMind’s AlphaGo Zero model:

          Mok Jin-seok, who directs the South Korean national Go team, said the Go world has already been imitating the playing styles of previous versions of AlphaGo and creating new ideas from them, and he is hopeful that new ideas will come out from AlphaGo Zero. Mok also added that general trends in the Go world are now being influenced by AlphaGo’s playing style.

          Professional Go players and champions concede that the model developed novel styles and strategies that now influence how humans approach the game. If that can’t be considered a true spark of creativity, what can?