The problem with AI alignment is that humans aren't aligned

preasket@lemy.lol · edit-2 1 year ago

The problem with AI alignment is that humans aren't aligned

Rhaedas@kbin.social · 1 year ago

To continue the thought, even if the alignment problem within AI could be solved (I don’t think it can fully), who is developing this AI and determining it matched up with human needs? Just listening to the experts both acknowledge the issues and dangers and in the next sentence speculate “but if we can do it” fantasies is always concerning. Yet another example of a few determining the rest of humanity’s future with very high risks. Our best luck would be if AGI and beyond simply isn’t possible, and even then the “dumb” AI still have similar misalignment issues - we see them in current language models, and yet ignore the flags to make things more powerful.

I forgot to add - I’m totally on the side of our AI overlords and Roko’s Basilisk.

JunctionSystem@lemmy.world · 1 year ago

C: AGI is possible. If it weren’t, we wouldn’t exist. The laws of physics permit the creation of conscious agents, therefore it is possible for one to be deliberately engineered.

Zo0@feddit.de · 1 year ago

That’s a future problem for general AI. Right now it’s still very difficult to make an AI in a specific subject that does it’s job perfectly. That’s why even the commercial AI that we have are (should be) treated more like an ‘Assistant’

preasket@lemy.lol · 1 year ago

Sure, tbh, I think ChatGPT is overhyped. It can be useful, but it’s nowhere near AGI. I even have a controversial opinion that the rate of progress will not be exponential - it will be logarithmic, because, I think, the data will be the constraint.

Zo0@feddit.de · 1 year ago

I’m not gonna go too deep into it because I’m not qualified to, but I think the issue currently at hand, is that we’re throwing stuff at the wall to see what sticks. Most of the AI models currently used in different branches are being used because they showed promise in the original problem they were designed for. All these tools you see today were more or less designed over than 30 years ago. There’s a lot of interesting stuff being done at an academic level today but we (understandably so) don’t see those in an everyday conversation

fubo@lemmy.world · edit-2 1 year ago

Some of the human-alignment projects look like “religions” and some look like “economies” and some look like “just talking to each other and trying to be halfway decent folks and not flipping out or some shit”.

Heck, arguably the United Nations is a human-alignment project for x-risk mitigation.

DeVaolleysAdVocate@lemmy.world · 1 year ago

We’d like to bring all those and their existing versions together with the A-Better-World Consensus-Engine idea.

Tell me more about some of these other projects though please.

Quatity_Control@lemm.ee · 1 year ago

Align means two very different things here, despite being the same word.

preasket@lemy.lol · edit-2 1 year ago

Does it? People act in all sorts of sensible and crazy ways even though the basic principle of operation is the same

Quatity_Control@lemm.ee · 1 year ago

What loss function do you want AI to align on?

If I have a language model AI and an AI designed to function as a nurse, what are they going to align on?