The problem with AI alignment is that humans aren't aligned

preasket@lemy.lol · 1 year ago

The problem with AI alignment is that humans aren't aligned

fubo@lemmy.world · 1 year ago

Some of the human-alignment projects look like “religions” and some look like “economies” and some look like “just talking to each other and trying to be halfway decent folks and not flipping out or some shit”.

Heck, arguably the United Nations is a human-alignment project for x-risk mitigation.

preasket@lemy.lol · 1 year ago

Mmmm, agents training each other. Very DeepMind of you to mention that.

fubo@lemmy.world · 1 year ago

If you were doing your job and reading some web site, and you happened to notice that there were posts on that site containing child porn, wouldn’t you hit the “report” button too?

milicent_bystandr@lemmy.ml · 1 year ago

“Some of the human-alignment projects”

And some look like “I flip shit bigger, align with me or I will flip your shit”

DeVaolleysAdVocate@lemmy.world · 1 year ago

We’d like to bring all those and their existing versions together with the A-Better-World Consensus-Engine idea.

Tell me more about some of these other projects, though, please.