AI ALIGNMENT FORUM
AF

117

PeterMcCluskey — AI Alignment Forum

PeterMcCluskey

000

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by

No posts to display.

No wikitag contributions to display.

LeCun’s “A Path Towards Autonomous Machine Intelligence” has an unsolved technical alignment problem

PeterMcCluskey2y10

I haven't thought this out very carefully. I'm imagining a transformer trained both to predict text, and to predict the next frame of video.

Train it on all available videos that show realistic human body language.

Then ask the transformer to rate on a numeric scale how positively or negatively a human would feel in any particular situation.

This does not seem sufficient for a safe result, but implies that LeCun is less nutty than your model of him suggests.

LeCun’s “A Path Towards Autonomous Machine Intelligence” has an unsolved technical alignment problem

PeterMcCluskey2y10

Why assume LeCun would use only supervised learning to create the IC module?

If I were trying to make this model work, I'd use mainly self-supervised learning that's aimed at getting the module to predict what a typical human would feel. (I'd also pray for a highly multipolar scenario if I were making this module immutable when deployed.)