Archetypal Transfer Learning - AI Alignment Forum

Archetypal Transfer Learning (ATL) is a proposal by @whitehatStoic for what is argued by the author to be a fine tuning approach that "uses archetypal data" to "embed Synthetic Archetypes". These Synthetic Archetypes are derived from patterns that models assimilate from archetypal data, such as artificial stories. The method yielded a shutdown activation rate of 50.67% in the GPT-2-XL model after fine-tuning.

The team, consisting of @MiguelDev, @marc/er, @Abhay Chowdhry, Mazianni and @Linda Linsefors is working to improve the build to a 100%.

...

(Read More)