Quantilization

Edited by plex, Mateusz Bagiński, et al. last updated 2nd Dec 2024

A Quantilizer is a proposed AI design that aims to reduce the harms from Goodhart's law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over actions. It is more of a theoretical tool for exploring ways around these problems than a practical buildable design.

AI ALIGNMENT FORUM
AF

AI ALIGNMENT FORUM
AF

Quantilization

See also

Quantilization

See also