AI ALIGNMENT FORUM
AF

Stephen Fowler
Ω24110
Message
Dialogue
Subscribe

Masters student in Physics at the University of Queensland.

I am interested in Quantum Computing, physical AI Safety guarantees and alignment techniques that will work beyond contemporary models.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
2Stephen Fowler's Shortform
3y
0
No wikitag contributions to display.
Fixing The Good Regulator Theorem
Stephen Fowler3y*41

"Good regulator" is used here to mean that it is good at keeping the output "regular". That is, reducing the entropy instead of "a regulator which produces good".

On page 4 the authors acknowledge that there are numerous ways by which one could consider a regulator successful, then go on to say "In this paper we shall use the last measure, H(Z), and we define ‘successful regulation’ as equivalent, to ‘H(Z) is minimal’."

The fact that reducing entropy doesn't line up with maximizing utility is true, but the authors never claimed it did. Reducing entropy generalises to a lot of real world problems.

Reply
10Scaffolded LLMs: Less Obvious Concerns
2y
0
7How Do We Align an AGI Without Getting Socially Engineered? (Hint: Box It)
3y
1
13Race Along Rashomon Ridge
3y
0