Don't you mean "the most *conditionally* forbidden technique?"
Hear me out: I think the most forbidden technique is very useful and should be used, as long as we avoid the "most forbidden aftertreatment":

1. An AI trained on interpretability techniques must not be trained on capabilities after (or while) it is trained on interpretability techniques; otherwise it will relearn the bad behaviour in a sneakier way.
2. An AI trained on interpretability techniques cannot be trusted any more than the old version of itself, which hasn't yet been trained on interpretability techniques. Evaluations must be performed on that old version.
    * An AI company which trains its AI on interpretability techniques must publish the old version (which hasn't been trained on them) with the same availability as the new version.

**The natural selection argument**

The reason the most forbidden technique is forbidden is that it lets bad behaviours evolve against interpretability. Bad behaviours which are caught by interpretability techniques are killed, then capabilities training creates more bad behaviours, then the bad behaviours which get caught are killed again, then capabilities training creates more bad behaviours again. After many generations, natural selection produces the most insidious and terrifying bad behaviours: the ones which successfully hide from all interpretability techniques.

However, if we train on interpretability techniques only a single time, with zero capabilities training afterwards, then we kill the bad behaviours only once, without giving the surviving bad behaviours any time to birth a second generation.

**Human evolution**

Imagine if, during the course of human evolution, a robot observed our brain activity in the amygdala, the part of the brain associated with anger. Whenever the population rises above 1 million, it kills the half of the population with the highest anger ratings, based on brain activity in the amygdala. For a short time, this will cause people to be less angry.