AI ALIGNMENT FORUM
AF

wuthejeff
Ω2010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
ProLU: A Nonlinearity for Sparse Autoencoders
wuthejeff1y*30

This is great!  We were working on very similar things concurrently at OpenAI but ended up going a slightly different route. 

A few questions:
- What does the distribution of learned biases look like?
- For the STE variant, did you find it better to use the STE approximation for the activation gradient, even though the approximation is only needed for the bias?

Reply
No posts to display.