AI ALIGNMENT FORUM
GPT
• Applied to GPT, the magical collaboration zone, Lex Fridman and Sam Altman by Bill Benzon 11d ago
• Applied to Is analyzing LLM behavior a valid means for assessing potential consciousness, as described by global workspace theory and higher order theories? by Amelia 18d ago
• Applied to Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory? by Miguel de Guzman 1mo ago
• Applied to What experiment settles the Gary Marcus vs Geoffrey Hinton debate? by Valentin Baltadzhiev 1mo ago
• Applied to Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish) by RP 2mo ago
• Applied to Implementing activation steering by Annah 2mo ago
• Applied to Requirements for a Basin of Attraction to Alignment by Roger Dearnaley 2mo ago
• Applied to The case for more ambitious language model evals by Arun Jose 2mo ago
• Applied to Putting multimodal LLMs to the Tetris test by Lovre 2mo ago
• Applied to ' petertodd'’s last stand: The final days of open GPT-3 research by mwatkins 2mo ago
• Applied to OpenAI Credit Account (2510$) by Emirhan BULUT 2mo ago
• Applied to Maybe talking isn't the best way to communicate with LLMs by Manav Rathi 2mo ago