AI ALIGNMENT FORUM
GPT
• Applied to Fix simple mistakes in ARC-AGI, etc. by Oleg Trott 18d ago
• Applied to Getting 50% (SoTA) on ARC-AGI with GPT-4o by Vanessa Kosoy 1mo ago
• Applied to Is This Lie Detector Really Just a Lie Detector? An Investigation of LLM Probe Specificity. by Josh Levy 2mo ago
• Applied to Do Not Mess With Scarlett Johansson by Vanessa Kosoy 2mo ago
• Applied to GPT-4o My and Google I/O Day by Tobias D. 2mo ago
• Applied to How is GPT-4o Related to GPT-4? by Joel Burget 2mo ago
• Applied to OpenAI releases GPT-4o, natively interfacing with text, voice and vision by Tobias D. 2mo ago
• Applied to Navigating LLM embedding spaces using archetype-based directions by mwatkins 3mo ago
• Applied to What's up with all the non-Mormons? Weirdly specific universalities across LLMs by mwatkins 3mo ago
• Applied to Barcoding LLM Training Data Subsets. Anyone trying this for interpretability? by right..enough? 3mo ago
• Applied to Language and Capabilities: Testing LLM Mathematical Abilities Across Languages by Ethan Edwards 4mo ago
• Applied to GPT, the magical collaboration zone, Lex Fridman and Sam Altman by Bill Benzon 4mo ago
• Applied to Is analyzing LLM behavior a valid means for assessing potential consciousness, as described by global workspace theory and higher order theories? by Amelia 5mo ago
• Applied to Can RLLMv3's ability to defend against jailbreaks be attributed to datasets containing stories about Jung's shadow integration theory? by Miguel de Guzman 5mo ago
• Applied to What experiment settles the Gary Marcus vs Geoffrey Hinton debate? by Valentin Baltadzhiev 5mo ago