This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
Truthful AI
•
Applied to
How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
by
Owain Evans
1d
ago
•
Applied to
Benchmark Study #2: TruthfulQA (Task, MCQ)
by
Bruce W. Lee
3mo
ago
•
Applied to
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
by
Felix Hofstätter
5mo
ago
•
Applied to
A tension between two prosaic alignment subgoals
by
Ruben Bloom
1y
ago
•
Applied to
Truthfulness, standards and credibility
by
Ruben Bloom
2y
ago
•
Created by
Ruben Bloom
at
2y