AI ALIGNMENT FORUM
AF

599
Wikitags

Truthful AI

This page is a stub.
Subscribe
Discussion
Subscribe
Discussion
Posts tagged Truthful AI
31Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses
TurnTrout
9mo
0
39New, improved multiple-choice TruthfulQA
Owain_Evans, James Chua, Steph Lin
9mo
0
16How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
Owain_Evans
2y
0
8Truthfulness, standards and credibility
Joe Collman
4y
2
18Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Felix Hofstätter, Francis Rhys Ward, HarrietW, LAThomson, Ollie J, Patrik Bartak, Sam F. Brown
2y
0
Add Posts