x

AI ALIGNMENT FORUM

AF

David Guzman Piedrahita — AI Alignment Forum

David Guzman Piedrahita

David Guzman Piedrahita

Message

33

Ω

13

1

1

1y

David Guzman Piedrahita

33

Ω

13

1y

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games

Summary: * Traditional LLMs outperform reasoning models in cooperative Public Goods tasks. Models like Llama-3.3-70B maintain ~90% contribution rates in public goods games, while reasoning-focused models (o1, o3 series) average only ~40%. * We observe an "increased tendency to escape regulations" in reasoning models. As models improve in analytical capabilities,...

Apr 22, 2025•24