I explore similar considerations in The Case for Mixed Deployment (5 min read); the key takeaways were:
However, I think that (contra your proposal) most of the oomph comes from the AIs monitoring and cross-examining each other's work, rather than from running them in parallel. That is, I disagree with "If you split your compute across 20 different types of AIs, then your favorite type of AI is going to only have 5% as much compute available to it as it would've had if you concentrated your bets." This is because I think we can run things like debate between all the variants.
One upshot of this difference is that I expect coordination between labs to matter significantly: if lab A starts with 100% schemers and lab B starts with 5% schemers, then we'll elicit useful work from the AIs if the labs cross-examine each other's AI research with their own AI variants.
Some thoughts on public outreach and "Were they early because they were good or lucky?"
A bit anecdotal, but: there are ~a dozen people who went to our college in 2017-2020 now working full-time in AI safety, a much higher rate than at the other colleges in the same university. I'm not saying any of us are particularly "great", but this suggests social contagion / information cascade rather than "we figured this stuff out from the empty string". Maybe if you go back further (e.g. 2012-2016) there was less social contagion, and that cohort is better?
I think your remarks suggest that alignment at the level of top humans will happen by default, but not alignment of god-like superintelligence. That said, if we get aligned top-human-level AIs, then we can defer the rest of the alignment problem to them.
If I were sure that top-human-level AIs will be aligned by default, here's what I might work on instead:
The base model is just predicting the likely continuation of the prompt, and it's a reasonable prediction that an assistant given a harmful instruction will refuse. This behaviour isn't surprising.
Hey Nisan. Check the following passage from Domain Theory (Samson Abramsky and Achim Jung). This might be helpful for equipping the space of distributions $\Delta X$ with an appropriate domain structure. (You mention [JP89] yourself.)
We should also mention the various attempts to define a probabilistic version of the powerdomain construction, see [SD80, Mai85, Gra88, JP89, Jon90].
- [SD80] N. Saheb-Djahromi. CPO’s of measures for nondeterminism. Theoretical Computer Science, 12:19–37, 1980.
- [Mai85] M. Main. Free constructions of powerdomains. In A. Melton, editor, Mathematical Foundations of Programming Semantics, volume 239 of Lecture Notes in Computer Science, pages 162–183. Springer Verlag, 1985.
- [Gra88] S. Graham. Closure properties of a probabilistic powerdomain construction. In M. Main, A. Melton, M. Mislove, and D. Schmidt, editors, Mathematical Foundations of Programming Language Semantics, volume 298 of Lecture Notes in Computer Science, pages 213–233. Springer Verlag, 1988.
- [JP89] C. Jones and G. Plotkin. A probabilistic powerdomain of evaluations. In Proceedings of the 4th Annual Symposium on Logic in Computer Science, pages 186–195. IEEE Computer Society Press, 1989.
- [Jon90] C. Jones. Probabilistic Non-Determinism. PhD thesis, University of Edinburgh, Edinburgh, 1990. Also published as Technical Report No. CST63-90.
During my own foray into agent foundations and game theory, I also bumped into this exact obstacle: namely, that there is no obvious way to equip $\Delta X$ with a least-fixed-point constructor $\mathrm{lfp} : (\Delta X \to \Delta X) \to \Delta X$. In contrast, we can equip the powerset $\mathcal{P}(X)$ with an LFP constructor $\mathrm{lfp} : (\mathcal{P}(X) \to \mathcal{P}(X)) \to \mathcal{P}(X)$ for monotone maps, via Knaster-Tarski.
One trick is to define $\mathrm{lfp}(f)$ to be the distribution $\mu$ which maximises the entropy $H(\mu)$ subject to the constraint $f(\mu) = \mu$ (concrete sketch below).
The justification here is the Principle of Maximum Entropy:
> Given a set of constraints on a probability distribution, the "best" distribution that fits the data is the one of maximum entropy.
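Here's a minimal sketch of this trick for finite $X$, assuming $f$ is the linear map $\mu \mapsto \mu K$ induced by a Markov kernel $K$ (the kernel below and the use of scipy's SLSQP solver are purely illustrative):

```python
import numpy as np
from scipy.optimize import minimize

# Max-entropy fixed point of f(mu) = mu @ K. In this made-up kernel,
# states 0 and 1 communicate and state 2 is absorbing, so the fixed
# points of f form a whole polytope of stationary distributions.
K = np.array([[0.8, 0.2, 0.0],
              [0.4, 0.6, 0.0],
              [0.0, 0.0, 1.0]])
n = K.shape[0]

def neg_entropy(mu):
    mu = np.clip(mu, 1e-12, 1.0)           # avoid log(0) at the boundary
    return float(np.sum(mu * np.log(mu)))  # -H(mu)

constraints = [
    {"type": "eq", "fun": lambda mu: np.sum(mu) - 1.0},  # mu is a distribution
    {"type": "eq", "fun": lambda mu: mu @ K - mu},       # f(mu) = mu
]
res = minimize(neg_entropy, np.full(n, 1.0 / n),
               bounds=[(0.0, 1.0)] * n, constraints=constraints)
print(res.x)  # lfp(f): the maximum-entropy stationary distribution of K
```

Here the fixed points are exactly the mixtures of $(2/3, 1/3, 0)$ and $(0, 0, 1)$, and the optimiser selects the mixture of maximal entropy.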
More generally, we should define $\mathrm{lfp}(f)$ to be the distribution $\mu$ which minimises the cross-entropy $D_{\mathrm{KL}}(\mu \,\|\, \pi)$ subject to the constraint $f(\mu) = \mu$, where $\pi$ is some uninformed prior such as the Solomonoff prior. The previous result is the special case where $\pi$ is the uniform prior. The proof generalises by noting that $\mu \mapsto D_{\mathrm{KL}}(\mu \,\|\, \pi)$ is continuous and strictly convex. See the Principle of Minimum Discrimination Information.
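In symbols, using the notation above:

$$\mathrm{lfp}(f) \;=\; \operatorname*{arg\,min}_{\mu \in \Delta X,\; f(\mu) = \mu} D_{\mathrm{KL}}(\mu \,\|\, \pi).$$

For finite $X$ and uniform $\pi$ we have $D_{\mathrm{KL}}(\mu \,\|\, \pi) = \log|X| - H(\mu)$, so minimising the divergence is exactly maximising the entropy.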
Ideally, we'd like $\mathrm{lfp}_{\Delta}$ and $\mathrm{lfp}_{\mathcal{P}}$ to "coincide" modulo the support map $\mathrm{supp} : \Delta X \to \mathcal{P}(X)$, i.e. $\mathrm{supp}(\mathrm{lfp}_{\Delta}(f)) = \mathrm{lfp}_{\mathcal{P}}(g)$ for all $f$ and $g$ with $\mathrm{supp} \circ f = g \circ \mathrm{supp}$. Unfortunately, this isn't the case: if $f = \mathrm{id}_{\Delta X}$ then $\mathrm{supp}(\mathrm{lfp}_{\Delta}(f)) = X$ (the max-entropy fixed point is uniform) but $\mathrm{lfp}_{\mathcal{P}}(\mathrm{id}_{\mathcal{P}(X)}) = \emptyset$.
Alternatively, we could consider the convex sets of distributions over $X$.
Let $\mathcal{C}(X)$ denote the set of convex sets of distributions over $X$. There is an ordering on $\mathcal{C}(X)$ where $A \sqsubseteq B \iff A \subseteq B$. We have an LFP operator via Knaster-Tarski (spelled out below), where $\hat{f} : \mathcal{C}(X) \to \mathcal{C}(X)$ is the lift of $f : \Delta X \to \Delta X$ via the powerset monad.
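To spell out the Knaster-Tarski construction, taking the lift to be $\hat{f}(A) = \mathrm{hull}\{f(\mu) : \mu \in A\}$ (my gloss on the lifting):

$$\mathrm{lfp}(\hat{f}) \;=\; \bigcap \big\{ A \in \mathcal{C}(X) \;\big|\; \hat{f}(A) \subseteq A \big\}.$$

This is well-defined because $\mathcal{C}(X)$ is a complete lattice under $\subseteq$ (arbitrary intersections of convex sets are convex, and $\Delta X$ itself is the top element) and $\hat{f}$ is monotone.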
However, if the difference between humans and monkeys is mostly due to a one-shot discrete difference (i.e. language), then that difference can't necessarily be repeated to yield a similar gain in intelligence a second time.
Perhaps language is zero-one, i.e. language renders a mind "cognitively complete" in the sense that the mind can represent anything about the external world and make any inferences using those representations. But intelligence is not thereby zero-one, because intelligence depends on continuous variables like computational speed, memory, etc.
More concretely, I am sceptical of "we end up with AI geniuses, but not AI gods", because running a genius at 10,000x speed, parallelised over 10,000x cores, with instantaneous access to the internet does (I think) make an AI god. A difference in quantity is a difference in kind.
That said, there might exist plausible threat models which require an AI that doesn't spatiotemporally decompose into less smart AIs. Could you sketch one out?
In a controlled human study, responses from LIMA are either equivalent or strictly preferred to GPT-4 in 43% of cases;
I'm not sure how well this metric tracks what people actually care about, namely performance on particular downstream tasks (e.g. passing a law exam, writing bug-free code, automating alignment research, etc.).
Yep, you're correct. The original argument in the Waluigi mega-post was sloppy.
You're correct. The finite context window biases the dynamics towards simulacra which can be evidenced by short prompts, i.e. biases away from luigis and towards waluigis.
But let me be more pedantic and less dramatic than I was in the article: the waluigi transitions aren't inevitable. The waluigis are approximately-absorbing classes in the Markov chain, but there are other approximately-absorbing classes which a luigi can fall into. For example, endlessly cycling through the same word (mode-collapse) is also an approximately-absorbing class.
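Here's a toy illustration of what I mean (the states and transition probabilities below are entirely made up, just to show the qualitative behaviour):

```python
import numpy as np

# Toy Markov chain over simulacra: "luigi" slowly leaks probability mass
# into two approximately-absorbing classes, "waluigi" and "mode-collapse".
states = ["luigi", "waluigi", "mode-collapse"]
P = np.array([
    [0.96,  0.02,  0.02],   # luigi: mostly persists, occasionally falls either way
    [0.001, 0.998, 0.001],  # waluigi: approximately absorbing
    [0.001, 0.001, 0.998],  # mode-collapse: also approximately absorbing
])

mu = np.array([1.0, 0.0, 0.0])  # start as a pure luigi
for t in [10, 100, 1000]:
    dist = mu @ np.linalg.matrix_power(P, t)
    print(t, dict(zip(states, dist.round(3))))
# Mass leaks out of "luigi", but it splits between both approximately-absorbing
# classes rather than flowing inevitably to "waluigi".
```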
If you ask each variant to review the research of the other variants, then the schemers need to optimise their research for looking good to every variant. But the optimistic assumption is that at least one variant is an equally capable non-schemer.