Roman Leventov

An independent researcher, blogger, and philosopher writing about intelligence and agency (esp. Active Inference), alignment, ethics, the interaction of the AI transition with sociotechnical risks (epistemics, economics, human psychology), collective mind architecture, and research strategy and methodology.

Twitter: https://twitter.com/leventov. E-mail: leventov.ru@gmail.com (the preferred mode of communication). I'm open to collaborations and work.

Presentations at meetups, workshops and conferences, some recorded videos.

I'm a founding member of the Gaia Consortium, which is on a mission to create a global, decentralised system for collective sense-making and decision-making, i.e., civilisational intelligence. Drop me a line if you want to learn more about it and/or join the consortium.

You can help boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber to my Substack (though I don't post anything paywalled; in fact, there I just syndicate my LessWrong writing).

For Russian speakers: the Russian-language AI safety network, Telegram group

Sequences

A multi-disciplinary view on AI safety

Comments

if the next generation of models do pose an x-risk, we've mostly already lost—we just don't yet have anything close to the sort of regulatory regime we'd need to deal with that in place

Do you think that if Anthropic (or another leading AGI lab) unilaterally went out of its way to prevent building agents on top of its API, this would reduce the overall x-risk/p(doom)? I'm asking because here you seem to assume a defeatist position that only governments are able to shape the actions of the leading AGI labs (which, by the way, are very few: in my understanding, only 3 or 4 labs have any chance of releasing a "next generation" model within the next two years; others won't reach that level of capability even if they tried). Yet in the post you advocate for the opposite: voluntary actions taken by the labs, with regulation following.

If the external process is predictable, the LLM will move to parts of the state space that best account for the effects of the environment and its model of the most likely sequences (loosely analogous to a Bayesian posterior).

I think it would be more accurate to say that the dynamics of internal states of LLMs parameterise not just a model of token sequences but a model of the world, with token sequences as its sensory manifestation.

I'm sure that LLMs already possess some world models (see Actually, Othello-GPT Has A Linear Emergent World Representation); the question is really only how the structure and mechanics of LLMs' world models differ from the world models of humans.
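A toy illustration of what I mean (my own sketch, not from the quoted comment, with made-up states and probabilities): even a model trained purely on token sequences is, in effect, maintaining a posterior over the hidden world states that make those sequences predictable, with tokens as their sensory manifestation.

```python
import numpy as np

# Hidden "world states" that generate tokens; the observer never sees them directly.
states = ["sunny", "rainy"]
prior = np.array([0.5, 0.5])        # P(world state)
emission = {                        # P(token | world state), illustrative numbers
    "park":     np.array([0.6, 0.1]),
    "umbrella": np.array([0.1, 0.6]),
    "home":     np.array([0.3, 0.3]),
}

def posterior(tokens, prior=prior):
    """P(world state | token sequence) via Bayes' rule, assuming i.i.d. emissions."""
    p = prior.copy()
    for t in tokens:
        p = p * emission[t]
        p = p / p.sum()
    return dict(zip(states, p.round(3)))

print(posterior(["umbrella", "home", "umbrella"]))
# The belief about the hidden world state, not the raw tokens,
# is what makes the next token predictable.
```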

the alignment story for LLMs seems significantly more straightforward, even given all the shoggoth concerns

Could you please elaborate on what you mean by the "alignment story for LLMs" and "shoggoth concerns" here? Do you mean the "we can use nearly value-neutral simulators as we please" story, are you referring to the fact that LLMs are in a way much more understandable to humans than more general RL agents because they use human language, or do you mean something else?

OOD misgeneralisation is unlikely to be a direct x-risk from superintelligence

Overall, I think the issue of causal confusion and OOD misgeneralisation is much more about capabilities than about alignment, especially if we are talking about the long-term x-risk from superintelligent AI, rather than short/mid-term AI risk.

OOD misgeneralisation is absolutely inevitable, due to Gödel's incompleteness of the universe and the fact that the systems that evolve on Earth generally climb up in complexity. Whenever there is a new invention, such as money, the internet, or (future) autonomous AI agents, the civilisation becomes more complex as a whole, and the distributions of many variables change. ("Towards a Theory of Evolution as Multilevel Learning" is my primary source of intuition about this.) In the study of complex systems, there is a postulate that each component (subsystem) is ignorant of the behaviour of the system as a whole and doesn't know the full effect of its actions. This applies to any component, no matter how intelligent. Humans misgeneralise all the time (examples: lead in petrol, the creation of addictive apps such as Instagram, etc.). Superintelligence will misgeneralise, too, though perhaps in ways which are very subtle or even incomprehensible to humans.

Then, it's possible that superintelligence will misgeneralise due to causal confusion on some matter which is critical to humans' survival/flourishing, e.g. something like qualia, human consciousness, and their moral value. And although I don't think this risk is negligible, exactly because superintelligence probably won't have direct experience of or access to human consciousness, I feel this exact failure mode is somewhat minor compared to all the other reasons for which superintelligence might kill us. Anyway, I don't see what we can do about this if the problem is indeed that superintelligence will not have first-hand experience of human consciousness.


Capabilities consequences
1) The model may not make competent predictions out-of-distribution (capabilities misgeneralisation). We discuss this further in ERM leads to causally confused models that are flawed OOD.

Alignment consequences: 
2) If the model is causally confused about objects related to its goals or incentives, then it might competently pursue changes in the environment that either don’t actually result in the reward function used for training being optimised (objective misgeneralisation).

Did you use the term "objective misgeneralisation" rather than "goal misgeneralisation" on purpose? "Objective" and "goal" are synonyms, but "objective misgeneralisation" is hardly used, whereas "goal misgeneralisation" is the standard term.

Also, I think it's worth noting that this distinction between capabilities and goal misgeneralisation is defined within the RL framework. In other frameworks, such as Active Inference, these are the same thing, because there is no ontological distinction between reward and belief.

It might be suspected that OOD generalisation can be tackled in the scaling paradigm by using diverse enough training data, for example, including data sampled from every possible test environment. Here, we present a simple argument that this is not the case, loosely adapted from Remark 1 from Krueger et al. REx:

The reason data diversity isn’t enough comes down to concept shift (change in P(Y|X)). Such changes can be induced by changes in unobserved causal factors, Z. Returning to the ice cream (Y), shorts (X), and sun (Z) example, shorts are a very reliable predictor of ice cream when it is sunny, but not otherwise: P(Y|X, Z=sunny) is high, while P(Y|X, Z=not sunny) is low. Since the model doesn’t observe Z, there is not a single setting of P(Y|X) that will work reliably across different environments with different climates (different P(Z)). Instead, P(Y|X) depends on P(Z), which in turn depends on the climate in the locations where the data was collected. In this setting, to ensure a model trained with ERM can make good predictions in a new “target” location, you would have to ensure that that location is as sunny as the average training location so that P(Y|X) is the same at training and test time. It is not enough to include data from the target location in the training set, even in the limit of infinite training data - including data from other locations changes the overall P(Y|X) of the training distribution. This means that without domain/environment labels (which would allow you to have different P(Y|X) for different environments, even if you can’t observe Z), ERM can never learn a non-causally confused model.
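To make the quoted argument concrete, here is a minimal simulation sketch (the specific probabilities and function names are my own illustrative choices, not from the post): an ERM-style estimate of P(ice cream | shorts) pooled over training locations is tied to how sunny those locations are, and is off in a less sunny target location.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_env(p_sun, n):
    """One 'location': sun is a hidden common cause of both shorts and ice cream."""
    sun = rng.random(n) < p_sun
    shorts = np.where(sun, rng.random(n) < 0.8, rng.random(n) < 0.1)
    ice_cream = np.where(sun, rng.random(n) < 0.7, rng.random(n) < 0.1)
    return shorts, ice_cream

def p_ice_cream_given_shorts(shorts, ice_cream):
    """Empirical P(ice cream | shorts), i.e. what ERM on (shorts -> ice cream) learns."""
    return ice_cream[shorts].mean()

n = 200_000
train = [sample_env(0.9, n), sample_env(0.8, n)]   # mostly sunny training locations
target = sample_env(0.2, n)                        # rarely sunny target location

shorts_tr = np.concatenate([s for s, _ in train])
ice_tr = np.concatenate([i for _, i in train])

print("ERM estimate on the training mix:",
      round(p_ice_cream_given_shorts(shorts_tr, ice_tr), 3))   # ~0.69
print("Same conditional in the target location:",
      round(p_ice_cream_given_shorts(*target), 3))             # ~0.50
```

The pooled conditional is a P(Z)-weighted mixture, so it tracks the training mix of climates rather than any single location.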

Maybe I'm missing something obvious, but this argument looks wrong to me, or else it assumes that the learning algorithm is not allowed to discover additional (conceptual, abstract, hidden, implicit) variables in the training data; this is false for deep neural networks (though true for random forests). A deep neural network can discover variables that are not present in the data but are probable confounders of several other variables, such as "something that is a confounder of shorts, sunscreen, and ice cream".
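As a sketch of what I mean (toy numbers of my own; a simple latent class model fit with EM standing in for what a more flexible learner could discover implicitly): given several observables that share a hidden cause, a learner can recover a variable that tracks that hidden confounder without ever observing it.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 50_000

# Hidden confounder, never shown to the learner.
sun = rng.random(n) < 0.5

def proxy(p_if_sun, p_if_not):
    return np.where(sun, rng.random(n) < p_if_sun, rng.random(n) < p_if_not)

# Three observed proxies, conditionally independent given `sun`.
X = np.column_stack([proxy(0.8, 0.1),    # shorts
                     proxy(0.7, 0.05),   # sunscreen
                     proxy(0.7, 0.1)])   # ice cream
X = X.astype(float)

# EM for a binary latent class model: posit one hidden cause z and fit
# P(z) and P(x_j | z) from the observations alone.
pi = 0.5
theta = rng.uniform(0.3, 0.7, size=(2, 3))   # theta[z, j] = P(x_j = 1 | z)
for _ in range(200):
    # E-step: responsibilities P(z = 1 | x)
    log_p1 = np.log(pi) + X @ np.log(theta[1]) + (1 - X) @ np.log(1 - theta[1])
    log_p0 = np.log(1 - pi) + X @ np.log(theta[0]) + (1 - X) @ np.log(1 - theta[0])
    r = 1.0 / (1.0 + np.exp(log_p0 - log_p1))
    # M-step
    pi = r.mean()
    theta[1] = (r[:, None] * X).sum(0) / r.sum()
    theta[0] = ((1 - r)[:, None] * X).sum(0) / (1 - r).sum()

# The inferred latent class lines up with the true hidden confounder
# (up to label switching).
agreement = max(((r > 0.5) == sun).mean(), ((r < 0.5) == sun).mean())
print(f"agreement between inferred latent and true 'sun': {agreement:.2f}")
```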

Discovering such hidden confounders doesn't give interventional capacity: Mendel discovered genetic inheritance factors, but without observing them, he couldn't intervene on them. Only the discovery of DNA and later the invention of gene editing technology allowed intervention on genetic factors.

One can say that discovering hidden confounders merely extends what should be considered the in-distribution environment. But then, what is OOD generalisation, anyway? And can't we prove that ERM (or any other training method whatsoever) will create models that will sometimes fail, simply because there is Gödel's incompleteness in the universe?

While this model might not make very good predictions, it will correctly predict that getting you to put on shorts is not an effective way of getting you to want ice cream, and thus will be a more reliable guide for decision-making (about whether to wear shorts).

I don't understand the italicised part of this sentence. Why will P(shorts, ice cream) be a reliable guide to decision-making?

(a, ii =>)

What do these symbols in parens before the claims mean?

My current favourite notion of agency, primarily based on Active Inference, which I refined upon reading "Discovering Agents", is the following:

Agency is a property of a physical system from some observer’s subjective perspective. It stems from the observer’s generative model of the world (including the object in question), specifically whether the observer predicts the agent's future trajectory in the state space by assuming that the agent has its own generative model which the agent uses to act. The agent's own generative model also depends on (adapts to, is learned from, etc.) the agent's environment. This last bit comes from "Discovering Agents".

"Having own generative model" is the shakiest part. It probably means that storage, computation, and maintenance (updates, learning) of the model all happen within the agent's boundaries: if not, the agent's boundaries shall be widened, as in the example of "thermostat with its creation process" from "Discovering Agents". The storage and computational substrate of the agent's generative model is not important: it could be neuronal, digital, chemical, etc.

Now, the observer models the generative model inside the agent. Here's where the Vingean veil comes from: if the observer has perfect observability of the agent's internals, it can believe that its model of the agent exactly matches the agent's own generative model, but usually the match will be less than perfect, due to limited observability.

However, even perfect observability doesn't guarantee safety: the generative model might be large and effectively incompressible (the halting problem), so the only way to see what it will do may be to execute it.

Theory of mind is closely related to all of the above, too.