Jobst Heitzig — AI Alignment Forum

[Aspiration-based designs] Outlook: dealing with complexity

Summary. This teaser post sketches our current ideas for dealing with more complex environments. It will ultimately be replaced by one or more longer posts describing these in more detail. Reach out if you would like to collaborate on these issues. Multi-dimensional aspirations. For real-world tasks that are specified in...

Apr 28, 2024 · 13

[Aspiration-based designs] 3. Performance and safety criteria, and aspiration intervals

Summary. In this post, we extend the basic algorithm by adding criteria for choosing the two candidate actions the algorithm mixes, and by generalizing the goal from making the expected Total equal a particular value to making it fall into a particular interval. We only use simple illustrative examples of...

Apr 28, 2024 · 10

[Aspiration-based designs] 2. Formal framework, basic algorithm

Summary. In this post, we present the formal framework we adopt during the sequence, and the simplest form of the type of aspiration-based algorithms we study. We do this for a simple form of aspiration-type goals: making the expectation of some variable equal to some given target value. The algorithm...

Apr 28, 2024 · 18

[Aspiration-based designs] 1. Informal introduction

by B Jacobs, Jobst Heitzig, Simon Fischer, and Simon Dima

Sequence Summary. This sequence documents research by SatisfIA, an ongoing project on non-maximizing, aspiration-based designs for AI agents that fulfill goals specified by constraints ("aspirations") rather than maximizing an objective function. We aim to contribute to AI safety by exploring design approaches and their software implementations that we believe...

Apr 28, 2024 · 50

Aspiration-based Q-Learning

by Clément Dumas and Jobst Heitzig

Work completed during a two-month internship supervised by @Jobst Heitzig. Thanks to Phine Schikhof for her invaluable conversations and friendly support during the internship, and to Jobst Heitzig, who was an amazing supervisor. Epistemic Status: I dedicated two full months to working on this project. I conducted numerous experiments to...

Oct 27, 2023 · 38