x

AI ALIGNMENT FORUM

AF

AI Capabilities — AI Alignment Forum

AI Capabilities

Edited by plex last updated 29th Aug 2021

AI Capabilities are the growing abilities of AIs to act effectively in increasingly complex environments. It is often compared to to AI Alignment, which refers to efforts to ensure that these effective actions taken by AIs are also intended by the creators and beneficial to humanity.

Add Posts

Posts tagged AI Capabilities

6

50EfficientZero: human ALE sample-efficiency w/MuZero+self-supervised

4y

26

2

51[Paper] Stress-testing capability elicitation with password-locked models

Fabien Roger, ryan_greenblatt

2y

5

2

30A small update to the Sparse Coding interim research report

Lee Sharkey, Dan Braun, beren

3y

5

2

33Memorizing weak examples can elicit strong behavior out of password-locked models

Fabien Roger, ryan_greenblatt

2y

3

2

64EfficientZero: How It Works

4y

2

2

79Measuring no CoT math time horizon (single forward pass)

ryan_greenblatt

4mo

4

2

65Recent LLMs can use filler tokens or problem repeats to improve (no-CoT) math performance

ryan_greenblatt

4mo

6

2

41My picture of the present in AI

ryan_greenblatt

21d

10

2

53Recent LLMs can do 2-hop and 3-hop latent (no-CoT) reasoning on natural facts

ryan_greenblatt

4mo

0

1

42We have achieved Noob Gains in AI

4y

0

1

34Interpreting Yudkowsky on Deep vs Shallow Knowledge

4y

2

1

45Evaluations project @ ARC is hiring a researcher and a webdev/engineer

4y

6

1

42The alignment problem in different capability regimes

5y

12

1

26Capabilities and alignment of LLM cognitive architectures

3y

0

1

33PaLM in "Extrapolating GPT-N performance"

Lukas Finnveden

4y

15

Load More (15/37)

Add Posts