Ought will host a factored cognition “Lab Meeting” on Friday, September 16, from 9:30 to 10:30 AM PT.

We'll share the progress we've made using language models to decompose reasoning tasks into subtasks that are easier to perform and evaluate. This is part of our work on supervising process, not outcomes. It's easier for us to show you than to tell you about it in a post (though written updates will hopefully follow).

Then, we'll cover outstanding research directions we see and plan to work on, many of them almost shovel-ready. If the alignment community can parallelize this work across different alignment research teams, we can make progress faster. We'd love to coordinate with other alignment researchers thinking about task decomposition, process supervision, factored cognition, and IDA-like approaches (where it's efficient to do so). We want to save you time and mistakes if we can!
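To make the core idea concrete: factored cognition replaces one end-to-end model call with a tree of smaller calls, where a question is decomposed into subquestions, each subquestion is answered independently, and the sub-answers are composed into a final answer. Here is a minimal, hypothetical sketch of that pattern. The `decompose`, `answer_leaf`, and composition steps are hardcoded stubs standing in for language model calls; none of this is Ought's actual implementation.

```python
def decompose(question: str) -> list[str]:
    """Stub 'decomposition' model: split a conjunctive question into parts.
    In a real system this would be a language model call."""
    if " and " in question:
        return [part.strip() + "?" for part in question.rstrip("?").split(" and ")]
    return []  # no further decomposition: treat as a leaf


def answer_leaf(question: str) -> str:
    """Stub 'answerer' model with canned answers for leaf subquestions."""
    canned = {
        "Was the trial randomized?": "yes",
        "Was there a placebo control?": "no",
    }
    return canned.get(question, "unknown")


def answer_by_amplification(question: str) -> str:
    """Recursively decompose a question, answer the leaves, and compose
    the sub-answers. Each intermediate step is inspectable, which is what
    makes process-based supervision possible."""
    subquestions = decompose(question)
    if not subquestions:
        return answer_leaf(question)
    sub_answers = [answer_by_amplification(q) for q in subquestions]
    # Stub 'composition' step: join labeled sub-answers into one answer.
    return "; ".join(f"{q} {a}" for q, a in zip(subquestions, sub_answers))


print(answer_by_amplification(
    "Was the trial randomized and Was there a placebo control?"))
# → Was the trial randomized? yes; Was there a placebo control? no
```

The point of the structure, rather than the stubs, is that a supervisor can evaluate each decomposition and each leaf answer separately instead of only judging the final output.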

What is the agenda?

  1. 30 min | Updates on Ought’s work decomposing reasoning tasks
    1. The specific alignment problems we’re trying to solve and our vision of a solution that mitigates these risks (see also: Supervise Process, not Outcomes)
    2. Early progress on decomposing reasoning about evidence quality in randomized controlled trials
    3. Our tools for building and debugging reasoning traces of language models (preview)
  2. 15 min | Related research directions we’re excited about and how they fit in, e.g.
    1. Automating evaluation through critique models or verifier models
    2. Distillation
    3. Comparing scaling trends for process-based systems vs. end-to-end systems
    4. Testing process-based systems for adversarial robustness
  3. 15 min | Q&A

There will be more to discuss than we can fit into an hour. We’ll get to what we can and consider making this a regular meeting if there’s appetite (likely with more sharing from other researchers)! 

Who should attend?

You should attend if:

  1. You are interested in Ought’s research and want updates. 
  2. You want to build on Ought's learnings from doing this research.
  3. You want to use our tools for running and debugging compositional language model tasks.
  4. You want concrete research ideas in this domain. 
  5. You are not a researcher but want to learn how other backgrounds can support this work (engineers can build debugging infrastructure, non-ML researchers can help create datasets, etc.). 

How can I attend? 

You can register for the Lab Meeting here. Email jungwon@ought.org if you have any questions!

The meeting will be recorded & shared. 


The video from the factored cognition lab meeting is up:

Description:

Ought cofounders Andreas and Jungwon describe the need for process-based machine learning systems. They explain Ought's recent work decomposing questions to evaluate the strength of findings in randomized controlled trials. They walk through ICE, a beta tool used to chain language model calls together. Lastly, they walk through concrete research directions and how others can contribute. 

Outline:

00:00 - 2:00 Opening remarks
2:00 - 2:30 Agenda
2:30 - 9:50 The problem with end-to-end machine learning for reasoning tasks
9:50 - 15:15 Recent progress | Evaluating the strength of evidence in randomized controlled trials
15:15 - 17:35 Recent progress | Intro to ICE, the Interactive Composition Explorer
17:35 - 21:17 ICE | Answer by amplification
21:17 - 22:50 ICE | Answer by computation
22:50 - 31:50 ICE | Decomposing questions about placebo
31:50 - 37:25 Accuracy and comparison to baselines
37:25 - 39:10 Outstanding research directions
39:10 - 40:52 Getting started in ICE & The Factored Cognition Primer
40:52 - 43:26 Outstanding research directions
43:26 - 45:02 How to contribute without coding in Python
45:02 - 45:55 Summary
45:55 - 1:13:06 Q&A

The Q&A had lots of good questions.