Touch reality as soon as possible (when doing machine learning research)

[-]Neel Nanda3y911

Thanks for writing this post! (And man, if this is you deliberately writing fast and below your standards, you should lower your standards way more!). I very strongly agree with this within mechanistic interpretability and within pure maths (and it seems probably true in ML and in life generally, but those are the two areas I feel vaguely qualified to comment on).

Aversion to Schlepping

Man, I strongly relate to this one... There have been multiple instances of me having an experiment idea I put off for days to weeks, only to do it in 1-3 hours and get really useful results. I've had some success experimenting with things like speedrunning afternoons, where I drop all of my ongoing tasks, try to pick a self-contained thing that seems high priority, and sprint on getting it done ASAP (this doesn't work well for day to week schleppy tasks, but I'm more OK with sucking at those)

Under why touch reality, IMO the most important reason is that it'll help you form ideas that are good! It's much much easier to do this when you have a lot of surface area on what's actually going on, and enough experience and loose threads to spark curiosities and new ideas.

Under why don't people touch reality, honestly the strongest reason for me is just procrastination/lacking urgency (which is somewhat aversion to schlepping, but less central) - even if I know exactly what it'd be sensible to do, there's rarely a reason to do it right now rather than later.

Some more strategies I like for touching reality faster (there's some overlap with yours):

Try explaining your understanding to other people. Notice when you're confused about a concept, and go and try to figure out what's going on (ideally by building some kind of toy model and coding something yourself)
Meta strategy - learn how to use good tooling, debug issues in your workflow, and just practice running a lot of quick experiments. I find that being able to test a hypothesis about GPT-2 Small in a few minutes makes it much easier to touch reality, in a way that I just wouldn't if it took hours to days. Even if the difference in time isn't that stark, the more you have the right muscle memory, the lower the activation energy
Try to Murphyjitsu your ideas - assume things will go wrong, or that there's some crucial flaw in your beliefs, and use your intuition to fill in the blanks re why. Use this to generate ideas to try falsifying your plan

[-]LawrenceC3y32

Thanks!

just procrastination/lacking urgency

This is probably true in general, to be honest. However, it's an explanation for why people don't do anything, and I'm not sure this differentially leads to delaying contact with reality more than say, delaying writing up your ideas in a Google doc.

Some more strategies I like for touching reality faster

I like the "explain your ideas to other people" point, it seems like an important caveat/improvement to the "have good collaborators" strategy I describe above. I also think the meta strategy point of building a good workflow is super important!

[-]Neel Nanda3y41

I like the "explain your ideas to other people" point, it seems like an important caveat/improvement to the "have good collaborators" strategy I describe above

Importantly, the bar for "good person to explain ideas to" is much lower than the bar for "is a good collaborator". Finding good collaborators is hard!

[-]LawrenceC1y50Review for 2023 Review

I think this post was useful in the context it was written in and has held up relatively well. However, I wouldn't actively recommend it to anyone as of Dec 2024 -- both because the ethos of the AIS community has shifted, making posts like this less necessary, and because many other "how to do research" posts were written that contain the same advice.

Background

This post was inspired by conversations I had in mid-late 2022 with MATS mentees, REMIX participants, and various bright young people who were coming to the Bay to work on AIS (collectively, "kiddos"). The median kiddo I spoke with had read a small number of ML papers and a medium amount of LW/AF content, and was trying to string together an ambitious research project from several research ideas they recently learned about. (Or, sometimes they were assigned such a project by their mentors in MATS or REMIX.)

Unfortunately, I don't think modern machine learning is the kind of field where research consistently works out of the box. Many high-level claims, even in published research papers, are just... wrong; it can be challenging to reproduce results even when they are right; and even techniques that work reliably may not work for the reasons people think they do.

Hence, this post.

What do I think of the content of the post?

I think the core idea of this post held up pretty well with time. I continue to think that making contact with reality is very important, and I think the concrete suggestions for how to make contact with reality are still pretty good.

If I were to write it today, I'd probably add a fifth major reason for why it's important to make quick contact with reality: mental health/motivation. That is, producing concrete research outputs, even small ones, feels pretty essential to maintaining motivation for the vast majority of researchers. My guess is I missed this factor because I focused on the content of research projects, as opposed to the people doing the research.

Where do I feel the post stands now?

Over the past two years, and especially in 2024, the ethos of the AIS community has shifted substantially toward empirical work.

The biggest part of this is the pace of AI. When this post was written, ChatGPT was a month old, and GPT-4 was still more than 2 months away. People both had longer timelines and thought of AIS in more conceptual terms. Many conceptual research projects of 2022 have fallen into the realm of the empirical as of late 2024.

Part of this is due to the rise of (dangerous capability) evals as a major AIS focus in 2023, which is both substantially more empirical compared to the median 2022 AIS research topic, and an area where making contact with reality can be as simple as "pasting a prompt into claude.ai".

Part of this is due to Anthropic's rise to being the central place for AIS researchers. "Being able to quickly produce ML results" is a major part of what it takes to get hired there as a junior researcher, and people know this.

Finally, there have been a decent number of posts or write-ups giving the same advice, e.g. Neel's written advice for his MATS scholars and a recent Alignment Forum post by Ethan Perez.

As a result, this post feels much less necessary or relevant in late December 2024 than in December 2022.

[-]Scott Emmons3y36

Thanks for writing this! I appreciate it and hope you share more things that you write faster without totally polishing everything.

One word of caution I'd share is: beware of spending too much effort running experiments on toy examples. I think toy examples are useful to gain conceptual clarity. However, if your idea is primarily empirical (such as an improvement to a deep neural network architecture), then I would recommend spending basically zero time running toy experiments.

With deep learning, it's often the case that improvements on toy examples don't scale to being improvements on real examples. In my experience, lots of papers in reinforcement learning don't actually work because the authors only tried out the method on toy examples. (Or, they tried out the method on more complex examples, but they didn't publish those experiments because the method didn't work.) So trying out a new empirical method on a toy example provides little information about how valuable the empirical method will be on real examples.

The flipside of this warning is advice: for empirical projects, test your idea on as diverse and complex a set of tasks as is possible. The good empirical ideas are few, and extensive empirical testing is the best way a researcher can determine if their idea will stand the test of time.

When running diverse and complex experiments, it is still important to design the simplest possible experiment that will be informative, as Lawrence describes in the section "Mock or simplify difficult components." I suggest being simple (such as Lawrence's example of using text-davinci-003 instead of finetuning one's own model) rather than being toy (using a tiny or hard-coded language model).

[-]LawrenceC3y30

I think this is a good word of caution. I'll edit in a link to this comment.

[-]gwern2y20

Unfortunately, it turned out that Bayesian neural networks were significantly trickier to get working in practice on the value learning tasks we were working with, and nothing came of the project despite several months of effort. A few months later, a research engineer at CHAI found that many Bayesian neural network algorithms (including the one we were using for our project) often failed to approximate some toy 4-d distributions

Was that ever written up? I don't recall that result.

[-]LawrenceC2y10

I don't think so, unfortunately, and it's been so long that I don't think I can find the code, let alone get it running.

After I published this post, Sam Toyer pointed me at Michael Bernstein's concepts of vectoring (identifying a key direction of uncertainty) and velocity (quickly iterating on ideas by testing directions of uncertainty), which seems like a good breakdown of how to touch reality.

Detailed epistemic status: I'm pretty frustrated with how slow I write, so this is an experiment in writing fast as opposed to carefully. That being said, this is ~the prevailing wisdom amongst many ML practitioners and academics, and similar ideas have been previously discussed in the LessWrong/Alignment Forum communities, so I'm pretty confident that it's directionally correct. I also believe (less confidently) that this is good advice for most kinds of research or maybe even for life in general.

As Michael Dennis pithily puts it, this is the point at which the process goes from only you correcting the theory, to the theory being able to correct you.

Famously, you don’t even need the RNN parts, you only need attention.

Though, to be fair, there were other circumstances - it was during the pandemic and I was feeling incredibly gloomy in general.

(Edited to add:) That being said, as Scott Emmons points out in a comment below, it's important to not just have results on toy examples!

AI ALIGNMENT FORUM