Edit: the list is now a public GitHub repository, with all that implies. Added sections by media type. Last update: 6 Jan 2021

Motivation

This is The Best Textbooks on Every Subject, but for visualizations. I greatly adore good visualizations, chiefly because there are so many visualizations that are so terrible. I have seen many such tools mentioned here, but always in passing.

The actual motivator is re-reading the posts Exercises in Comprehensive Information Gathering and Fact Posts: How and Why. While there is no substitute for the wrench-time they recommend, I think these kinds of tools make the process more efficient and lend themselves to insights which are difficult to acquire through reading alone; in my experience scale and distance are both easier to grasp in a visual medium, for example.

Also there is a non-trivial sense in which they are beautiful in their own right. If we are able to compare many examples, people in the community might even be able to help advance the art.

Submission Rules

One nomination per comment; please include an explanation of why you nominated it. Contra the best textbooks list we won't require comparison with other visualizations because there are so few authoritative ones.

Current List

WEB:

History

Math

Economics

Machine Learning

Miscellaneous

VIDEO:

BOOKS:

JOURNALISM:

New to LessWrong?

New Comment
21 comments, sorted by Click to highlight new comments since: Today at 12:27 AM

You should create a github.com repo ala the Awesome lists (e.g., https://github.com/hellerve/programming-talks). Lesswrong does not lend itself well to these collaborative community resources, as evidenced by the death of The Best Textbooks on Every Subject.

This strikes me as likely getting it backwards. The Best Textbooks on Every Subject thread was active for vastly longer than almost any Github repo of this style that I've ever seen. It petered out after a few years, but that's a really long time, and mostly because Luke was no longer actively maintaining it. 

I expect doing this on Github to basically cause an immediate failure of this. This isn't because Github repos like this can't work, but it's because Github repo's like this don't end up being added to via Github, but via separate channels, and are usually curated by one really engaged person who invests a lot of time into it. They are not a good template for crowdsourcing submissions. 

You can easily edit any documents on GitHub itself; It’ll automatically create a pull request for you. Github also has good features for having multiple maintainers, handing off ownership, and even forking in the worst case. You can even find active forks of any repo. (I have an extension that displays this information on the main repo page, called Lovely Forks.)

https://github.com/JanVanRyswyck/awesome-talks/commits/master , for an example of one such repo I use and is in active “development.”

Github allows better organizatio of the information (e.g., in the textbook example, we could have had a separate directory for each subject, and a separate file in those for each contributor’s opinions), as well. 
 

Now, I have no doubt that starting any such community effort is hard, and maybe using Lesswrong has a lower barrier to entry, but in the long run, a Github repo is more viable. The best case scenario is for the author to collate the comments on Lesswrong into the git repo, and provide links to tutorials on using Github’s web interface for new contributors.

I have taken your advice; the link is now at the top. Feel free to offer commentary on formatting and readability; it's my first repo and the cool tricks I have seen elsewhere are not known to me.

Thanks! I think the formatting is good enough. Adding screenshots will probably engage people more; It’s a quick way to show people if they are at all interested in the content. 
 

Standford's collection of ancient data visualizations

https://exhibits.stanford.edu/dataviz

Complexity Explorables: interactive toys to learn about complex dynamical systems.

http://complexity-explorables.org/

The current state of AIDS in the European Union

https://www.politico.eu/article/aids-european-state-of-play/

Evolution of word usage in Scientific American over the last 150 years

https://www.scientificamerican.com/article/explore-175-years-of-words-in-scientific-american/

Which and how body parts are described in literature, according to gender

https://pudding.cool/2020/07/gendered-descriptions/

Heatmap of mortality rates over the last centuries, by age, country and gender
https://jschoeley.shinyapps.io/hmdexp/

Historic usage of the word "ass"

https://pudding.cool/2019/10/slang/ (and anything by The Pudding really)

Simulated dendrochronology of USA immigration

https://web.northeastern.edu/naturalizing-immigration-dataviz/

Some meta thoughts, since I've thought a lot about what made the best-textbooks thread work: 

  • I think the inclusion criteria for the best textbook thread were really important. Having a relatively objective inclusion criteria made people more comfortable posting, and made using it as a trusted source a good bit better. 
  • I think having the top-level post update with new additions was really valuable, and caused the post to be a good long-term reference. A github repo might work, but I expect the trivial inconvenience of clicking through the link to make it a lot less useful (you want people to come back to the place where they can submit new visualizations if you want the thing to keep getting new submissions).
  • Limiting it to one submission per comment was really useful because it allowed for fine-grained upvoting and downvoting of various suggestions, and made navigating the thread a lot easier. I would also recommend doing that.

I agree with these, for the most part.

The inclusion criteria were easy to set for textbooks; textbooks are over a hundred years old as a format, and there are always multiple options for any not-brand-new subject. By contrast most of the really impressive visualizations of huge datasets are younger than 10 years, and it is mostly a question of is there a visualization or not, rather than being able to select among several. That being said, I did consider whether to strive for an authoritative source, but I reason I can do the legwork up front to separate out hobbyist efforts. However, if everyone is more likely to use the list if it only contains authoritative sources (or endorsed by authoritative sources), then I would reconsider and set that as a criteria.

I agree with this directly, particularly the trivial inconvenience point. But I am comfortable doing the legwork on both for now, and we can see which proves the more popular/useful.

This is an excellent point; I will modify the post to include it.

I agree with this directly, particularly the trivial inconvenience point. But I am comfortable doing the legwork on both for now, and we can see which proves the more popular/useful.

Nice, thank you for that! :)

I use beauty as one of my top criteria for deciding whether to read a textbook. Textbooks I haven't had time to review:

An Illustrated Theory of Numbers. An exquisite work of art which is also an excellent textbook. Don't read if you hate number theory and you want to keep it that way.
Visual Group Theory. Lucid but slow, great for beginners.

I'd like to recommend Molecular Biology of the Cell as having made my heart sing, but I haven't completed it, and I'm a biology noob, so what do I know?

The first image seems broken to me.

For biochemistry, I think the Roche Biochemical Pathways chart is awesome, if a little overwhelming:

http://biochemical-pathways.com/#/map/1

I don't recommend using it to learn biochemistry but it's pretty great to see it all laid out in one place like that.

I feel like it being overwhelming is a feature, rather than a bug. Why doesn't such-and-such a drug work? Because it has to go through this.

A good firm smack in the face with complexity helps a lot in problem solving, in my view.

Here's a visualisation that goes along with Euclid's elements

A plot of which theorems are used in the proof of each theorem in Euclid's elements, ordered by the book e.g. the black dots at the bottom say proofs in the book 13 mostly used theorems from all the books bar 7,8,9 and 12. 

This was one of many from an article on "The Empirical MetaMathematics of Euclid and Beyond". It is a long essay on the overarching structure of Euclid's elements and verifies some claims made about Euclid's Elements e.g. the proofs were ordered in nearly the most parsimonious way possible. It also finds the most difficult theorems in each book, the greatest possible reductions in proof length, and hints that the network of theorem dependancy has a local 2-d structure. Highly recommend the article. 

Domain of Science is a YouTube channel that among other things makes maps of different fields of study, which can be a very nice way to get an overview of the different subjects in a given field.

Some good math visualization on YouTube include: 3Blue1Brown, Think Twice, Reducible, Welch Labs, Primer

Our World in Data for data visualization.

Explorable Explanations - A bunch of things explained visually and interactively.

tree of life explorers (TOLE) - OneZoom, MinuteLabs TLOE, and more (just search the term online)

For the field of chemistry, I nominate The Periodic Table of the Elements.  I know it's old but it really does capture a surprising amount of information in a visually pleasing format.

This isn’t a textbook, but Dataclysm by Christian Rudder was a major inspiration to me when I was new to data analysis. The book is like a long data analysis project around dating on OKCupid (Rudder founded the site), and has a lot of good graphs made just for the book. Unlike some of the popular examples made famous by e.g. Tufte, the graphs in Dataclysm are of the type an analyst in 2020 might typically make in their day-to-day work. Lots of scatter plots and bar plots, but created thoughtfully enough to really be something. Rarely in this book did I think “ah, beautiful” - much more often, I thought “ah, yup, I see the relationship he’s saying exists.”

The version of Dataclysm that I read had red font in addition to black font, and I found it was quite striking as a way to do emphasis. When I started writing my personal journal, I adopted red and black felt-tipped pens because I enjoyed the font colours from the book.

Not sure if this is what you're looking for, but I made a bunch of math and physics diagrams and animations back in my wikipedia days: :-) https://en.wikipedia.org/wiki/User:Sbyrnes321

Definetely not a subject, but I'd say that the visualisation of Wolfram's theory of everything is excellent. Of course there are problems with his theory of everything, like the fact that he hasn't actually proved his claims that it generates GR field equations or replicates QM. Or shown that his theory evades the critical objection Scott Aaronson raised. but as a visualisation:

  1. It is aesthetically pleasing
  2. Compactly contains the basic ideas of his T.o.E. 
  3. Ties the basic concepts together to see how they could generate a theory of physics

So I'd still recommend it. 

Question: should we include here, or make a separate location for, data visualization tools? By tools I mean the methods by which to make visualizations and/or maybe guidance for how.

This is clearly related, but it seems like a separate problem. It might be much more useful to the people in the community who maintain their own blog or have related professional obligations, which is a more distinct subset.