Demonstrating how large language models work

You might’ve heard about large language models lately. They’re the “brains” behind recent chatbots that seem to know an awful lot. Aatish Bhatia, for NYT’s The Upshot, walks you through how such a model “learns” to write based on a relatively small body of text.

There’s a little bit of Choose Your Own Adventure mixed in, so you can select the type of text. Bhatia used nanoGPT, an open source library, to model Jane Austen, Shakespeare, and Star Trek: The Next Generation.

Tags: , , ,

Even with vaccine, probably shouldn’t rush into easing up on restrictions

With vaccines, we might be tempted to jump back into “normal” life before it’s really safe. The New York Times reports on why waiting until March instead of February might be the way to. This is based on estimates from Columbia University researchers, and you can read the preprint here (pdf) by Jeffrey Shaman et al.

We’ve come this far already…

Tags: , , , ,

Visual explainer for hierarchical modeling

Hierarchical models, or multilevel models, are used to represent data that might vary on, you guessed, different levels. Michael Freeman, from the University of Washington Information School, provides an introduction to the method using a scrolling format. The transitions give a good sense of how the model can change, depending on your approach.

Tags: , ,