Raw Data Studies – Visual explorations of found data

Raw Data Studies

Bad year to be born

Using data from the Human Mortality Database, I find some interesting patterns regarding good and bad years to be born.

February 7, 2026
Seeing Uniformity

I tracked two-digit authentication codes for two years after suspecting a subtle bias toward higher values. Using simple visualizations, standard statistical tests, and simulation as a reality check, this post explores how easy it is to see structure in randomness and how hard it is to prove it.

January 19, 2026
Data Strips: Quintiles vs. Box Plots

Experiments with a new “quintile area” strip plot, prompted by skewed box plots in a biology paper, ended up clarifying why box plots remain so robust.

January 2, 2026
Greenland Ice melt history

Charting Greenland Ice sheet melt data from two sources of raw data.

December 29, 2025
From radar charts to curve fitting and back

An exploration of radar charts from a recent Nature article, tracing the path from radars to fitted sigmoidal curves to alternate derived summary views.

December 15, 2025
Beeswarm attack

Here’s another case in the wild of beeswarm jitter using clamped bounds that hide the distribution of the data. This one has a twist in that a large proportion of the data values are zero.

November 22, 2025
NCAA football team draft rates

Comparing NCAA football players’ NFL draft rates with their high school composite ratings

October 26, 2025
Step count versus city walkability

Someone once quipped that I only read journal articles by looking at the pictures. I admit the graphs are the first things I look at. Next I check the data availability statement, to see if I can better understand the graphs with a little exploratory analysis. After that, I might also read the text of…

August 23, 2025
Data extraction challenge

Throughout my quests for raw data, I’ve learned a few techniques for find data lurking behind the charts. This walk-through shows a few of them,

July 21, 2025
Data Strips Experiment

I built a “Data Strips” app to experiment with new ways of graphically summarizing the distribution of a single variable.. You can try it out or access the code on GitHub. This post will introduce the app and summarize the views.

July 5, 2025