Skip to content
Raw Data Studies

Raw Data Studies

  • About
  • Transit ridership data

    Transit ridership data

    What to do with 25 years of monthly transit system data? How about try to identify college towns based on ridership trends.

    May 17, 2026
  • Convert to CSV webapp

    Convert to CSV webapp

    In my hobby of digging into data shared by research articles, I sometimes encounter data file types I can’t read directly, so I built this webapp to help.

    May 15, 2026
  • Visualization study webapp

    Visualization study webapp

    The start (and end?) of my adventure in online data visualization research, with some preliminary results.

    April 19, 2026
  • Removing accidental Claude

    Removing accidental Claude

    How I removed Claude from the contributors list of my GitHub repo.

    March 27, 2026
  • Bad year to be born

    Bad year to be born

    Using data from the Human Mortality Database, I find some interesting patterns regarding good and bad years to be born.

    February 7, 2026
  • Seeing Uniformity

    Seeing Uniformity

    I tracked two-digit authentication codes for two years after suspecting a subtle bias toward higher values. Using simple visualizations, standard statistical tests, and simulation as a reality check, this post explores how easy it is to see structure in randomness and how hard it is to prove it.

    January 19, 2026
  • Data Strips: Quintiles vs. Box Plots

    Data Strips: Quintiles vs. Box Plots

    Experiments with a new “quintile area” strip plot, prompted by skewed box plots in a biology paper, ended up clarifying why box plots remain so robust.

    January 2, 2026
  • Greenland Ice melt history

    Greenland Ice melt history

    Charting Greenland Ice sheet melt data from two sources of raw data.

    December 29, 2025
  • From radar charts to curve fitting and back

    From radar charts to curve fitting and back

    An exploration of radar charts from a recent Nature article, tracing the path from radars to fitted sigmoidal curves to alternate derived summary views.

    December 15, 2025
  • Beeswarm attack

    Beeswarm attack

    Here’s another case in the wild of beeswarm jitter using clamped bounds that hide the distribution of the data. This one has a twist in that a large proportion of the data values are zero.

    November 22, 2025
1 2 3 … 6
Next Page→

Raw Data Studies

by Xan Gregg

Loading Comments...