Raw Data Studies

Raw Data Studies

  • About
  • Regression with a ranged interval variable

    Regression with a ranged interval variable

    How reproducible are studies with shared data? I collect one more data point in a quest for the answer, and make a side exploration into a linear regression conundrum.

    December 18, 2021
  • Case study on wide scales

    Case study on wide scales

    Five alternatives to a broken axis scale for data visualization’s wide scale problem, and a broken line chart tip.

    November 26, 2021
  • Danger of quadratic extrapolation

    Danger of quadratic extrapolation

    Any investigation into small R² values leads to a different finding about extrapolation.

    September 20, 2021
  • Smoothing Star Trek gender ratios

    Smoothing Star Trek gender ratios

    A study of gender balance in Star Trek dialog leads to an adventure in data cleaning and a smoothing revelation.

    August 3, 2021
  • Reservoir status maps

    Reservoir status maps

    Where a more realistic (skeuomorphic) data encoding is less accurate to read.

    July 10, 2021
  • Mapping Choices

    Mapping Choices

    A study of a map of wine consumption reveals many underlying choices, both cosmetic and substantive.

    July 5, 2021
  • Anscombe’s Quartet Escapee

    Anscombe’s Quartet Escapee

    When a linear fit doesn’t look right, should we use it anyway? Let’s not — here are some alternatives.

    June 19, 2021
  • To stack or not to stack

    To stack or not to stack

    That is the question. Or is it about the value of the individual responses versus combined responses? And don’t forget to smooth.

    May 19, 2021
  • Querying Census Bureau for Divorce Rates

    Querying Census Bureau for Divorce Rates

    Trying to remake a divorce percentage chart led me learn about the new Census Bureau table query interface to its Public Use Microdata.

    May 8, 2021
  • Suicides and Covid-19

    Suicides and Covid-19

    Looking at the raw data behind two recent studies of early suicide counts during the Covid-19 pandemic gives me a chance to try out a “moving box plot” since the data is a bit sparse for regular confidence intervals.

    May 1, 2021
←Previous Page
1 2 3
Next Page→

Raw Data Studies

by Xan Gregg

 

Loading Comments...