Picture This: Descriptive Statistics Don't Tell the Whole Story

May 16, 2017

Here's a great example of why it is important to plot your data and visually review the result before drawing any conclusions. Remarkably, the Y and X data used to create both of these scatter plots have the same descriptive statistics, and the same (almost) correlation coefficient value (r), but each plot certainly tells a very different story.

Engineroom Output

Here's a link to the original work by Justin Matejka and George Fitzmaurice, two researchers at Autodesk, who developed an algorithm to generate the variables with matching statistics: https://www.autodeskresearch.com/publications/samestats

Moresteam Poster
MoreSteam

MoreSteam's Enterprise Process Improvement platform includes the tools, training, and software you need to transform your organization, large or small, into a problem-solving powerhouse. Our products are trusted by over half of the Fortune 500 and by other organizations and universities worldwide. When you partner with MoreSteam you gain a team dedicated to helping you succeed.

Use Technology to Empower Your Continuous Improvement Program