EngineRoom

Pareto Analysis Tutorial

Tutorial

When to use this tool

The Pareto Chart is a simple tool that organizes independent categories from highest to lowest frequency in order to visualize and compare the relative importance of each. It helps you choose the most important changes to make or select the small number of tasks that produce the most significant overall effect. The Pareto Analysis study is located under the Measure menu:

Pareto analysis start up menu.

How to use this tool in EngineRoom

The Pareto Analysis tool can handle both raw data (where the independent categories are listed down the column, and where the counts of each are indicated by how many times they appear) and summary data (where the categories map to a frequency or other numeric variable).

Raw Data Example

Lat's first use the raw data. Here we have data from a computer support center showing the reasons for service calls. A snapshot of the data set is shown:

Sample pareto analysis data.

Go to Measure menu > Pareto Analysis. The study opens. Drag the Computer Support Service Calls (raw data) variable on to the Categories Variable drop zone. The resulting Pareto Analysis output is shown:

Sample pareto analysis aggregated output.

The Pareto chart shows the data organized as vertical bars in order from largest to smallest, indicating that Support Calls regarding computer viruses detected is the most frequently occurring call. The output also includes a table containing the counts corresponding to the categories, the cumulative counts, percentages of the total and cumulative percentages:

Aggregated Data Example

The second way of formatting data, aggregated data, uses two different dropzones. The data set is the same as before, but now it is split in two columns, with the first column showing the independent categories and their counts in the second column:

Sample pareto analysis aggregate data.

Go to Measure > Pareto Analysis. The study opens. Drag the Computer Support Service Calls variable on to the Categories Variable drop zone and the Count variable on to the Frequency Variable drop zone. The resulting Pareto Analysis output is shown:

Sample pareto analysis aggregated output.

The chart and table output are the same as before becasue the data are the same, just organized in an aggregated format.

Editing the Chart

The study setup button on the top right of the study has two tabs:

1. The Data tab has three conditional options

  • Combine attributes (the items that are being counted): this is an On/Off button that you can use to combine categories with small counts together into a single category. If this button is On, you can use the other two options below.
  • Specify the percentage value after which to combine attributes. If you enter '90'%, the chart will list each category individually up until 90% is achieved, and combine all attributes after that value.
  • Specify the name of the category containing the combined data.
Pareto analysis set up menu.

2. The Display tab has a single option

  • Turn the cumulative frequency line On or Off (it is on by default)
Pareto analysis start up menu display tab.

Was this helpful?