Quantitative Data Distributions Summary Statistics Outliers
The objective of this activity is to use data from the 2021 Speed Climbing and Lead Climbing Olympic events to calculate summary statistics, construct graphs, and describe and compare distributions.
During the 2021 Summer Olympics in Tokyo, competition climbing became an Olympic sport for the first time ever. Athletes competed in three separate disciplines: speed climbing, bouldering, and lead climbing. There were 20 men and 20 women who competed in the 2021 Olympic Sports Climbing Qualification Rounds.
In speed climbing, two climbers race side-by-side to scale identical routes on a 15m high wall set at an angle of 95 degrees.
The walls used for bouldering present a range of challenges, with overhangs and some holds so small that they can only be held by the fingertips. Climbers must plan each move carefully while constantly being aware of the 4 minute time limit. The goal is to complete as many routes as possible.
When lead climbing, athletes wearing harnesses attached to a climbing rope attempt to climb as high as they can on a taller wall measuring 15-20 meter within six minutes. The wall features 40-60 handholds. Climbers are scored on how far they progress, with each handhold earning 1 point.
The table below shows the number of holds attained by the 20 women during the Lead Climbing Qualification Rounds.
40 | 33 | 30 | 30 | 29 | 27 | 26 | 26 | 25 | 25 | 22 | 21 | 21 | 16 | 16 | 15 | 14 | 13 | 12 | 7 |
Source: Olympics.com
The 5-number summary statistics can be computed using the Statistics application. Input the number of holds attained in V1.
Use the Stats tab to obtain the necessary values
Min | Q1 | Med | Q3 | Max |
---|---|---|---|---|
7 | 15.5 | 23.5 | 28 | 40 |
Use the Graph tab to construct a Histogram and Boxplot of the data.
The distribution of holds attained by female Olympic climbers is fairly symmetric with one peak. The distribution is centered around the median of 23.5 holds with a range of 40-28 = 12 holds. There are no obvious outliers.
The boxplot provides a quick summary of the distribution but the histogram shows the shape of the distribution.
The table below shows the number of holds attained by the 20 men during the Lead Climbing Qualification Round.
42 | 42 | 41 | 39 | 39 | 36 | 36 | 35 | 29 | 29 | 28 | 28 | 28 | 26 | 26 | 26 | 25 | 25 | 25 | 7 |
Source: Olympics.com
A boxplot can be constructed using the Statistics application. Input the number of holds attained in V1. Select the boxplot from the Graph tab.
The data point 7 is an outlier if it falls below Q1 - 1.5(IQR). For this dataset, Q1 = 26 and IQR = 11.5. Therefore, the cutoff value is 26 - 1.5(11.5) = 8.75. Since 7 < 8.75, Mawem's value is an outlier.
The mean and standard deviation are not resistant to outliers. Therefore, the median and IQR should be used to describe the distribution.
The table below shows the times (in seconds) of the 20 men and 20 women during the Speed Climbing Qualification Round.
Men | 5.45 | 5.94 | 5.95 | 6.19 | 6.21 | 6.23 | 6.32 | 6.33 | 6.48 | 6.51 |
---|---|---|---|---|---|---|---|---|---|---|
6.63 | 6.70 | 6.71 | 6.93 | 7.08 | 7.23 | 7.34 | 7.46 | 7.47 | 7.59 | |
Women | 6.97 | 7.12 | 7.46 | 7.55 | 7.65 | 7.99 | 8.08 | 8.17 | 8.23 | 8.42 |
8.51 | 8.67 | 8.83 | 9.44 | 9.54 | 9.65 | 10.01 | 10.43 | 10.5 | 11.1 |
Source: Olympics.com
Use the Histogram or Box options from the Graph tab to construct graphs to compare the data.
Answers may vary. The average climbing time was less for men than women (around 6.5 seconds for men compared to around 8.5 seconds for women). The climbing time for women varied more than it did for men as both the standard deviation and IQR were 2-3 times larger for women than men. The distribution of for women's climbing time appears to be slightly skewed right whereas the distribution for men may be slightly skewed left. Neither distribution of climbing times appears to have outliers.