KIN 610: Quantitative Methods in Kinesiology

FYI

This presentation is based on the the following books. The references are coming from these books unless otherwise specified.

Main sources:

Weir, J. P., Vincent, W. J. (2021). Statistics in Kinesiology. Human Kinetics.
Furtado, O., Jr. (2026). Statistics for movement science: A hands-on guide with SPSS (1st ed.). https://drfurtado.github.io/sms

ClassShare App

You may be asked in class to go to the ClassShare App to answer questions.

https://classsharedrfurtado.netlify.app/

Learning Objectives

Define central tendency and identify the three primary measures: mode, median, and mean.
Calculate the mode for raw data and frequency distributions.
Calculate the median for raw data and frequency distributions.
Calculate the mean for raw data and frequency distributions.
Identify the appropriate measure of central tendency for different types of data.
Explain the advantages and disadvantages of each measure of central tendency.
Describe the relationship between the mean, median, and mode in different types of distributions.

Distribution Shapes

Understanding distribution shapes is crucial for selecting the appropriate measure of central tendency^[1].

Normal (Symmetric)

Characteristics: - Symmetric bell shape - Mean = Median = Mode - Best measure: Mean

Positively Skewed

Characteristics: - Tail extends right - Mode < Median < Mean - Best measure: Median

Negatively Skewed

Characteristics: - Tail extends left - Mean < Median < Mode - Best measure: Median

Important

The shape of the distribution determines which measure of central tendency is most appropriate. In skewed distributions, the mean is pulled toward the tail, while the median remains more representative of the “typical” value.

Introduction: Describing the Center

This chapter focuses on measures of central tendency — statistical values that describe the middle or central characteristics of a data set^[1].

We will explore three primary measures:
- Mode: The most frequent score
- Median: The middle score
- Mean: The arithmetic average

Important

These measures describe how scores tend to cluster in a distribution, which is fundamentally different from measures of variability (how scores spread out), covered in Chapter 5.

Key Terms

Understanding the terminology is essential for selecting and interpreting the appropriate measure^[1]:

Measures of Central Tendency: Values that describe the middle or central characteristics of a data set
Mode: The most frequent or common score in a distribution
Median: The score above and below which half (50%) of all scores fall; the “typical” score
Mean: The sum of all scores divided by the total number of scores; the arithmetic average

Note

Each measure has specific use cases, strengths, and limitations. The choice depends on the data type and distribution shape. Recall we discussed data types in Chapter 2 - nominal, ordinal, interval, and ratio. Distribution shapes will be discussed in Chapter 6.

The Mode: Most Frequent Score

Definition: The score that occurs most frequently in a distribution
Calculation: No formula required — identified by inspection
Methods:
- Rank order listing: Look for the score that appears most often
- Frequency distributions: Find the highest value in the frequency (\(f\)) column

Note

Distributions can be unimodal (one mode), bimodal (two modes), trimodal (three modes), or multimodal (more than three modes). A distribution can technically have any number of modes.

Mode Example: Pull-Up Scores

Table 4.1: Rank Order Distribution of Pull-Up Scores

Score	Frequency
18	1
17	2
16	3
15	5
14	4
13	3
12	2
11	1
10	1

Tip

Question: What is the mode in this distribution?

Answer

The mode is 15 because it has the highest frequency (5 occurrences).

Disadvantages of the Mode

While simple, the mode has significant limitations:

Unstable: Can change dramatically based on how data is grouped or categorized
Terminal statistic: Not useful for further mathematical or statistical calculations
Ignores information: Does not account for the values of extreme scores or the overall distribution shape

Warning

The mode is best used as a rough estimate when data are near normal, or for nominal data where it is the only appropriate measure.

The Median: The Middle Score

Note

The median is the “typical” score and is particularly useful for ordinal data or highly skewed distributions^[1].

Definition: The 50th percentile — the score that divides the data set into two equal halves
Characteristics:
- Half of all scores fall above the median
- Half of all scores fall below the median
Calculation (Odd N): The middle score in the ordered list
Calculation (Even N): The average of the two middle scores

Median: Strengths and Applications

Key Strengths:

Unaffected by extreme scores (outliers): A few very high or very low values do not distort the median^[2]
Appropriate for ordinal data: Works well when data has rank order but unequal intervals
Robust measure: Provides a stable center even when distributions are skewed^[1]

When to Use the Median:

Data is ordinal (e.g., Likert scales, race finish positions)
You need the middle or “typical” score
The distribution is badly skewed by outliers

Tip

Example: If measuring recovery time and one participant takes 10x longer than others due to injury, the median recovery time is more representative than the mean.

The Mean: Arithmetic Average

Definition: The arithmetic average — the sum of all scores divided by the number of scores^[1]
Status: The most widely used index of central tendency in statistics
Logic: Balances all values in the distribution (the “balance point”)

Important

The mean is sensitive to every score in the distribution, including extreme values (outliers)^[3].

Formula for the Mean

The mean is calculated using the following formula:

\[\bar{X} = \frac{\sum X}{N}\]

Components:

\(\bar{X}\) = Mean (pronounced “X-bar”)
\(\sum\) = Summation sign (add up all values)
\(X\) = Individual raw score
\(N\) = Total number of scores

Note

This formula is fundamental to many subsequent statistical calculations, including standard deviation, z-scores, and inferential tests.

Sample Calculation of the Mean

VO₂max data (mL/kg/min) for 7 participants:

30.6, 29.5, 28.2, 27.8, 26.5, 26.1, 25.7

Calculation:

\[\bar{X} = \frac{\sum X}{N} = \frac{30.6 + 29.5 + 28.2 + 27.8 + 26.5 + 26.1 + 25.7}{7} = \frac{194.4}{7} = 27.77 \text{ mL/kg/min}\]

Important

Note that the mean is expressed in the same units as the raw data (mL/kg/min in this case).

Mean: Strengths and Limitations

Strengths:

Uses all information in the data set (every value matters)^[1]
Essential for subsequent statistical inference and calculations
Most appropriate for interval and ratio data
Algebraically defined, allowing for advanced mathematical operations

Limitations:

Sensitive to outliers: Extreme scores have a disproportionate impact^[3]
Arguably inappropriate for ordinal data: Assumes equal intervals between values
Can be misleading when distributions are highly skewed

Warning

When extreme scores are present, the mean may not represent the “typical” value well. Consider using the median instead^[2].

Test Your Knowledge: Calculating the Mean

You measure the maximum bench press (in kg) for 5 athletes:

Data: 80, 85, 90, 95, 100

Calculate the mean bench press using the calculator on your device.

Answer

\[\bar{X} = \frac{80 + 85 + 90 + 95 + 100}{5} = \frac{450}{5} = 90 \text{ kg}\]

The mean bench press is 90 kg.

Checking your calculation

Open SPSS using the Canvas link and enter the data into the Data View. Then, go to Analyze > Descriptive Statistics > Frequencies. Select the variable you entered and click OK. The mean will be displayed in the output.

Normal Distribution: Mode = Median = Mean

Characteristics:

All three measures coincide at or near the same value
Mode = Median = Mean
This is the ideal scenario for using the mean
Symmetric bell-shaped distribution

Best measure: Mean (can use for further calculations)

Figure 4: Normal distribution: all measures coincide

Positively Skewed: Mode < Median < Mean

Characteristics:

The measures are pulled apart
The mean is pulled furthest toward the tail (right)
The median sits between the mode and mean
Tail extends to the right

Best measure: Median (resistant to outliers)

Figure 5: Positively skewed distribution showing Mode, Median, and Mean

Negatively Skewed: Mean < Median < Mode

Characteristics:

The measures are pulled apart
The mean is pulled furthest toward the tail (left)
The median sits between the mean and mode
Tail extends to the left

Best measure: Median (resistant to outliers)

Figure 6: Negatively skewed distribution showing Mode, Median, and Mean

Summary: When to Use Mode and Median

Tip

Mode: Use for a rough estimate if data are near normal, or when working with nominal (categorical) data.

Median: Use if data is ordinal, you need the middle/typical score, or the distribution is badly skewed by outliers.

Summary: When to Use the Mean

Tip

Mean: Use if data is interval/ratio and the distribution is near normal, all data information (values and order) must be considered, or further calculations (e.g., standard deviation, standard scores, inferential tests) are required.

Decision Tree: Selecting the Right Measure

Important

This decision tree provides a practical guide, but always consider the research question and context when selecting a measure of central tendency.

Test Your Knowledge: Choosing the Right Measure

For each scenario, identify which measure of central tendency (mode, median, or mean) is most appropriate:

Sport type (soccer, basketball, swimming) for 100 athletes
Finish position in a marathon (1st, 2nd, 3rd, …)
Reaction time (in milliseconds) with one extreme outlier
Body mass (in kg) for a normally distributed sample, and you need to calculate standard deviation

Using the ClassShare App, submit your answers.

Answers

Mode — nominal data (categories only)
Median — ordinal data (rank order)
Median — interval/ratio data with outliers
Mean — interval/ratio data, normal distribution, further calculations needed

Practical Application: Real Data Example

Scenario: You measure sprint times (in seconds) for 10 athletes:

Data: 10.2, 10.5, 10.3, 10.8, 10.2, 10.6, 10.4, 10.2, 10.7, 12.5

Questions:

What is the mode?
What is the median?
What is the mean?
Which measure best represents “typical” performance? Why?

Figure 7: Distribution of sprint times showing outlier effect

Click to reveal answers

Mode: 10.2 seconds (appears 3 times)
Median:
- Ordered data: 10.2, 10.2, 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 12.5
- Middle two values: 10.4 and 10.5
- Median = (10.4 + 10.5) / 2 = 10.45 seconds
Mean:
- Sum = 106.4
- Mean = 106.4 / 10 = 10.64 seconds
Best measure: The median (10.45 s) best represents typical performance because the mean is pulled upward by the outlier (12.5 s). The median is more robust to this extreme value.

Note

This example demonstrates why understanding distribution shape and outliers is crucial for selecting the appropriate measure of central tendency. Notice how the outlier (12.5 s) pulls the mean to the right, while the median stays closer to the bulk of the data.

Tip

The graph above is a dot plot. To learn how to create dot plots, click here.

Key Takeaways

Remember These Core Concepts:

Mode: Most frequent score — simple but limited; best for nominal data^[1]
Median: Middle score (50th percentile) — robust to outliers; best for ordinal or skewed data^[1,2]
Mean: Arithmetic average — uses all information; best for interval/ratio data with normal distributions^[1]
Distribution shape matters: In normal distributions, all three measures converge; in skewed distributions, they diverge^[3]
Choose wisely: Match the measure to your data type, distribution shape, and research goals^[4]

Next Steps: Chapter 5

In the next chapter, we will explore measures of variability:

How do scores spread out around the center?
What is the range, variance, and standard deviation?
How do we interpret variability in the context of research?

Note

Understanding both central tendency (where scores cluster) and variability (how spread out they are) gives you a complete picture of your data distribution.

References

1. Moore, D. S., McCabe, G. P., & Craig, B. A. (2021). Introduction to the practice of statistics (10th ed.). W. H. Freeman; Company.

2. Wilcox, R. R. (2017). Introduction to robust estimation and hypothesis testing (4th ed.). Academic Press.

3. Hippel, P. T. von. (2005). Mean, median, and skew: Correcting a textbook rule. Journal of Statistics Education, 13(2). https://doi.org/10.1080/10691898.2005.11910570

4. Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.

5. Furtado, O., Jr. (2026). Statistics for movement science: A hands-on guide with SPSS (1st ed.). https://drfurtado.github.io/sms/