KIN 610: Quantitative Methods in Kinesiology

Chapter 4: Measures of Central Tendency

Ovande Furtado Jr., PhD.

Professor, Cal State Northridge

2026-01-10

FYI

This presentation is based on the the following books. The references are coming from these books unless otherwise specified.

Main sources:

  • Weir, J. P., Vincent, W. J. (2021). Statistics in Kinesiology. Human Kinetics.
  • Furtado, O., Jr. (2026). Statistics for movement science: A hands-on guide with SPSS (1st ed.). https://drfurtado.github.io/sms

ClassShare App

You may be asked in class to go to the ClassShare App to answer questions.

Learning Objectives

  • Define central tendency and identify the three primary measures: mode, median, and mean.
  • Calculate the mode for raw data and frequency distributions.
  • Calculate the median for raw data and frequency distributions.
  • Calculate the mean for raw data and frequency distributions.
  • Identify the appropriate measure of central tendency for different types of data.
  • Explain the advantages and disadvantages of each measure of central tendency.
  • Describe the relationship between the mean, median, and mode in different types of distributions.

Distribution Shapes

Understanding distribution shapes is crucial for selecting the appropriate measure of central tendency[1].

Normal (Symmetric)

Figure 1: Normal Distribution

Characteristics: - Symmetric bell shape - Mean = Median = Mode - Best measure: Mean

Positively Skewed

Figure 2: Positively Skewed

Characteristics: - Tail extends right - Mode < Median < Mean - Best measure: Median

Negatively Skewed

Figure 3: Negatively Skewed

Characteristics: - Tail extends left - Mean < Median < Mode - Best measure: Median

Important

The shape of the distribution determines which measure of central tendency is most appropriate. In skewed distributions, the mean is pulled toward the tail, while the median remains more representative of the “typical” value.

Introduction: Describing the Center

This chapter focuses on measures of central tendency — statistical values that describe the middle or central characteristics of a data set[1].

  • We will explore three primary measures:
    • Mode: The most frequent score
    • Median: The middle score
    • Mean: The arithmetic average

Important

These measures describe how scores tend to cluster in a distribution, which is fundamentally different from measures of variability (how scores spread out), covered in Chapter 5.

Key Terms

Understanding the terminology is essential for selecting and interpreting the appropriate measure[1]:

  • Measures of Central Tendency: Values that describe the middle or central characteristics of a data set
  • Mode: The most frequent or common score in a distribution
  • Median: The score above and below which half (50%) of all scores fall; the “typical” score
  • Mean: The sum of all scores divided by the total number of scores; the arithmetic average

Note

Each measure has specific use cases, strengths, and limitations. The choice depends on the data type and distribution shape. Recall we discussed data types in Chapter 2 - nominal, ordinal, interval, and ratio. Distribution shapes will be discussed in Chapter 6.

The Mode: Most Frequent Score

G A Identify most frequent score B Mode A->B

  • Definition: The score that occurs most frequently in a distribution
  • Calculation: No formula required — identified by inspection
  • Methods:
    • Rank order listing: Look for the score that appears most often
    • Frequency distributions: Find the highest value in the frequency (\(f\)) column

Note

Distributions can be unimodal (one mode), bimodal (two modes), trimodal (three modes), or multimodal (more than three modes). A distribution can technically have any number of modes.

Mode Example: Pull-Up Scores

Table 4.1: Rank Order Distribution of Pull-Up Scores

Score Frequency
18 1
17 2
16 3
15 5
14 4
13 3
12 2
11 1
10 1

Tip

Question: What is the mode in this distribution?

Answer The mode is 15 because it has the highest frequency (5 occurrences).

Disadvantages of the Mode

While simple, the mode has significant limitations:

  1. Unstable: Can change dramatically based on how data is grouped or categorized
  2. Terminal statistic: Not useful for further mathematical or statistical calculations
  3. Ignores information: Does not account for the values of extreme scores or the overall distribution shape

Warning

The mode is best used as a rough estimate when data are near normal, or for nominal data where it is the only appropriate measure.

The Median: The Middle Score

G A Rank order all scores B Find middle value A->B C Median B->C

Note

The median is the “typical” score and is particularly useful for ordinal data or highly skewed distributions[1].

  • Definition: The 50th percentile — the score that divides the data set into two equal halves
  • Characteristics:
    • Half of all scores fall above the median
    • Half of all scores fall below the median
  • Calculation (Odd N): The middle score in the ordered list
  • Calculation (Even N): The average of the two middle scores

Median: Strengths and Applications

Key Strengths:

  • Unaffected by extreme scores (outliers): A few very high or very low values do not distort the median[2]
  • Appropriate for ordinal data: Works well when data has rank order but unequal intervals
  • Robust measure: Provides a stable center even when distributions are skewed[1]

When to Use the Median:

  • Data is ordinal (e.g., Likert scales, race finish positions)
  • You need the middle or “typical” score
  • The distribution is badly skewed by outliers

Tip

Example: If measuring recovery time and one participant takes 10x longer than others due to injury, the median recovery time is more representative than the mean.

The Mean: Arithmetic Average

G A Sum all scores B Divide by number of scores A->B C Mean B->C

  • Definition: The arithmetic average — the sum of all scores divided by the number of scores[1]
  • Status: The most widely used index of central tendency in statistics
  • Logic: Balances all values in the distribution (the “balance point”)

Important

The mean is sensitive to every score in the distribution, including extreme values (outliers)[3].

Formula for the Mean

The mean is calculated using the following formula:

\[\bar{X} = \frac{\sum X}{N}\]

Components:

  • \(\bar{X}\) = Mean (pronounced “X-bar”)
  • \(\sum\) = Summation sign (add up all values)
  • \(X\) = Individual raw score
  • \(N\) = Total number of scores

Note

This formula is fundamental to many subsequent statistical calculations, including standard deviation, z-scores, and inferential tests.

Sample Calculation of the Mean

VO₂max data (mL/kg/min) for 7 participants:

30.6, 29.5, 28.2, 27.8, 26.5, 26.1, 25.7

Calculation:

\[\bar{X} = \frac{\sum X}{N} = \frac{30.6 + 29.5 + 28.2 + 27.8 + 26.5 + 26.1 + 25.7}{7} = \frac{194.4}{7} = 27.77 \text{ mL/kg/min}\]

Important

Note that the mean is expressed in the same units as the raw data (mL/kg/min in this case).

Mean: Strengths and Limitations

Strengths:

  • Uses all information in the data set (every value matters)[1]
  • Essential for subsequent statistical inference and calculations
  • Most appropriate for interval and ratio data
  • Algebraically defined, allowing for advanced mathematical operations

Limitations:

  • Sensitive to outliers: Extreme scores have a disproportionate impact[3]
  • Arguably inappropriate for ordinal data: Assumes equal intervals between values
  • Can be misleading when distributions are highly skewed

Warning

When extreme scores are present, the mean may not represent the “typical” value well. Consider using the median instead[2].

Test Your Knowledge: Calculating the Mean

You measure the maximum bench press (in kg) for 5 athletes:

Data: 80, 85, 90, 95, 100

Calculate the mean bench press using the calculator on your device.

Answer

\[\bar{X} = \frac{80 + 85 + 90 + 95 + 100}{5} = \frac{450}{5} = 90 \text{ kg}\]

The mean bench press is 90 kg.

Checking your calculation

Open SPSS using the Canvas link and enter the data into the Data View. Then, go to Analyze > Descriptive Statistics > Frequencies. Select the variable you entered and click OK. The mean will be displayed in the output.

Normal Distribution: Mode = Median = Mean

Characteristics:

  • All three measures coincide at or near the same value
  • Mode = Median = Mean
  • This is the ideal scenario for using the mean
  • Symmetric bell-shaped distribution

Best measure: Mean (can use for further calculations)

Figure 4: Normal distribution: all measures coincide

Positively Skewed: Mode < Median < Mean

Characteristics:

  • The measures are pulled apart
  • The mean is pulled furthest toward the tail (right)
  • The median sits between the mode and mean
  • Tail extends to the right

Best measure: Median (resistant to outliers)

Figure 5: Positively skewed distribution showing Mode, Median, and Mean

Negatively Skewed: Mean < Median < Mode

Characteristics:

  • The measures are pulled apart
  • The mean is pulled furthest toward the tail (left)
  • The median sits between the mean and mode
  • Tail extends to the left

Best measure: Median (resistant to outliers)

Figure 6: Negatively skewed distribution showing Mode, Median, and Mean

Summary: When to Use Mode and Median

G A Choose Measure of Central Tendency B Mode A->B C Median A->C B1 Nominal data B->B1 B2 Quick rough estimate (near-normal data) B->B2 C1 Ordinal data C->C1 C2 Need typical/middle score C->C2 C3 Badly skewed distribution with outliers C->C3

Tip

Mode: Use for a rough estimate if data are near normal, or when working with nominal (categorical) data.

Median: Use if data is ordinal, you need the middle/typical score, or the distribution is badly skewed by outliers.

Summary: When to Use the Mean

G A Choose Measure of Central Tendency B Mean A->B B1 Interval/ratio data B->B1 B2 Near-normal distribution B->B2 B3 All data information must be considered B->B3 B4 Further calculations needed (SD, z-scores, inference) B->B4

Tip

Mean: Use if data is interval/ratio and the distribution is near normal, all data information (values and order) must be considered, or further calculations (e.g., standard deviation, standard scores, inferential tests) are required.

Decision Tree: Selecting the Right Measure

G Start What type of data? Nominal Nominal Start->Nominal Ordinal Ordinal Start->Ordinal Interval Interval/Ratio Start->Interval UseMode Use MODE Nominal->UseMode UseMedian Use MEDIAN Ordinal->UseMedian Skewed Is distribution skewed or has outliers? Interval->Skewed Skewed->UseMedian Yes FurtherCalc Need further calculations? Skewed->FurtherCalc No UseMean Use MEAN FurtherCalc->UseMean Yes UseMedian2 Use MEDIAN or MEAN FurtherCalc->UseMedian2 No

Important

This decision tree provides a practical guide, but always consider the research question and context when selecting a measure of central tendency.

Test Your Knowledge: Choosing the Right Measure

For each scenario, identify which measure of central tendency (mode, median, or mean) is most appropriate:

  1. Sport type (soccer, basketball, swimming) for 100 athletes
  2. Finish position in a marathon (1st, 2nd, 3rd, …)
  3. Reaction time (in milliseconds) with one extreme outlier
  4. Body mass (in kg) for a normally distributed sample, and you need to calculate standard deviation

Using the ClassShare App, submit your answers.

Answers
  1. Mode — nominal data (categories only)
  2. Median — ordinal data (rank order)
  3. Median — interval/ratio data with outliers
  4. Mean — interval/ratio data, normal distribution, further calculations needed

Practical Application: Real Data Example

Scenario: You measure sprint times (in seconds) for 10 athletes:

Data: 10.2, 10.5, 10.3, 10.8, 10.2, 10.6, 10.4, 10.2, 10.7, 12.5

Questions:

  1. What is the mode?
  2. What is the median?
  3. What is the mean?
  4. Which measure best represents “typical” performance? Why?
Figure 7: Distribution of sprint times showing outlier effect
Click to reveal answers
  1. Mode: 10.2 seconds (appears 3 times)
  2. Median:
    • Ordered data: 10.2, 10.2, 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 12.5
    • Middle two values: 10.4 and 10.5
    • Median = (10.4 + 10.5) / 2 = 10.45 seconds
  3. Mean:
    • Sum = 106.4
    • Mean = 106.4 / 10 = 10.64 seconds
  4. Best measure: The median (10.45 s) best represents typical performance because the mean is pulled upward by the outlier (12.5 s). The median is more robust to this extreme value.

Note

This example demonstrates why understanding distribution shape and outliers is crucial for selecting the appropriate measure of central tendency. Notice how the outlier (12.5 s) pulls the mean to the right, while the median stays closer to the bulk of the data.

Tip

The graph above is a dot plot. To learn how to create dot plots, click here.

Key Takeaways

Remember These Core Concepts:

  1. Mode: Most frequent score — simple but limited; best for nominal data[1]
  2. Median: Middle score (50th percentile) — robust to outliers; best for ordinal or skewed data[1,2]
  3. Mean: Arithmetic average — uses all information; best for interval/ratio data with normal distributions[1]
  4. Distribution shape matters: In normal distributions, all three measures converge; in skewed distributions, they diverge[3]
  5. Choose wisely: Match the measure to your data type, distribution shape, and research goals[4]

Next Steps: Chapter 5

In the next chapter, we will explore measures of variability:

  • How do scores spread out around the center?
  • What is the range, variance, and standard deviation?
  • How do we interpret variability in the context of research?

Note

Understanding both central tendency (where scores cluster) and variability (how spread out they are) gives you a complete picture of your data distribution.

References

1. Moore, D. S., McCabe, G. P., & Craig, B. A. (2021). Introduction to the practice of statistics (10th ed.). W. H. Freeman; Company.
2. Wilcox, R. R. (2017). Introduction to robust estimation and hypothesis testing (4th ed.). Academic Press.
3. Hippel, P. T. von. (2005). Mean, median, and skew: Correcting a textbook rule. Journal of Statistics Education, 13(2). https://doi.org/10.1080/10691898.2005.11910570
4. Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.
5. Furtado, O., Jr. (2026). Statistics for movement science: A hands-on guide with SPSS (1st ed.). https://drfurtado.github.io/sms/