Appendix K — Calculating Measures of Variability in SPSS
Range, Variance, and Standard Deviation Using SPSS 31
K.1 Purpose & Outcomes
This tutorial teaches you how to calculate and interpret measures of variability using SPSS 31. By the end, you will be able to compute the range, variance, and standard deviation for continuous variables using the Frequencies, Descriptives, and Explore procedures. You will learn when to use each procedure, how to interpret the output tables, and how to report these statistics in APA format. Additionally, you will understand how variability measures complement measures of central tendency and what they reveal about the spread or dispersion of your data.
K.2 Why This Matters
Measures of variability describe how spread out or dispersed the values in a dataset are. While measures of central tendency (mean, median, mode) tell you where the center of your data is, measures of variability tell you how much the data points differ from that center. Understanding variability is essential for describing your data completely, assessing the reliability of measurements, comparing the consistency of different groups, and making informed decisions about statistical analyses. Two groups can have identical means but vastly different variability, leading to different interpretations and conclusions.
K.3 Understanding Measures of Variability
Before diving into SPSS procedures, it’s important to understand what each measure represents and when to use it.
K.3.1 The Range
The range is the simplest measure of variability, calculated as the difference between the maximum and minimum values in your dataset. While easy to compute and understand, the range is highly sensitive to outliers—a single extreme value can dramatically inflate the range. The range provides a quick sense of the total spread but doesn’t tell you anything about how the values are distributed within that spread.
Formula: Range = Maximum − Minimum
K.3.2 The Variance
The variance measures the average squared deviation of each data point from the mean. It considers every value in the dataset, not just the extremes. Because deviations are squared, the variance is always positive and is expressed in squared units (e.g., seconds², cm²), which can make interpretation less intuitive. The variance is fundamental to many statistical procedures, including ANOVA and regression, but for descriptive purposes, the standard deviation is often preferred.
Population Formula: \(\sigma^2 = \frac{\sum(X - \mu)^2}{N}\)
Sample Formula: \(s^2 = \frac{\sum(X - \bar{X})^2}{n-1}\)
K.3.3 The Standard Deviation
The standard deviation is the square root of the variance and is expressed in the same units as the original data, making it more interpretable. It represents the typical or average distance of data points from the mean. A small standard deviation indicates that values cluster tightly around the mean (low variability), while a large standard deviation indicates that values are more spread out (high variability). For normally distributed data, approximately 68% of values fall within one standard deviation of the mean, and about 95% fall within two standard deviations.
Population Formula: \(\sigma = \sqrt{\frac{\sum(X - \mu)^2}{N}}\)
Sample Formula: \(s = \sqrt{\frac{\sum(X - \bar{X})^2}{n-1}}\)
SPSS uses the sample formula (with n−1 in the denominator) by default for variance and standard deviation. This provides an unbiased estimate of population parameters. The n−1 correction is called the degrees of freedom adjustment. When reporting results from SPSS, you’re typically reporting sample statistics unless you’ve specifically requested population parameters.
K.3.4 Coefficient of Variation
The coefficient of variation (CV) is a normalized measure of variability that expresses the standard deviation as a percentage of the mean. It’s useful for comparing variability across variables measured in different units or with different means. A CV of 10% means the standard deviation is 10% of the mean.
Formula: \(CV = \frac{s}{\bar{X}} \times 100\)
K.4 SPSS Procedures for Calculating Variability
SPSS offers three main procedures for calculating measures of variability: Frequencies, Descriptives, and Explore. Each has different strengths.
K.4.1 When to Use Frequencies
Use the Frequencies procedure when you want:
- Range, variance, and standard deviation along with central tendency
- Frequency tables and histograms
- Percentiles and quartiles
- Analysis of both continuous and categorical variables
K.4.2 When to Use Descriptives
Use the Descriptives procedure when you want:
- Quick, compact output for multiple continuous variables
- Range, variance, standard deviation, minimum, and maximum
- Efficient analysis without frequency tables
- Option to save standardized (z-score) values
K.4.3 When to Use Explore
Use the Explore procedure when you want:
- Comprehensive descriptive statistics and diagnostics
- Boxplots and stem-and-leaf plots
- Tests for normality (Shapiro-Wilk, Kolmogorov-Smirnov)
- Outlier detection and identification
- Separate analyses by groups
K.5 Method 1: Using Frequencies
The Frequencies procedure provides variability measures alongside central tendency and frequency distributions.
K.5.1 Step-by-Step Instructions
Step 1: Open Your Dataset
Ensure your data file is open in SPSS. For this tutorial, we’ll use the core_session.csv file filtered to pre-intervention data only.
Step 2: Access the Frequencies Dialog
Navigate to the menu bar and select:
Analyze > Descriptive Statistics > Frequencies
Step 3: Select Your Variable(s)
- Move the variable(s) you want to analyze to the Variable(s): box
- For continuous variables with many unique values, consider suppressing the frequency table by clicking Format and selecting “Suppress tables with more than N categories”
Step 4: Request Statistics
Click the Statistics button to open the Frequencies: Statistics subdialog.
In the Dispersion section, check the boxes for:
- ☑ Std. deviation — Standard deviation
- ☑ Variance — Variance
- ☑ Range — Range (max − min)
- ☑ Minimum — Smallest value
- ☑ Maximum — Largest value
You can also select:
- S.E. mean — Standard error of the mean (SD/√n)
Click Continue to return to the main dialog, then click OK to run the analysis.
K.5.2 Interpreting Frequencies Output
The output will contain a Statistics table showing your requested measures of variability.
Reading the Output Table:
- N Valid: Number of cases with non-missing values
- N Missing: Number of cases with missing values
- Std. Deviation: Typical distance from the mean
- Variance: Average squared deviation from the mean
- Range: Difference between max and min
- Minimum: Smallest value in the dataset
- Maximum: Largest value in the dataset
Interpretation Example:
For sprint time with SD = 0.35 s, Variance = 0.126 s², Range = 1.55 s:
- Sprint times typically deviate about 0.35 seconds from the mean
- The variance is 0.126 squared seconds (less interpretable than the SD)
- The fastest and slowest times differ by 1.55 seconds
- This relatively small variability suggests good consistency in sprint performance across participants
K.5.3 Syntax for Frequencies
* Calculate measures of variability using Frequencies.
FREQUENCIES VARIABLES=sprint_20m_s
/STATISTICS=STDDEV VARIANCE RANGE MINIMUM MAXIMUM
/FORMAT=NOTABLE
/ORDER=ANALYSIS.K.6 Method 2: Using Descriptives
The Descriptives procedure provides a compact, efficient way to calculate variability measures for multiple continuous variables.
K.6.1 Step-by-Step Instructions
Step 1: Access the Descriptives Dialog
Navigate to the menu bar and select:
Analyze > Descriptive Statistics > Descriptives
Step 2: Select Your Variable(s)
- From the variable list on the left, select the variable(s) you want to analyze
- Click the arrow button to move them to the Variable(s): box
- You can analyze multiple variables simultaneously
Step 3: Configure Options
Click the Options button to specify which statistics to calculate.
In the Options dialog, check:
- ☑ Mean — Arithmetic average (for context)
- ☑ Std. deviation — Standard deviation
- ☑ Variance — Variance
- ☑ Range — Range
- ☑ Minimum — Smallest value
- ☑ Maximum — Largest value
Click Continue to return to the main dialog, then click OK to run the analysis.
K.6.2 Interpreting Descriptives Output
The output displays a compact table with summary statistics for each variable.
Reading the Output Table:
- N: Number of valid cases
- Range: Difference between maximum and minimum
- Minimum: Lowest value in the dataset
- Maximum: Highest value in the dataset
- Variance: Average squared deviation from the mean
- Std. Deviation: Square root of variance, in original units
Interpretation Example:
For sprint time with N = 60, Range = 1.55, Min = 2.90, Max = 4.45, Variance = 0.13, SD = 0.35:
- 60 participants completed the sprint at baseline
- Times ranged from 2.90 to 4.45 seconds (a 1.55-second range)
- The variance is 0.13 s²
- The standard deviation is 0.35 s, meaning most participants’ times fall within about ±0.35 s of the mean
K.6.3 Syntax for Descriptives
* Calculate variability measures using Descriptives.
DESCRIPTIVES VARIABLES=sprint_20m_s
/STATISTICS=MEAN STDDEV VARIANCE RANGE MIN MAX.K.7 Method 3: Using Explore
The Explore procedure provides the most comprehensive analysis, including variability measures, visual diagnostics, and outlier detection.
K.7.1 Step-by-Step Instructions
Step 1: Access the Explore Dialog
Navigate to the menu bar and select:
Analyze > Descriptive Statistics > Explore
This opens the Explore dialog box.
Step 2: Select Your Variable(s)
- Move the variable(s) you want to analyze to the Dependent List: box
- Optionally, move a grouping variable to the Factor List: box to compare variability across groups
- In the Display section, select Both (for statistics and plots) or choose Statistics or Plots individually
Step 3: Configure Statistics and Plots
Click the Statistics button to select which statistics to display. The default includes descriptive statistics, outliers, and percentiles.
Click the Plots button to select visualizations. By default, Explore produces:
- Boxplots (showing median, quartiles, range, and outliers)
- Stem-and-leaf plots
- Normality plots
Step 4: Run the Analysis
Click OK to execute the analysis. SPSS will generate comprehensive output including tables and graphs.
K.7.2 Interpreting Explore Output
The Explore procedure produces extensive output:
Descriptives Table:
- Mean, median, and other central tendency measures
- Standard deviation, variance, range
- Interquartile range (IQR = 75th percentile − 25th percentile)
- Minimum and maximum values
- Skewness and kurtosis (distribution shape)
Boxplot:
The boxplot visually displays variability:
- The box represents the interquartile range (IQR), containing the middle 50% of data
- The line inside the box is the median
- The whiskers extend to the minimum and maximum values (or to 1.5 × IQR)
- Outliers are shown as individual points beyond the whiskers
Interpretation:
A wide box indicates high variability in the middle 50% of data. Long whiskers indicate extreme values. Outliers appear as circles or asterisks and warrant investigation.
K.7.3 Syntax for Explore
* Comprehensive exploration with variability measures.
EXAMINE VARIABLES=sprint_20m_s
/PLOT BOXPLOT STEMLEAF NPPLOT
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.K.8 Comparing the Three Procedures
| Feature | Frequencies | Descriptives | Explore |
|---|---|---|---|
| Standard Deviation | ✓ Yes | ✓ Yes | ✓ Yes |
| Variance | ✓ Yes | ✓ Yes | ✓ Yes |
| Range | ✓ Yes | ✓ Yes | ✓ Yes |
| IQR | Via percentiles | ✗ No | ✓ Yes |
| Boxplots | ✗ No | ✗ No | ✓ Yes |
| Outlier Detection | ✗ No | ✗ No | ✓ Yes |
| Normality Tests | ✗ No | ✗ No | ✓ Yes |
| Compact Output | ✗ No | ✓ Yes | ✗ No |
| Best For | General analysis | Quick summaries | Comprehensive diagnostics |
K.9 Understanding Variability in Context
K.9.1 Interpreting Standard Deviation
The standard deviation is most meaningful when considered relative to the mean and the context of your measurement:
Small SD relative to mean:
- Data points cluster tightly around the mean
- High consistency or homogeneity
- Example: Sprint times with M = 3.79 s, SD = 0.35 s (CV = 9.3%)
Large SD relative to mean:
- Data points are widely dispersed
- High variability or heterogeneity
- Example: Body mass with M = 73.83 kg, SD = 15.22 kg (CV = 20.6%)
K.9.2 The 68-95-99.7 Rule
For approximately normal distributions:
- About 68% of values fall within ±1 SD of the mean
- About 95% of values fall within ±2 SD of the mean
- About 99.7% of values fall within ±3 SD of the mean
This rule helps you understand what “typical” variability looks like and identify potential outliers.
K.9.3 Comparing Variability Across Groups
When comparing two or more groups, examine both central tendency and variability:
- Same mean, different SD: Groups have the same average but different consistency
- Different mean, same SD: Groups differ in average but have similar variability
- Different mean, different SD: Groups differ in both location and spread
Two training groups might have the same mean improvement (5 seconds), but Group A has SD = 1.0 s (very consistent) while Group B has SD = 4.0 s (highly variable). This suggests Group A’s training produces more predictable results.
K.10 Reporting Results in APA Format
When reporting measures of variability in research papers or assignments, follow APA style guidelines:
K.10.1 Reporting Standard Deviation with the Mean
“The mean 20-m sprint time was 3.79 s (SD = 0.35, N = 60).”
or in a sentence:
“Participants completed the 20-m sprint in an average of 3.79 seconds (SD = 0.35), with times ranging from 2.90 to 4.45 seconds.”
K.10.2 Reporting Range
“Sprint times ranged from 2.90 to 4.45 seconds (range = 1.55 s).”
K.10.3 Reporting Variance
Variance is rarely reported alone in APA format. When needed:
“The variance in sprint times was 0.13 s².”
K.10.4 Reporting Coefficient of Variation
“The coefficient of variation for sprint time was 9.3%, indicating low-to-moderate variability relative to the mean.”
K.10.5 In a Table
| Variable | N | M | SD | Variance | Range | Min | Max |
|---|---|---|---|---|---|---|---|
| Sprint Time (s) | 60 | 3.79 | 0.35 | 0.13 | 1.55 | 2.90 | 4.45 |
| VO₂max (mL·kg⁻¹·min⁻¹) | 60 | 41.34 | 6.82 | 46.48 | 29.60 | 26.90 | 56.50 |
Note. M = mean; SD = standard deviation; Min = minimum; Max = maximum. Pre-training time point (N = 60).
K.10.6 Reporting with Graphical Display
When presenting means and standard deviations graphically, use error bars to show ±1 SD:
“Figure 1 displays mean sprint times (±1 SD) for the training and control groups at baseline.”
Recent recommendations suggest avoiding “dynamite plunger plots” (bar charts with error bars) because they hide the distribution of individual data points. Instead, use dot plots or violin plots that show both summary statistics and the underlying data distribution.
K.11 Sample Dataset for This Tutorial
We will use the Core Dataset from this book, specifically the core_session.csv file. This dataset comes from a mixed neuromuscular training study with recreationally active adults.
Click to download: core_session.csv
For complete information about this dataset, see the Core Dataset Overview and Core Dataset Codebook chapters.
K.11.1 Variables We’ll Analyze
For this tutorial, we’ll focus on these continuous variables:
- sprint_20m_s: 20-meter sprint time in seconds (lower is better)
- vo2_mlkgmin: Aerobic capacity estimate in mL·kg⁻¹·min⁻¹ (higher is better)
- mass_kg: Body mass in kilograms
We’ll primarily use sprint_20m_s at the pre time point to demonstrate calculating measures of variability.
K.12 Practical Example: Complete Workflow
Let’s walk through a complete example using the 20-meter sprint times from the Core Dataset.
K.12.1 Scenario
You have collected 20-meter sprint times from participants at baseline and want to describe the variability in sprint performance to understand how consistent or variable the group is.
K.12.2 Step 1: Import and Prepare the Data
- Download
core_session.csvfrom the link above - Open SPSS and import the file using File > Open > Data
- Save it as
core_session.savin your data folder
Since the dataset contains three time points (pre, mid, post), we need to filter to analyze only the baseline (pre) measurements.
Using Menus:
- Go to Data > Select Cases
- Choose “If condition is satisfied” and click If
- Enter:
time = "pre" - Click Continue, then OK
Using Syntax:
* Select only pre-intervention cases.
SELECT IF (time = "pre").
EXECUTE.K.12.3 Step 2: Calculate Variability Using Descriptives
Since we want a quick summary of variability measures:
- Go to Analyze > Descriptive Statistics > Descriptives
- Move
sprint_20m_sto the Variable(s) box - Click Options
- Check: Mean, Std. deviation, Variance, Range, Minimum, Maximum
- Click Continue, then OK
Syntax:
* Calculate variability measures for sprint time.
DESCRIPTIVES VARIABLES=sprint_20m_s
/STATISTICS=MEAN STDDEV VARIANCE RANGE MIN MAX.K.12.4 Step 3: Interpret the Output
Examine the Descriptive Statistics table. For sprint_20m_s at pre-training (N = 60):
Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation Variance
sprint_20m_s 60 1.55 2.90 4.45 3.79 .354 .126
Valid N (listwise) 60
- N = 60: All 60 pre-training participants
- Range = 1.55 s: Difference between fastest and slowest
- Minimum = 2.90 s: Fastest sprint time
- Maximum = 4.45 s: Slowest sprint time
- Mean = 3.79 s: Average sprint time
- Variance = 0.126 s²: Average squared deviation
- Std. Deviation = 0.354 s: Typical deviation from the mean
K.12.5 Step 4: Calculate Coefficient of Variation
The coefficient of variation isn’t automatically calculated, but you can compute it:
CV = (SD / Mean) × 100 = (0.354 / 3.79) × 100 = 9.3%
This indicates that the standard deviation is about 9.3% of the mean, representing low-to-moderate variability.
K.12.6 Step 5: Create a Boxplot for Visual Assessment
To visualize the variability and identify potential outliers:
- Go to Analyze > Descriptive Statistics > Explore
- Move
sprint_20m_sto the Dependent List - Select Both under Display
- Click OK
Syntax:
* Create boxplot and comprehensive statistics.
EXAMINE VARIABLES=sprint_20m_s
/PLOT BOXPLOT
/STATISTICS DESCRIPTIVES
/MISSING LISTWISE.The boxplot will show:
- The median (line inside the box)
- The IQR (height of the box)
- The range (whiskers)
- Any outliers (individual points)
K.12.7 Step 6: Assess Variability in Context
With SD = 0.354 s and Mean = 3.79 s:
- Most participants’ sprint times fall between 3.44 s and 4.15 s (Mean ± 1 SD)
- The CV of 9.3% indicates low-to-moderate variability
- The range of 1.55 s shows the total spread from fastest to slowest
- The IQR of 0.48 s (Q1 = 3.54 s, Q3 = 4.03 s) confirms the middle 50% of participants are tightly clustered
- This relatively low variability is consistent with a fairly homogeneous sample of recreationally active adults
K.12.8 Step 7: Report the Results
“Twenty-meter sprint times at baseline (N = 60) ranged from 2.90 to 4.45 seconds (M = 3.79 s, SD = 0.35 s). The coefficient of variation was 9.3%, indicating low-to-moderate variability in sprint performance. Visual inspection of the boxplot revealed no extreme outliers, and the distribution appeared approximately symmetric.”
K.13 Common Pitfalls and Troubleshooting
K.13.1 Pitfall 1: Confusing Variance and Standard Deviation
Problem: Reporting variance when standard deviation is more appropriate, or vice versa.
Solution: For descriptive purposes, report the standard deviation because it’s in the same units as your data. Variance is more commonly used in inferential statistics (ANOVA, regression).
K.13.2 Pitfall 2: Ignoring the Scale of Measurement
Problem: Comparing standard deviations across variables with different units or scales.
Solution: Use the coefficient of variation to compare variability across different measures. CV normalizes the SD by the mean, allowing meaningful comparisons.
K.13.3 Pitfall 3: Overlooking Outliers
Problem: Reporting variability measures without checking for outliers that may inflate the SD and range.
Solution: Always create a boxplot or use the Explore procedure to identify outliers. Investigate whether outliers are data entry errors or legitimate extreme values.
K.13.4 Pitfall 4: Misinterpreting Small vs. Large SD
Problem: Not considering the SD relative to the mean or the context of measurement.
Solution: Calculate the coefficient of variation. An SD of 5 might be large for a variable with a mean of 10 (CV = 50%) but small for a variable with a mean of 100 (CV = 5%).
K.13.5 Pitfall 5: Forgetting Degrees of Freedom
Problem: Manually calculating variance or SD using n instead of n−1.
Solution: SPSS automatically uses the sample formula (n−1) for unbiased estimates. If you’re calculating by hand, remember to use n−1 for sample statistics.
K.13.6 Pitfall 6: Reporting Only Central Tendency
Problem: Describing data with only the mean, without reporting variability.
Solution: Always report both central tendency and variability. Two groups can have identical means but vastly different SDs, leading to different interpretations.
K.14 Practice Exercise
Use the Core Dataset to practice calculating measures of variability for a different variable.
K.14.1 Practice Data
Use the same core_session.csv file you downloaded earlier. For this exercise, you’ll analyze aerobic capacity (VO₂) at baseline.
K.14.2 Tasks
- Filter the data to include only pre-intervention measurements (
time = "pre") - Calculate variability measures using the Descriptives procedure for the
vo2_mlkgminvariable - Calculate the coefficient of variation manually from the output
- Create a boxplot using the Explore procedure to visualize variability and identify outliers
- Compare variability between sprint time and VO₂ using the coefficient of variation
- Write a brief results paragraph in APA format reporting your findings
K.15 Quick Reference: Syntax Templates
K.15.1 Frequencies (Variability Measures)
FREQUENCIES VARIABLES=variable_name
/STATISTICS=STDDEV VARIANCE RANGE MINIMUM MAXIMUM
/FORMAT=NOTABLE
/ORDER=ANALYSIS.K.15.2 Descriptives (Variability Measures)
DESCRIPTIVES VARIABLES=variable_name
/STATISTICS=MEAN STDDEV VARIANCE RANGE MIN MAX.K.15.3 Explore (Comprehensive Analysis)
EXAMINE VARIABLES=variable_name
/PLOT BOXPLOT STEMLEAF
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE.K.15.4 Multiple Variables
DESCRIPTIVES VARIABLES=var1 var2 var3
/STATISTICS=MEAN STDDEV VARIANCE RANGE MIN MAX.K.15.5 By Group (Split File)
SORT CASES BY group.
SPLIT FILE LAYERED BY group.
DESCRIPTIVES VARIABLES=variable_name
/STATISTICS=MEAN STDDEV VARIANCE RANGE MIN MAX.
SPLIT FILE OFF.K.16 Readiness Checklist
Before moving on, ensure you can:
K.17 Additional Resources
- SPSS Help: Press F1 within SPSS for context-sensitive help
- IBM SPSS Documentation: Comprehensive guides available through IBM’s website
- APA Publication Manual (7th ed.): For detailed reporting guidelines
- Course Textbook: Review the chapter on variability for conceptual understanding
Once you’re comfortable calculating measures of variability, the next tutorial will cover the normal distribution and standard scores (z-scores), which build on your understanding of the mean and standard deviation to describe relative positions within a distribution.





