Appendix P — SPSS Tutorial: Confidence Intervals

Computing and interpreting confidence intervals for means, differences, and proportions

Learning Objectives

By the end of this tutorial, you will be able to:

Compute confidence intervals for a single mean using SPSS
Interpret confidence interval output in the context of Movement Science research
Calculate confidence intervals for the difference between two independent means
Compute confidence intervals for paired (dependent) samples
Understand how sample size affects confidence interval width
Visualize confidence intervals using error bar plots
Report confidence intervals following APA guidelines

P.1 Overview

Confidence intervals (CIs) are essential for quantifying uncertainty and communicating the precision of estimates. While point estimates (e.g., sample means) provide single “best guess” values, confidence intervals acknowledge that sample-based estimates vary due to sampling error. A 95% confidence interval provides a range of plausible values for the population parameter, constructed such that if we repeated the study many times, approximately 95% of the intervals would contain the true population value.

SPSS provides confidence intervals automatically in many statistical procedures, but understanding how to access, interpret, and visualize them is critical for responsible statistical practice. This tutorial demonstrates:

How to obtain 95% and 99% confidence intervals
How to interpret CI width as a measure of precision
How to use CIs for assessing statistical and practical significance
How to create publication-ready error bar plots

Prerequisites: Familiarity with SPSS data entry, descriptive statistics, and basic data management.

P.2 Dataset for this tutorial

We will use the Core Dataset (core_session.csv), filtered to pre-training (N = 60) unless otherwise noted. Download it here: core_session.csv

Key variables used in this tutorial:

sprint_20m_s — 20-meter sprint time in seconds
vo2_mlkgmin — VO₂max (mL·kg⁻¹·min⁻¹)
group — training vs. control

P.3 Part 1: Confidence interval for a single mean

Let’s start with the most basic scenario: computing a 95% confidence interval for a population mean based on a single sample.

P.3.1 Example: Mean VO₂max at baseline

We measure aerobic capacity (vo2_mlkgmin) in 60 participants at baseline and want to estimate the population mean with a confidence interval.

P.3.2 Procedure: Using Explore

The simplest way to obtain a confidence interval for a mean is through the Explore procedure.

Analyze → Descriptive Statistics → Explore…
Move your variable (e.g., vo2_mlkgmin) to Dependent List
Click Statistics…
- ✓ Check Descriptives (if not already selected)
- Confidence Interval for Mean: 95% (default)
Continue
Display: Select Statistics
OK

P.3.3 Interpreting the output

SPSS will produce a table labeled Descriptives that includes:

Descriptives
                               Statistic    Std. Error
vo2_mlkgmin  Mean              41.340       0.880
             95% Confidence    Lower Bound  39.579
             Interval for Mean Upper Bound  43.101
             Median            41.550
             Variance          46.477
             Std. Deviation    6.817
             Minimum           26.9
             Maximum           56.5
             Range             29.6
             Interquartile Range 9.8
             Skewness          -0.040       0.309
             Kurtosis          -0.563       0.608

Key information:

Mean = 41.34 mL·kg⁻¹·min⁻¹: Point estimate of population mean
95% CI [39.58, 43.10]: We are 95% confident that the true population mean VO₂max lies between 39.58 and 43.10 mL·kg⁻¹·min⁻¹
Std. Error = 0.880: Standard error of the mean (SE = SD / √n = 6.817 / √60)

Interpretation in context

Report: “Mean VO₂max was 41.34 mL·kg⁻¹·min⁻¹, 95% CI [39.58, 43.10], based on 60 participants at pre-training.”

The interval width (3.52 mL·kg⁻¹·min⁻¹) reflects moderate precision given the relatively high variability (SD = 6.82).

P.3.4 Changing the confidence level

To compute a 99% confidence interval (wider, more conservative):

Analyze → Descriptive Statistics → Explore…
Statistics button
Change Confidence Interval for Mean to 99
Continue → OK

The 99% CI will be wider than the 95% CI because we are demanding greater certainty.

Example output (approximate):

95% CI: [39.58, 43.10] (width = 3.52)
99% CI: [38.99, 43.69] (width = 4.70)

Notice the interval is wider with 99% confidence, reflecting the trade-off between confidence level and precision.

Why use 99% vs 95%?

95% CI is standard in most research.
99% CI is appropriate when greater certainty is required (e.g., safety-critical applications, exploratory research with high false-positive risk).
90% CI may be used in exploratory or preliminary studies where narrower intervals are acceptable.

P.4 Part 2: Confidence intervals for independent samples (two groups)

When comparing two independent groups (e.g., training vs. control), we want a confidence interval for the difference in means.

P.4.1 Example: Comparing sprint time between training and control groups

We compare 20-m sprint time (sprint_20m_s) between the training (n = 30) and control (n = 30) groups at pre-training.

P.4.2 Procedure: Independent-Samples T Test

SPSS produces confidence intervals for mean differences through the Independent-Samples T Test procedure.

Analyze → Compare Means → Independent-Samples T Test…
Move sprint_20m_s to Test Variable(s)
Move group to Grouping Variable
Click Define Groups… and enter control and training
Continue → OK

P.4.3 Interpreting the output

SPSS will produce two tables:

Group Statistics:

Group Statistics
              group      N    Mean   Std. Deviation   Std. Error Mean
sprint_20m_s  control   30   3.811  .340             .062
              training  30   3.772  .373             .068

Independent Samples Test:

Independent Samples Test
                                        t       df    Sig.     Mean        95% CI of the Difference
                                                             Difference    Lower      Upper
sprint_20m_s   Equal variances assumed  0.429   58    .669     .039        -.145      .224

Key information:

Mean difference = 0.039 s: Groups differ by less than 1/25 of a second on average
95% CI [−0.145, 0.224]: The CI includes zero, indicating no significant difference at α = 0.05
p = .669: Consistent with the CI conclusion

Practical interpretation

The confidence interval tells us:

Statistical significance: Because zero is in the interval, we conclude the two groups did not differ significantly at baseline.
Effect magnitude: The true difference is plausibly anywhere from −0.145 s to +0.224 s — both very small.
Precision: The interval width (0.369 s) is moderate; larger samples would narrow it further.

P.4.4 Reporting

“Sprint time at baseline did not differ significantly between control (M = 3.81 s, SD = 0.34) and training groups (M = 3.77 s, SD = 0.37), mean difference = 0.04 s, 95% CI [−0.15, 0.22], t(58) = 0.43, p = .669.”

P.5 Part 3: Confidence intervals for paired samples (within-subjects)

For paired or repeated-measures designs (e.g., pre-test vs. post-test), we use Paired-Samples T Test.

P.5.1 Example: Pre-post sprint time comparison

We compare sprint time at pre-training vs. post-training across all 55 participants with complete data.

P.5.2 Procedure: Paired-Samples T Test

Analyze → Compare Means → Paired-Samples T Test…
Select the two variables (e.g., sprint_pre and sprint_post) and move them to Paired Variables
Click Options… → Continue
OK

P.5.3 Interpreting the output

Paired Samples Statistics:

Paired Samples Statistics
                 Mean     N     Std. Deviation   Std. Error Mean
Pair 1  sprint_pre  3.802   55    .365             .049
        sprint_post 3.792   55    .402             .054

Paired Samples Test:

Paired Samples Test
                                 Mean      Std.        95% CI of the Difference      t       df    Sig.
                                 Difference Deviation  Lower       Upper
Pair 1  sprint_pre - sprint_post  .010      .158       -.033       .053             0.469   54    .641

Key information:

Mean difference = 0.010 s: Sprint time barely changed from pre to post
95% CI [−0.033, +0.053]: CI includes zero; change is non-significant
The CI excludes practically meaningful change, confirming minimal intervention effect on sprint time across the full sample

Reporting paired comparisons

“Sprint time did not change significantly from pre-training (M = 3.80 s, SD = 0.37) to post-training (M = 3.79 s, SD = 0.40), mean change = 0.01 s, 95% CI [−0.03, 0.05], t(54) = 0.47, p = .641.”

The confidence interval shows the true population change is plausibly between −0.033 and +0.053 seconds — practically negligible in either direction.

P.6 Part 4: Visualizing confidence intervals with error bars

Error bar plots are effective for displaying means with confidence intervals across groups or conditions.

P.6.1 Procedure: Creating error bar plots

Graphs → Legacy Dialogs → Error Bar…
Select Simple and Define
Bars Represent: Select Confidence interval for mean
- Level (%): 95 (or desired confidence level)
Move your continuous variable (e.g., JumpHeight) to Variable
Move your grouping variable (e.g., TrainingGroup) to Category Axis
OK

P.6.2 Example output

SPSS will produce a plot with:

Group means displayed as points or bars
Error bars extending ±95% CI from each mean
Y-axis labeled with the variable name and units

Interpreting error bar plots

Non-overlapping error bars suggest a statistically significant difference (though formal testing is required).
Wide error bars indicate high uncertainty/low precision.
Narrow error bars indicate high precision (large sample, low variability).

Caution: Overlapping CIs do not necessarily mean “no difference.” Always examine the confidence interval for the difference, not just overlap of individual group CIs.

P.6.3 Enhancing plots for publication

To improve error bar plots:

Double-click the chart to enter Chart Editor
Elements → Show Data Labels (optional, for displaying exact means)
Edit → Properties to customize colors, fonts, and axis labels
Export as high-resolution image (File → Export)

P.7 Part 5: Confidence intervals for proportions

While SPSS does not provide confidence intervals for proportions through standard menus, you can compute them manually or use syntax.

P.7.1 Manual calculation example

Suppose 18 out of 50 recreational runners report an injury during a season.

Sample proportion: \(\hat{p} = 18/50 = 0.36\)

95% CI: \(\hat{p} \pm 1.96 \times \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\)

\[ 0.36 \pm 1.96 \times \sqrt{\frac{0.36 \times 0.64}{50}} = 0.36 \pm 0.133 = [0.227, 0.493] \]

Interpretation: We are 95% confident that the true injury rate among recreational runners is between 22.7% and 49.3%.

P.7.2 Using SPSS syntax for proportions

COMPUTE p = 18/50.
COMPUTE n = 50.
COMPUTE SE = SQRT((p * (1 - p)) / n).
COMPUTE CI_lower = p - 1.96 * SE.
COMPUTE CI_upper = p + 1.96 * SE.
EXECUTE.

Add these commands to a syntax file and run to obtain CI bounds.

P.8 Part 6: Understanding factors affecting confidence interval width

P.8.1 Effect of sample size

To see how sample size affects CI width:

Split your dataset into subsets of different sizes (e.g., n = 10, n = 30, n = 100)
Compute 95% CIs for each subset using Explore
Compare interval widths

Expected result: Larger samples produce narrower CIs.

P.8.2 Effect of confidence level

Compute CIs at different confidence levels (90%, 95%, 99%) for the same dataset:

90% CI: Narrowest
95% CI: Moderate width
99% CI: Widest

Trade-off: Higher confidence requires wider intervals.

P.8.3 Effect of variability

Compare CIs for variables with different standard deviations:

Higher SD → wider CI (more sampling error)
Lower SD → narrower CI (less sampling error)

Key takeaways

Sample size has the greatest practical impact on precision. Larger samples always improve precision.
Confidence level is a choice reflecting desired certainty. 95% is standard but not mandatory.
Variability is inherent in the population and cannot be controlled, but precise measurement tools can reduce it.

P.9 Part 7: Common mistakes and troubleshooting

P.9.1 Mistake 1: Interpreting CIs as “95% of data fall in this range”

Incorrect: “95% of participants have jump heights between 55 and 61 cm.”

Correct: “We are 95% confident that the population mean jump height is between 55 and 61 cm.”

The CI estimates the population parameter (mean), not the distribution of individual observations.

P.9.2 Mistake 2: Concluding “no difference” from overlapping CIs

Two groups with overlapping 95% CIs may still differ significantly. Always compute the CI for the difference using Independent-Samples T Test.

P.9.3 Mistake 3: Reporting only “significant” without the CI

Report: “Mean difference = 4.3 cm, 95% CI [2.5, 6.1], p < .001”

Not: “p < .001” alone.

P.9.4 Mistake 4: Using z-values instead of t-values for small samples

For small samples (n < 30), use the t-distribution (automatic in SPSS procedures). Using z-values (1.96) underestimates uncertainty.

P.10 Part 8: Exporting and reporting confidence intervals

P.10.1 APA-style reporting template

“[Group 1 description] (M = [mean], SD = [SD], n = [n]) [verb] [Group 2 description] (M = [mean], SD = [SD], n = [n]), mean difference = [diff], 95% CI [lower, upper], t([df]) = [t-value], p = [p-value].”

Example:

“Sprint time at baseline did not differ significantly between control (M = 3.81 s, SD = 0.34, n = 30) and training groups (M = 3.77 s, SD = 0.37, n = 30), mean difference = 0.04 s, 95% CI [−0.15, 0.22], t(58) = 0.43, p = .669.”

P.10.2 Creating a summary table

You can copy-paste SPSS output into Word or Excel, or create a custom table:

Variable	n	Mean (SD)	95% CI
VO₂max (mL·kg⁻¹·min⁻¹)	60	41.34 (6.82)	[39.58, 43.10]
Sprint Time (s) — Control	30	3.81 (0.34)	[3.68, 3.94]
Sprint Time (s) — Training	30	3.77 (0.37)	[3.63, 3.91]
Sprint Difference	—	0.04	[−0.15, 0.22]

Note. Pre-training time point.

P.11 Summary

This tutorial covered:

Computing 95% and 99% CIs for single means using Explore
Obtaining CIs for independent-samples comparisons using Independent-Samples T Test
Computing CIs for paired-samples comparisons using Paired-Samples T Test
Visualizing CIs using Error Bar plots
Understanding how sample size, confidence level, and variability affect CI width
Avoiding common misinterpretations of confidence intervals

Confidence intervals are indispensable for responsible statistical reporting. Always report CIs alongside point estimates to communicate uncertainty and precision honestly.

Next steps

Practice computing CIs on your own datasets
Experiment with different confidence levels to see the trade-off between width and certainty
Use error bar plots in presentations and manuscripts to visualize uncertainty
Consult Chapter 9 of the textbook for deeper conceptual understanding of confidence intervals

P.12 Additional resources

SPSS manuals: IBM SPSS Statistics Base documentation
APA Style (7th ed.): Guidelines for reporting confidence intervals
Cumming, G. (2014). Understanding the New Statistics: Comprehensive guide to confidence intervals and effect sizes
Textbook website: Download practice datasets and syntax files

Questions or issues? Refer to the textbook’s online support forum or consult your instructor.