3 Displaying and Visualizing Data
How graphs reveal patterns, problems, and meaning in Movement Science data
For a practical guide on how to create the graphs mentioned in this chapter, refer to Appendix J in the SPSS tutorials section.
3.1 Chapter roadmap
Graphs are not decoration. In Movement Science, visualization is one of the fastest ways to understand what your data are doing, whether your measurement process behaved as expected, and whether an analysis plan makes sense. Tables can tell you an average sprint time. A good plot can reveal skew, outliers, clusters, fatigue patterns across trials, or group differences that change over time.
By the end of this chapter, you will be able to:
- Choose a graph that matches your question and variable types.
- Create and interpret a broad set of core plots used in Movement Science.
- Use visualization for data screening before inferential testing.
- Explain how graph design choices shape interpretation and can mislead.
Any time you are unsure what to do next, use this sequence:
- Clarify the question (distribution, comparison, relationship, or change over time)
- Identify variable types (categorical, quantitative, repeated measures)
- Start simple (show raw patterns before adding summaries)
- Check for problems (outliers, strange spikes, unit mix-ups)
- Add context (group, condition, time) only if it helps answer the question
- State the takeaway in one sentence, including uncertainty and limitations
3.2 Why visualization matters in Movement Science
Visualization is a form of reasoning. You are using your visual system to detect structure that is difficult to see in a spreadsheet. This matters because Movement Science datasets often contain features that can quietly break assumptions or distort conclusions.
Common patterns that are easier to see than to compute:
- Skew and heavy tails in variables like EMG amplitude and sway area
- Trial-by-trial patterns from learning or fatigue
- Group differences that change across time points
- Clusters that create misleading correlations
- Ceiling effects on bounded scales such as function scores
A good habit is to treat graphs as part of quality control. If the plot looks surprising, do not immediately blame the participants. First ask whether the data structure, coding, units, protocol, or device could explain the pattern.
Before running inferential tests, you should have looked at the distribution of your main outcomes and the structure of change over time. Many analysis problems are visible in a plot before they show up in output tables.
3.3 Start with the question, not the graph
Students often ask, “Which graph should I use?” A better question is, “What am I trying to learn from the data?” Most visualization decisions can be reduced to four question types.
- What values occur and how often (distribution)
- Do groups or conditions differ (comparison)
- Are two variables related (relationship)
- How does something change over time or across trials (change)
3.4 Match the graph to the variable type
You can choose a sensible first plot by matching the question to the variables.
| Question type | Typical variables | First-choice plots |
|---|---|---|
| Distribution | one variable | histogram, boxplot, dot plot |
| Comparison | outcome plus group or condition | dot plot with summary, side-by-side boxplots |
| Relationship | two quantitative variables | scatterplot, grouped scatterplot |
| Change | repeated measures | paired plot, spaghetti plot, line graph |
| Trial sequence | trial index plus outcome | trial-by-trial line plot, spaghetti-by-trial |
When sample sizes are small or moderate, start with a dot plot for comparisons and a paired plot for change. Then add summaries, not the other way around.
3.5 Visualizing categorical variables
Categorical variables represent membership in a group or category. Examples include intervention group, sport, injury status, or condition label. The most common graphs are bar charts and grouped bar charts.
3.5.1 Bar charts: counts and percentages
Counts are best for describing your sample. Percentages are best for comparing groups of different sizes. Percentages without sample sizes can be misleading, so good reporting usually includes sample size information in text or captions.
3.5.2 Grouped bar charts for comparisons
If you want to compare categories across groups, grouped bars are often clearer than stacked bars because they share a common baseline.
Stacked bar charts can make it hard to compare categories because most segments do not share a common baseline. Grouped bars are often clearer when your goal is comparison.
3.6 Visualizing distributions of quantitative variables
A distribution plot shows where values are located, how spread out they are, and what shape they form. In Movement Science, shape matters because many variables are not symmetric.
3.6.1 Histograms
Histograms group values into bins and show frequency. They are excellent for understanding shape, but bin width can change the appearance. A useful approach is to view more than one bin width and ask whether the interpretation is stable.
- Symmetry versus skew
- Multiple peaks suggesting subgroups
- Extreme values that look implausible
- Spikes at round numbers suggesting rounding
3.6.2 Boxplots
Boxplots summarize the distribution using the median and interquartile range. They are excellent for comparisons across groups or conditions, but they can hide raw patterns in small samples. If the sample is small, overlay points or use dot plots.
3.6.3 Dot plots as a default companion
Dot plots show each observation. They are especially valuable in Movement Science because sample sizes are often not huge and individual variability is scientifically meaningful.
3.7 Comparing groups and conditions
When your question is “do groups differ,” you want to see separation between groups and variability within groups.
Good first-choice plots include:
- dot plot with a summary overlay
- side-by-side boxplots, preferably with points overlaid
Summary-only bar charts of means can hide important structure, so they should be treated as a secondary view rather than a default.
3.8 Visualizing relationships: scatterplots
A scatterplot is the default plot for two quantitative variables. It helps you see direction, form, and strength. It also helps you see whether the relationship is nonlinear or driven by clusters.
If training participants cluster in one region of a scatterplot and control participants cluster in another, the overall trend can look strong even if there is little relationship within each group. When group membership might matter, visualize with group separation or use different symbols.
3.9 Visualizing change over time and across trials
Movement Science studies often focus on change: learning, adaptation, fatigue, or recovery. These questions require plots that respect repeated measures.
3.9.1 Paired plots and slopegraphs
For two time points, a paired plot shows each person’s change and makes within-person response patterns visible.
3.9.2 Spaghetti plots and line graphs
For multiple time points, spaghetti plots show individual trajectories. Line graphs of group means can summarize trends, but they should not hide individual response variation when the sample is not large.
3.9.3 Trial-by-trial plots
Trial-by-trial plots show within-session patterns. These plots are essential when learning or fatigue might occur. Averaging across trials without inspecting trial order can remove meaningful patterns.
3.10 Visualization as data screening and quality control
Visualization is one of the simplest ways to detect data problems. Before inferential testing, your plots should help you answer: does this dataset look plausible?
Common visual clues include impossible values, unit mix-ups, rounding artifacts, outliers that may be errors, and missingness patterns that differ by group or time.
3.11 How graphs influence interpretation
Graphs shape what the viewer notices, what looks important, and what feels convincing. That makes visualization a scientific responsibility.
Axis scaling can exaggerate small differences, especially in bar charts. Aspect ratio can make trends look steeper or flatter. Overplotting can hide where data concentrate. Decorative elements can reduce comprehension. The ethical goal is not persuasion. The goal is clarity.
A scientific plot should help a reader understand the data, not persuade them to agree with you.
3.12 Choosing the right graph for the question
3.13 Worked example using the Core Dataset
The Core Dataset includes training and control groups measured at pre, mid, and post. Some outcomes are session-level (sprint time, VO₂, pain, function). Some outcomes are trial-level (jump height, peak force, EMG amplitude, sway area). The same dataset can answer multiple questions, and each question suggests a different plot.
Distribution question: What does the distribution of sway area look like at pre?
Comparison question: Do training and control differ in sprint time at pre?
Relationship question: Is peak force related to sprint time?
Change question: Do participants improve from pre to post, and do groups change differently?
Within-session question: Do trial values show a pattern from trial 1 to trial 3?
The key point is that plots are aligned with questions. If you try to force one “favorite graph” onto every question, you will either hide important information or overcomplicate the presentation.
3.14 Chapter summary
Visualization is a core tool for Movement Science statistics. Good plots reveal distribution shape, variability, outliers, group differences, relationships, and change over time. This chapter introduced a workflow for choosing graphs based on the question and variable types, and it emphasized that graph design choices can influence interpretation in ways that are scientifically meaningful.
3.15 Key terms
distribution; histogram; bin width; boxplot; dot plot; bar chart; grouped bar chart; line graph; paired plot; slopegraph; spaghetti plot; scatterplot; outlier; overplotting; axis scaling; aspect ratio
3.16 Practice: quick checks
- You collect EMG amplitude data and the histogram shows a long right tail. What summaries and plots would you use to describe it responsibly?
- A boxplot shows several outliers for peak force. List two reasons those points could be real and two reasons they could be errors.
- A scatterplot of peak force versus sprint time shows a strong trend, but the points form two clusters. Give two plausible explanations and describe how you would visualize the data to investigate.
- You have pre and post function scores for each participant. Which plot best shows individual change, and what question does it answer that a bar chart cannot?
- You plot trial-by-trial jump height and see trial 1 is consistently lower than trials 2 and 3. What does that imply about using mean of trials as a summary?
The next chapter introduces percentiles and standard scores, which are widely used in testing, screening, and clinical interpretation.