Data presentation in statistics

Data presentation in statistics is done using tables, charts, diagrams, or graphs and is part of the data analysis and reporting phase of the data analysis life cycle. Visual data presentation helps data analysists identify and understand relationships between and within data sets to inform science-based decision making. This post constitutes Lesson 2 of the Basic Statistics Mini-Course.

You may be also interested in Measures of central tendency.

Key concepts covered in this post: frequency distributions and histograms, graphical representations of data, comparing graphs, stem and leaf plots.

Frequency distributions and histograms

Collected data can be organized in a frequency table and displayed in a histogram to make it easy to notice important features or patterns in the data.

For example, we’re trying to know how far a turtle can travel in half an hour. The distances traveled by 50 turtles in half an hour are recorded rounded to the nearest centimeter. 

2826473893855996325
4810633275461114605851
21514193531115398666
121497546297126197353
35239784567560593246

Often, collected data are grouped into suitable intervals before displaying them in a frequency table and then in a histogram. 

When grouping data 1) choose intervals of the same length, 2) try to keep the number of intervals between 5 and 20 (to help maintain clarity), and 3) do not leave gaps between the bars of the histogram (Google Spreadsheets may not allow you to completely remove the gaps between the bars).

Suppose we find that the smallest distance traveled by a turtle in half an hour was 2 cm and the largest was 121 cm, a spread covering 120 numbers. For the purpose of this illustration, 10 intervals of 12 can be used to organize the data in a frequency table.

Distances (cm)Frequency
2-142
14-265
26-386
38-507
50-6212
62-746
74-864
86-984
98-1102
110-1222

Use the middle number of an interval to label the horizontal axis. In this example, the most common distance traveled by a turtle in half an hour was 56 cm.

Data presentation: Histogram

Graphical representations of data

Choose the type of graph that best explains the results.

Pie charts (circle graphs) show how something is divided (a breakdown of the parts of the whole), e.g., how the government spends $1.00 of tax revenue;

Broken line graphs visualize how something changes over time, e.g., the changing value of the dollar over time;

Bar charts compare things that are alike, e.g., the heights of the mountains of North America;

Pictographs (pictos) show information using images (present the information visually), e.g., a visualization of Canada’s exports.

Comparing graphs

Businesses can plot their expenses and revenues on the same axis (Y axis) over the years (x axis) to help them analyze their financial position. The same scale must be used for both sets of data. Where the revenue line and the expense line cross is the break-even point (when the amount earned is equal to the amount spent). 

Stem and leaf plots

A stem and leaf plot looks like a histogram on its side. But unlike histograms, stem and leaf plots retain the original data to at least two significant digits and order the data to facilitate order-based inference and non-parametric statistics.

The first part of a number is the stem and the second part is the leaf, e.g., 48.6 could be written with 48 as the stem and 6 as the leaf.

Stem and leaf displays present quantitative data in a graphical format to assist in visualizing the shape of a distribution. 

The stem is used to group the scores and each leaf shows the individual scores within each group. You can see how the results cluster (leaves around stems).

See some examples at https://www.mathsisfun.com/data/stem-leaf-plots.html

Next post: Part 3: Measures of central tendency 

Key concepts: Mean, median, mode, the effect of distribution on measures of central tendency.

Back to Basic Statistics Mini-Course

Back to DTI Courses

Related content

Analyzing data spread

Bivariate statistics or two variable statistics

Data collection in statistics

Measures of central tendency

Normal approximation to the binomial distribution

Normal distribution or Gaussian distribution

What is data mining?

Other content

1st Annual University of Ottawa Supervisor Bullying ESG Business Risk Assessment Briefing

Disgraced uOttawa President Jacques Frémont ignores bullying problem

How to end supervisor bullying at uOttawa

PhD in DTI uOttawa program review

Rocci Luppicini – Supervisor bullying at uOttawa case updates

The case for policy reform: Tyranny

The trouble with uOttawa Prof. A. Vellino

The ugly truth about uOttawa Prof. Liam Peyton

uOttawa engineering supervisor bullying scandal

uOttawa President Jacques Frémont ignores university bullying problem

uOttawa Prof. Liam Peyton denies academic support to postdoc

Updated uOttawa policies and regulations: A power grab

What you must know about uOttawa Prof. Rocci Luppicini

Why a PhD from uOttawa may not be worth the paper it’s printed on

Why uOttawa Prof. Andre Vellino refused academic support to postdoc

Supervisor Bullying

Text copying is disabled!