Visualizing Data Part 1

In previous sections, we discussed how to calculate the mean and its various applications. Here, we will learn more intermediate concepts involved with mean and other statistical measures. That is, we’ll introduce the various ways you can calculate mean in the real world, as well as how to visualize and interpret the mean.

The best Maths tutors available
1st lesson free!
Ayush
5
5 (27 reviews)
Ayush
£90
/h
1st lesson free!
Intasar
4.9
4.9 (23 reviews)
Intasar
£42
/h
1st lesson free!
Dr. Kritaphat
4.9
4.9 (6 reviews)
Dr. Kritaphat
£39
/h
1st lesson free!
Matthew
5
5 (17 reviews)
Matthew
£25
/h
1st lesson free!
Paolo
4.9
4.9 (11 reviews)
Paolo
£25
/h
1st lesson free!
Petar
4.9
4.9 (9 reviews)
Petar
£27
/h
1st lesson free!
Myriam
5
5 (15 reviews)
Myriam
£20
/h
1st lesson free!
Andrea
5
5 (12 reviews)
Andrea
£40
/h
1st lesson free!
Ayush
5
5 (27 reviews)
Ayush
£90
/h
1st lesson free!
Intasar
4.9
4.9 (23 reviews)
Intasar
£42
/h
1st lesson free!
Dr. Kritaphat
4.9
4.9 (6 reviews)
Dr. Kritaphat
£39
/h
1st lesson free!
Matthew
5
5 (17 reviews)
Matthew
£25
/h
1st lesson free!
Paolo
4.9
4.9 (11 reviews)
Paolo
£25
/h
1st lesson free!
Petar
4.9
4.9 (9 reviews)
Petar
£27
/h
1st lesson free!
Myriam
5
5 (15 reviews)
Myriam
£20
/h
1st lesson free!
Andrea
5
5 (12 reviews)
Andrea
£40
/h
First Lesson Free>

Calculating Mean in the Real World

If you open any book or website on beginner’s statistics, one of the first concepts you will encounter is the mean. In fact, finding the average of a data set is one of the first statistical concepts we learn in elementary school. However basic the mean may seem, it’s actually one of the most powerful concepts statistics has to offer.Averages are used in everything from economics to sports entertainment. In fact, it may just be the most popular statistic used on a daily basis. The reason for this popularity can be found in its simplicity. Everyone can understand what an average is and how it’s calculated. The average strives to represent the most “typical” value of a data set and can be a quick way of understanding a large amount of observations.

In economics and national accounting, the mean is used in order to compare information about the economy and finances across years or countries. Some popular indicators using mean include:

  • Average wage
  • Mean income

If you’re interested in comparing the standards of living across country, you are most likely to use indicators such as mean income and average rent costs. Averages can even be found in the burgeoning field of esports, where players can follow indicators such as average competitive rank, average player rating, and more.

However, simple averages can often hide hidden truths within a data set. As we explored in earlier sections, sometimes using other measures of central tendency are better than using the mean. To recap, the general rules of thumb for deciding whether to use mean or median can be summarized in the following:

  • If your data are approximately normally distributed, the mean is representative
  • If your data have a significant amount of outliers or some skew, the median is preferred

Just like when we’re deciding what measure of central tendency we should use over another, we should also take an equally as deliberate and insightful approach to applying means to our research. In the following problem, you’ll understand why using a simple mean can sometimes be inadequate.

Problem 1: Mean by Variable

There are many reasons why you may want to dig deeper into a particular average. Some examples of variables you may want to explore are:

  • Average time spent on the internet by age
  • Average wage by occupation
  • Average response by survey method
Here, you will be asked to find the mean by variable. In the table below, women’s’ times for a 100 meter sprint have been recorded for various countries. Find the mean for the entire data set, then find the average time by country. Keep in mind that this is a fictitious data set, meant for instructional purposes only.Afterwards, check your answers with our solution below.

Observation Country Women 100 m Sprint Time
1 America 9.7
2 America 10.1
3 America 9.9
4 America 11.1
5 America 9.6
6 Canada 11.8
7 Canada 10.4
8 Canada 11.3
9 Canada 10.0
10 Canada 10.5
11 Jamaica 9.5
12 Jamaica 9.5
13 Jamaica 9.3
14 Jamaica 10.0
15 Jamaica 11.2
16 France 9.9
17 France 9.9
18 France 10.9
19 France 10.6
20 France 9.0
21 Nigeria 9.7
22 Nigeria 10.4
23 Nigeria 9.5
24 Nigeria 9.9
25 Nigeria 11.4

 

Solution Problem 1: Mean by Variable

In this problem, you were presented with two tasks:

  • Find the mean for the data set
  • Find the mean for each country

First, we calculate the mean for the entire data set. This is a task we’ve done before, simply using the equation for finding sample means.

    \[ s = \dfrac{255.2}{25} \]

    \[ =10.2 \]

Now, in order to find the mean by country, we simply take the average only of the times corresponding to a particular country. The sample size is 5 for each country.

America

    \[ s = \dfrac{50.4}{5} =10.1 \]

Canada

    \[ s = \dfrac{54}{5} =10.8 \]

Jamaica

    \[ s = \dfrac{49.5}{5} =9.9 \]

France

    \[ s = \dfrac{50.3}{5} =10.1 \]

Nigeria

    \[ s = \dfrac{50.9}{5} =10.2 \]

Finding the mean for each country tells us important information. The fastest average sprint time for the 100 meter spring for women is in Nigeria. This slowest average time is held by Canada. Here, we can see how finding the mean of each category, instead of just the mean of the entire data set, reveals the performance of each individual country.

Visualizing Mean

In the previous sections, you encountered different types of visualizations. Visualizations come in all different types and sizes, some of the most common and basic ones including:

  • Bar charts
  • Pie charts
  • Histograms
  • Scatterplots

As you’ve seen, there are many different ways to convey data. Often, reporting numbers can be cumbersome and can be equally as difficult to read and comprehend. Visualizations allow us the ability to express information about our data efficiently and, simultaneously, report any patterns we may find.

You may be wondering in what cases we might want to visualize something like the mean. Often, plotting the mean into some type of chart can help us make comparisons across time, between countries or between variables. One real-life example is a visualization you may have seen before: average rainfall by month.

Being able to record and plot this data in a bar chart can be more helpful in revealing patterns we may otherwise not have noticed or may have had trouble understanding. Plotting average rainfall by month helps us notice seasonal patterns, which are common in a time series, or a data set listed in order of time.

Sometimes, plotting measures can be the quickest way to understand your data or to supplement your understanding of specific measures, like the average. In the next problem, you’ll be able to see why.

 

Problem 2: Visualizing the Mean

Sometimes, plotting the mean in a chart or graph can be a better way of communicating the mean. Often, however, the hardest part about displaying data is choosing which type of visual to use.

The data below is a fictitious list of mean sprint times for several countries. Given the choice between the following, select the type of visual would you choose to present this data:

  1. Histogram
  2. Pie Chart
  3. Bar Chart
  4. Line Graph

 

America 10.1
Canada 10.8
Jamaica 9.9
France 10.1
Nigeria 10.2
Ivory Coast 10.7
Portugal 10.4
Trinidad and Tobago 11.7
Russia 10.9
Italy 10.4
Barbados 10.2

Solution Problem 2: Visualizing the Mean

While there are many other ways to visualize this data, out of the choices presented a bar chart would be the best method to display this data. A brief overview of why can be found below.

  • Histogram: best used to display frequency
  • Pie Chart: best for illustrating the makeup of categories within one variable
  • Line Graph: best when you want to display a variable changing as a result of another

 

Bar chart_2

 

Here, a bar chart is ideal because we want to display one numerical variable for each value within the country category.

Need a Maths teacher?

Did you like the article?

1 Star2 Stars3 Stars4 Stars5 Stars 5.00/5 - 1 vote(s)
Loading...

Danica

Located in Prague and studying to become a Statistician, I enjoy reading, writing, and exploring new places.