October 9, 2020

Chapters

## Measures of Central Tendency

A **measure of central tendency** is also known as a **summary statistic** and it generally represents the central point of the data set. These measures tell us where most values are located in distribution and are also known as the **central location of the distribution**. Sometimes the data tends to cluster around the central value. The most common measures of central tendency are mean, median, and mode. In this section, we will learn how to calculate the three measures of central tendency. We will also see which kind of data is appropriate for which type of measure.

## Arithmetic Mean

The arithmetic mean is an average of all the values in the data set. It is the most popular measure of central tendency and is easier to calculate. To calculate the arithmetic mean, enumerate all the values in the data set and divide the resulting value by the number of observations.

### Example

The following table shows the score obtained by five students in math exam. Find the arithmetic mean.

Students | Marks |
---|---|

John | 75 |

Mariah | 80 |

Harry | 81 |

Alice | 90 |

Elizabeth | 60 |

### Solution

To find the arithmetic mean, we will divide the total marks by the number of students:

Arithmetic mean =

Hence, the arithmetic mean of the data is 77.2.

## Median

It is the middle value of the data set. This value divides the dataset into two halves. The median of the data is computed after arranging the data either in ascending or descending order. The method of finding the median of the data set when the number of elements are even is different than the process of finding the median when number of elements is odd. In this section, we will see how to find the median of the dataset when it has even and odd number of elements.

### Example 1

Find the median of the below data set.

41, 39, 25, 36, 55, 52, 40

### Solution

The data set in this example consists of seven values. If the data contains an odd number of elements then the median is the middle number after arranging it in ascending or descending order.

First, we will arrange the above data set in ascending order like this:

25, 36, 39, 40, 41, 52, 55

The middle number is 40, hence it is the median.

### Example 2

Consider the following data set.

12, 10, 5, 18, 19, 14

Find its median.

### Solution

Since the data set in this example has even number of elements, therefore the median will be calculated by taking the average of middle two numbers after arranging the data set in ascending or descending order.

Let us arrange the data set in ascending order like this:

5, 10, 12, 14, 18, 19

Since the data has six elements, therefore the median will be the average of third and fourth numbers.

Median =

## Mode

The mode is the **most repeated value** in the data set. A data set can have one, two or multiple modes. A data set that has **two modes** is known as** bimodal** and the data set in which there are multiple modes is known as **multi-modal data set**. If all the elements in the data set are repeated with the same frequency, then the data has **no mode**.

### Example 1

Find the mode of the data below:

4, 6, 4, 6, 8, 7, 9, 10, 6

### Solution

In the above data, 4 is repeated two times and 6 is repeated three times. The most repeated value is 6 which is the mode of the data.

### Example 2

Find the mode of the following data set:

1, 1, 2, 2, 3, 4, 5, 6, 4, 1, 2

### Solution

In the above data, 1 and 2 both are repeated three times. Hence, there are two modes in the above data set which are 1 and 2.

## Measures of Position

Measures of position are various techniques that are used to divide the data into equal groups. These values tell us the position of the value in the data set relative to other values. The data must be arranged in ascending order to compute the measures of position.

## Quartiles

Quartiles divide the data set into **four equal parts**. Different quartiles reflect the three values that divide the data into four equal parts. We denote the first quartile as and it represents 25% of the values less than it and 75% of the values greater than it. The second quartile is also known as median as it reflects 50% of the values above it and 50% below it. It is denoted as . The third quartile has 25% values greater than it and 75% values smaller than it. It is represented as .

The formulas for finding three quartiles of the data set are given below:

First quartile =

Second quartile =

Third quartile =

Here, N = Total number of elements in the set.

### Example 1

Find the three quartiles of the following data set.

1, 3, 5, 7, 9, 11, 13

### Solution

The data set in this example is already arranged in ascending order.

First quartile =

=

= value

The number at the second position in the data set is 3, hence it is the first quartile.

Second quartile =

= value

The number at the fourth position in the data set is 7, hence it is the second quartile.

Third quartile =

=

= value

The number at the sixth position in the data set is 11, hence it is the third quartile.

**Note: If we get the value of any quartile, let say 7.5, then we take average of 7th and 8th term to find the quartile. **

## Deciles

A decile splits the data set into **ten equal segments**. It is terminology in statistics that divides the data into ten intervals. It actually splits the data into ten equal segments on the number line.

We use the following formula for calculating the decile.

Here, i = 1, 2, 3, 4, 5, 6, ....10.

### Example

Calculate the second and fifth decile of the following data set.

SR No. | Values |
---|---|

1 | 3 |

2 | 6 |

3 | 10 |

4 | 13 |

5 | 15 |

6 | 22 |

7 | 25 |

8 | 28 |

9 | 30 |

10 | 32 |

11 | 35 |

12 | 40 |

13 | 43 |

14 | 45 |

15 | 48 |

16 | 52 |

17 | 55 |

18 | 56 |

19 | 60 |

### Solution

The formula for finding the decile is:

Substitute N= 19 and i = 2 in the above formula to get the second decile.

value

Hence, the first decile is the value at the fourth position of the data set. The fourth value in the data set is 13, hence it is the second decile.

To find the 5th decile, substitute N = 19 and i = 5 in the formula:

value

Hence, the 5th decile is the 10th value of the data set. The tenth value in the above data set is 32, hence it is the 5th decile of the data set.

## Percentiles

The percentiles divide the data into **100 equal segments**. Equal segments mean that each part has the same number of values in it.

The formula for finding the percentile is given below:

### Example

Find the 25th and 75th percentiles of the following data set.

Sr No. | Value |
---|---|

1 | 5 |

2 | 10 |

3 | 12 |

4 | 15 |

5 | 18 |

6 | 20 |

7 | 25 |

8 | 28 |

9 | 32 |

10 | 35 |

11 | 38 |

12 | 43 |

### Solution

To find the 25th percentile, substitute the values in the formula below:

The rank value 3.75 is more closer to 4 than 3, hence, the 25th percentile is the value at the fourth position of the dataset. The value at 4th position in the above table is 15.

To find the 75th percentile, substitute the values in the formula below:

9.25 is closer to 9 then 10, hence it is value at the 9th position. The value at the ninth position in the above table is 32, hence it reflects the 75th percentile of the data set.

## Measures of Dispersion

In statistics, the measures of dispersion tell us how the distribution is stretched or squeezed. We can say that the measures of dispersion help us to determine how far the values of the distribution are from the center of the data. In this section, we will discuss four measures of dispersion, that are range, average deviation, standard deviation and variance.

## Range

To find range of the data, you need to take the difference of the highest and lowest value of the data set. The formula for finding the range of a data set is given below:

Range = Highest value - Lowest value

### Example

The following table shows the marks obtained by five students in a math test. Find the range of the data set.

1 | 24 |
---|---|

2 | 32 |

3 | 22 |

4 | 45 |

5 | 33 |

### Solution

Range = Highest value - Lowest value

Range = 45 - 22 = 23

## Average Deviation

To find the average deviation, you should know how to compute absolute deviation. Absolute deviation is computed by taking the mean of the data set and subtracting this mean from each value. After we have the absolute deviation values, we take the average of all the values to calculate average deviation.

### Example

The salaries of five individuals is given in the table below. Find the average deviation of the data.

Name | Salary |
---|---|

Alice | $2200 |

John | $2500 |

Harry | $1800 |

Elizabeth | $2800 |

David | $3800 |

Solution

First, we will calculate the mean of the data set:

Now, we will find the absolute mean of each value like this:

Alice = 2200 - 2620 = -420

John = 2500 - 2620 = 120

Harry = 1800 - 2620 = 820

Elizabeth = 2800 - 2620 = 220

David = 3800 - 2620 = 1180

Now, we will take the average of all the absolute values above to get the value of average deviation.

Average deviation =

## Variance

### Example

Names | Weight (kg) |
---|---|

John | 54 |

Sarah | 55 |

Mariah | 68 |

Alice | 72 |

Harry | 80 |

good day i like the article