Introduction

Humans are naturally wired to see patterns — we often link events, behaviours, or numbers and wonder if they’re connected.
In statistics, this idea is formalised through correlation, a measure that quantifies the strength and direction of a relationship between two variables.

For example, ice cream sales and shark attacks might both increase in summer, showing a positive correlation, but that doesn’t mean one causes the other.
This is a crucial lesson in statistics: correlation does not imply causation.

The correlation coefficient, usually represented by r, measures how strongly two variables are related.

Correlation Coefficient Formula

For two variables X and Y, the Pearson correlation coefficient is given by:

Where:

  • is the mean of the x-values
  • is the mean of the y-values
  • - ranges from -1 to +1

Interpretation:

  • : Perfect positive correlation
  • : No correlation
  • : Perfect negative correlation
The best Maths tutors available
Sehaj
4.9
4.9 (56 reviews)
Sehaj
£60
/h
Gift icon
1st lesson free!
Intasar
5
5 (68 reviews)
Intasar
£129
/h
Gift icon
1st lesson free!
Johann
5
5 (48 reviews)
Johann
£50
/h
Gift icon
1st lesson free!
Hiren
5
5 (32 reviews)
Hiren
£149
/h
Gift icon
1st lesson free!
Poonam
5
5 (62 reviews)
Poonam
£100
/h
Gift icon
1st lesson free!
Harjinder
4.9
4.9 (163 reviews)
Harjinder
£25
/h
Gift icon
1st lesson free!
Syed
5
5 (64 reviews)
Syed
£50
/h
Gift icon
1st lesson free!
Farooq
4.9
4.9 (50 reviews)
Farooq
£50
/h
Gift icon
1st lesson free!
Sehaj
4.9
4.9 (56 reviews)
Sehaj
£60
/h
Gift icon
1st lesson free!
Intasar
5
5 (68 reviews)
Intasar
£129
/h
Gift icon
1st lesson free!
Johann
5
5 (48 reviews)
Johann
£50
/h
Gift icon
1st lesson free!
Hiren
5
5 (32 reviews)
Hiren
£149
/h
Gift icon
1st lesson free!
Poonam
5
5 (62 reviews)
Poonam
£100
/h
Gift icon
1st lesson free!
Harjinder
4.9
4.9 (163 reviews)
Harjinder
£25
/h
Gift icon
1st lesson free!
Syed
5
5 (64 reviews)
Syed
£50
/h
Gift icon
1st lesson free!
Farooq
4.9
4.9 (50 reviews)
Farooq
£50
/h
Gift icon
1st lesson free!
Let's go

Visual Interpretation of Correlation

correlation graph diagrams
Graph TypeCorrelation (r)Description
A1.0Perfect positive correlation — as one variable increases, so does the other.
B0.6Moderate positive correlation — upward trend with some variation.
C0.0No correlation — no linear relationship between the variables.
D-0.6Moderate negative correlation — one increases as the other decreases.
E-1.0Perfect negative correlation — all points lie on a downward line.

Practice Problems and Solutions

Problem 1

Calculate and interpret the correlation coefficient for the following data:

PersonHand Span (cm)Height (cm)
A17150
B15154
C19169
D17172
E21175

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳproduct(x−x̄)²(y−ȳ)²
A-0.8-14.011.20.6196.0
B-2.8-10.028.07.8100.0
C1.15.05.91.425.0
D-0.88.0-6.40.664.0
E3.211.035.210.2121.0

Totals:
Σ(x−x̄)(y−ȳ) = 74.0
Σ(x−x̄)² = 20.8
Σ(y−ȳ)² = 506.0

Step 3: Substitute into the formula:

Interpretation:
There is a strong positive correlation between hand span and height. Taller individuals tend to have larger hand spans.

Problem 2

Calculate and interpret the correlation coefficient for the following data:

PersonWeight (kg)Blood Pressure (mmHg)
A150125
B169130
C175160
D180169
E200150

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳProduct(x−x̄)²(y−ȳ)²
A-24.8-21.8540.6615.0475.2
B-5.8-16.897.433.6282.2
C0.213.22.60.0174.2
D5.222.2115.427.0492.8
E25.23.280.6635.010.2

Totals:
Σ(x−x̄)(y−ȳ) = 836.8
Σ(x−x̄)² = 1310.8
Σ(y−ȳ)² = 1434.8

Step 3: Substitute into the formula:

Interpretation:
There is a moderate positive correlation between weight and blood pressure — as weight increases, blood pressure tends to rise.

Problem 3

Calculate and interpret the correlation coefficient for the following data:

PersonScreen Time (hr)Exam Score (%)
A588
B890
C1078
D1285
E1570

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳProduct(x−x̄)²(y−ȳ)²
A-5.05.8-29.025.033.6
B-2.07.8-15.64.060.8
C0.0-4.2-0.00.017.6
D2.02.85.64.07.8
E5.0-12.2-61.025.0148.8

Totals:
Σ(x−x̄)(y−ȳ) = -100.0
Σ(x−x̄)² = 58.0
Σ(y−ȳ)² = 268.8

Step 3: Substitute into the formula:

Interpretation:
There is a strong negative correlation between screen time and exam score: as screen time increases, exam scores tend to decrease.

Conclusion

Correlation helps us understand whether two variables move together — and how strongly.
A positive correlation means they rise together, a negative correlation means one rises while the other falls, and a zero correlation means no relationship at all.

This concept forms the basis of regression analysis, which allows us to make predictions using real data.

Did you like this article? Rate it!

4.00 (17 rating(s))
Loading...

Gianpiero Placidi

UK-based Chemistry graduate with a passion for education, providing clear explanations and thoughtful guidance to inspire student success.