Introduction

Humans are naturally wired to see patterns — we often link events, behaviours, or numbers and wonder if they’re connected.
In statistics, this idea is formalised through correlation, a measure that quantifies the strength and direction of a relationship between two variables.

For example, ice cream sales and shark attacks might both increase in summer, showing a positive correlation, but that doesn’t mean one causes the other.
This is a crucial lesson in statistics: correlation does not imply causation.

The correlation coefficient, usually represented by r, measures how strongly two variables are related.

Correlation Coefficient Formula

For two variables X and Y, the Pearson correlation coefficient is given by:

Where:

  • is the mean of the x-values
  • is the mean of the y-values
  • - ranges from -1 to +1

Interpretation:

  • : Perfect positive correlation
  • : No correlation
  • : Perfect negative correlation
The best Maths tutors available
Intasar
5
5 (69 reviews)
Intasar
£129
/h
Gift icon
1st lesson free!
Johann
5
5 (48 reviews)
Johann
£60
/h
Gift icon
1st lesson free!
Hiren
5
5 (32 reviews)
Hiren
£149
/h
Gift icon
1st lesson free!
Poonam
5
5 (62 reviews)
Poonam
£100
/h
Gift icon
1st lesson free!
Harjinder
4.9
4.9 (163 reviews)
Harjinder
£25
/h
Gift icon
1st lesson free!
Syed
5
5 (69 reviews)
Syed
£50
/h
Gift icon
1st lesson free!
Farooq
4.9
4.9 (50 reviews)
Farooq
£50
/h
Gift icon
1st lesson free!
Vivek
4.9
4.9 (23 reviews)
Vivek
£35
/h
Gift icon
1st lesson free!
Intasar
5
5 (69 reviews)
Intasar
£129
/h
Gift icon
1st lesson free!
Johann
5
5 (48 reviews)
Johann
£60
/h
Gift icon
1st lesson free!
Hiren
5
5 (32 reviews)
Hiren
£149
/h
Gift icon
1st lesson free!
Poonam
5
5 (62 reviews)
Poonam
£100
/h
Gift icon
1st lesson free!
Harjinder
4.9
4.9 (163 reviews)
Harjinder
£25
/h
Gift icon
1st lesson free!
Syed
5
5 (69 reviews)
Syed
£50
/h
Gift icon
1st lesson free!
Farooq
4.9
4.9 (50 reviews)
Farooq
£50
/h
Gift icon
1st lesson free!
Vivek
4.9
4.9 (23 reviews)
Vivek
£35
/h
Gift icon
1st lesson free!
Let's go

Visual Interpretation of Correlation

correlation graph diagrams
Graph TypeCorrelation (r)Description
A1.0Perfect positive correlation — as one variable increases, so does the other.
B0.6Moderate positive correlation — upward trend with some variation.
C0.0No correlation — no linear relationship between the variables.
D-0.6Moderate negative correlation — one increases as the other decreases.
E-1.0Perfect negative correlation — all points lie on a downward line.

Practice Problems and Solutions

Problem 1

Calculate and interpret the correlation coefficient for the following data:

PersonHand Span (cm)Height (cm)
A17150
B15154
C19169
D17172
E21175

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳproduct(x−x̄)²(y−ȳ)²
A-0.8-14.011.20.6196.0
B-2.8-10.028.07.8100.0
C1.15.05.91.425.0
D-0.88.0-6.40.664.0
E3.211.035.210.2121.0

Totals:
Σ(x−x̄)(y−ȳ) = 74.0
Σ(x−x̄)² = 20.8
Σ(y−ȳ)² = 506.0

Step 3: Substitute into the formula:

Interpretation:
There is a strong positive correlation between hand span and height. Taller individuals tend to have larger hand spans.

Problem 2

Calculate and interpret the correlation coefficient for the following data:

PersonWeight (kg)Blood Pressure (mmHg)
A150125
B169130
C175160
D180169
E200150

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳProduct(x−x̄)²(y−ȳ)²
A-24.8-21.8540.6615.0475.2
B-5.8-16.897.433.6282.2
C0.213.22.60.0174.2
D5.222.2115.427.0492.8
E25.23.280.6635.010.2

Totals:
Σ(x−x̄)(y−ȳ) = 836.8
Σ(x−x̄)² = 1310.8
Σ(y−ȳ)² = 1434.8

Step 3: Substitute into the formula:

Interpretation:
There is a moderate positive correlation between weight and blood pressure — as weight increases, blood pressure tends to rise.

Problem 3

Calculate and interpret the correlation coefficient for the following data:

PersonScreen Time (hr)Exam Score (%)
A588
B890
C1078
D1285
E1570

Solution

Step 1: Calculate the means:

Step 2: Compute deviations and products:

Personx−x̄y−ȳProduct(x−x̄)²(y−ȳ)²
A-5.05.8-29.025.033.6
B-2.07.8-15.64.060.8
C0.0-4.2-0.00.017.6
D2.02.85.64.07.8
E5.0-12.2-61.025.0148.8

Totals:
Σ(x−x̄)(y−ȳ) = -100.0
Σ(x−x̄)² = 58.0
Σ(y−ȳ)² = 268.8

Step 3: Substitute into the formula:

Interpretation:
There is a strong negative correlation between screen time and exam score: as screen time increases, exam scores tend to decrease.

Conclusion

Correlation helps us understand whether two variables move together — and how strongly.
A positive correlation means they rise together, a negative correlation means one rises while the other falls, and a zero correlation means no relationship at all.

This concept forms the basis of regression analysis, which allows us to make predictions using real data.

Summarise with AI:

Did you like this article? Rate it!

4.00 (18 rating(s))
Loading...

Gianpiero Placidi

UK-based Chemistry graduate with a passion for education, providing clear explanations and thoughtful guidance to inspire student success.