Introduction
Humans are naturally wired to see patterns — we often link events, behaviours, or numbers and wonder if they’re connected.
In statistics, this idea is formalised through correlation, a measure that quantifies the strength and direction of a relationship between two variables.
For example, ice cream sales and shark attacks might both increase in summer, showing a positive correlation, but that doesn’t mean one causes the other.
This is a crucial lesson in statistics: correlation does not imply causation.
The correlation coefficient, usually represented by r, measures how strongly two variables are related.
Correlation Coefficient Formula
For two variables X and Y, the Pearson correlation coefficient is given by:

Where:
is the mean of the x-values
is the mean of the y-values
- ranges from -1 to +1
Interpretation:
: Perfect positive correlation
: No correlation
: Perfect negative correlation
Visual Interpretation of Correlation

| Graph Type | Correlation (r) | Description |
|---|---|---|
| A | 1.0 | Perfect positive correlation — as one variable increases, so does the other. |
| B | 0.6 | Moderate positive correlation — upward trend with some variation. |
| C | 0.0 | No correlation — no linear relationship between the variables. |
| D | -0.6 | Moderate negative correlation — one increases as the other decreases. |
| E | -1.0 | Perfect negative correlation — all points lie on a downward line. |
Practice Problems and Solutions
Problem 1
Calculate and interpret the correlation coefficient for the following data:
| Person | Hand Span (cm) | Height (cm) |
|---|---|---|
| A | 17 | 150 |
| B | 15 | 154 |
| C | 19 | 169 |
| D | 17 | 172 |
| E | 21 | 175 |
Solution
Step 1: Calculate the means:


Step 2: Compute deviations and products:
| Person | x−x̄ | y−ȳ | product | (x−x̄)² | (y−ȳ)² |
|---|---|---|---|---|---|
| A | -0.8 | -14.0 | 11.2 | 0.6 | 196.0 |
| B | -2.8 | -10.0 | 28.0 | 7.8 | 100.0 |
| C | 1.1 | 5.0 | 5.9 | 1.4 | 25.0 |
| D | -0.8 | 8.0 | -6.4 | 0.6 | 64.0 |
| E | 3.2 | 11.0 | 35.2 | 10.2 | 121.0 |
Totals:
Σ(x−x̄)(y−ȳ) = 74.0
Σ(x−x̄)² = 20.8
Σ(y−ȳ)² = 506.0
Step 3: Substitute into the formula:

Interpretation:
There is a strong positive correlation between hand span and height. Taller individuals tend to have larger hand spans.
Problem 2
Calculate and interpret the correlation coefficient for the following data:
| Person | Weight (kg) | Blood Pressure (mmHg) |
|---|---|---|
| A | 150 | 125 |
| B | 169 | 130 |
| C | 175 | 160 |
| D | 180 | 169 |
| E | 200 | 150 |
Solution
Step 1: Calculate the means:


Step 2: Compute deviations and products:
| Person | x−x̄ | y−ȳ | Product | (x−x̄)² | (y−ȳ)² |
|---|---|---|---|---|---|
| A | -24.8 | -21.8 | 540.6 | 615.0 | 475.2 |
| B | -5.8 | -16.8 | 97.4 | 33.6 | 282.2 |
| C | 0.2 | 13.2 | 2.6 | 0.0 | 174.2 |
| D | 5.2 | 22.2 | 115.4 | 27.0 | 492.8 |
| E | 25.2 | 3.2 | 80.6 | 635.0 | 10.2 |
Totals:
Σ(x−x̄)(y−ȳ) = 836.8
Σ(x−x̄)² = 1310.8
Σ(y−ȳ)² = 1434.8
Step 3: Substitute into the formula:

Interpretation:
There is a moderate positive correlation between weight and blood pressure — as weight increases, blood pressure tends to rise.
Problem 3
Calculate and interpret the correlation coefficient for the following data:
| Person | Screen Time (hr) | Exam Score (%) |
|---|---|---|
| A | 5 | 88 |
| B | 8 | 90 |
| C | 10 | 78 |
| D | 12 | 85 |
| E | 15 | 70 |
Solution
Step 1: Calculate the means:


Step 2: Compute deviations and products:
| Person | x−x̄ | y−ȳ | Product | (x−x̄)² | (y−ȳ)² |
|---|---|---|---|---|---|
| A | -5.0 | 5.8 | -29.0 | 25.0 | 33.6 |
| B | -2.0 | 7.8 | -15.6 | 4.0 | 60.8 |
| C | 0.0 | -4.2 | -0.0 | 0.0 | 17.6 |
| D | 2.0 | 2.8 | 5.6 | 4.0 | 7.8 |
| E | 5.0 | -12.2 | -61.0 | 25.0 | 148.8 |
Totals:
Σ(x−x̄)(y−ȳ) = -100.0
Σ(x−x̄)² = 58.0
Σ(y−ȳ)² = 268.8
Step 3: Substitute into the formula:
Interpretation:
There is a strong negative correlation between screen time and exam score: as screen time increases, exam scores tend to decrease.
Conclusion
Correlation helps us understand whether two variables move together — and how strongly.
A positive correlation means they rise together, a negative correlation means one rises while the other falls, and a zero correlation means no relationship at all.
This concept forms the basis of regression analysis, which allows us to make predictions using real data.








Very Helpful. Thanks you guys for good job done to many
Thank you and I want to download these.
How is it?
Very nice and easy way.
Well explained.