Académique Documents
Professionnel Documents
Culture Documents
Meaning
a) Correlation studies the relationship between two variables in which change in the value
of one variable causes change in the other variable. It is denoted by letter ‘r’. For e.g.
change in price leads to change in quantity demanded.
b) Correlation studies and measures the direction and intensity of relationship among
variables. It measures co-variation not causation.
Types of Correlation
Correlation is classified into positive and negative correlation.
The correlation is said to be positive when the variables move together in the same
direction. For e.g., sale of Ice cream and temperature move in same direction.
The correlation is said to be negative when the variables move in opposite direction. For
e.g., when you spend more time in studying chances of your failing decline.
Kinds of correlation
1. Positive and Negative correlation.
2. Linear and non – linear correlation.
3. Simple and multiple correlations.
Positive correlation: When both variables move in the same direction. If one increases,
other also increases and vice-versa.
Negative correlation: - When two variables move in the opposite direction, they are
negatively correlated.
Linear Correlation: - When two variables change in a constant proportion.
Non- linear correlation: - When two variables do not change in the same proportion.
Simple correlation – Relationship between two variables are studied.
Multiple Correction – Relationship between three or more than three variables are
studied.
Degrees of Correlation:
Perfect +1 -1
Between -0.75 and
High Between + 0.75 and + 1 -1
Between -0.25 & -
Moderate Between + 0.25 and + 0.75 0.75
Between 0 and -
Low Between 0 and + 0.25 0.25
Zero 0 0
a) Scatter Diagram
A scatter diagram is a useful technique for visually examining the form of relationship,
without calculating any numerical value. In this technique, the values of the two variables
are plotted as points on a graph paper. The cluster of points, so plotted, is referred to as
a scatter diagram. From a scatter diagram, one can get a fairly good idea of the nature of
relationship. In a scatter diagram the degree of closeness of the scatter points and their
overall direction enable us to examine the relationship. If all the points lie on a line, the
correlation is perfect and is said to be unity. If the scatter points are widely dispersed
around the line, the correlation is low. The correlation is said to be linear if the scatter
points lie near a line or on a line.
Merits of Scatter Diagram
1. Most simplest method.
2. Not affected by size of extreme values.
Demerits
1. Exact degree of correlation cannot be found.
xy
r
N y
Here,
r = Coefficient of correlation
x = (X - X)
y = (Y - Y)
N = number of observations
X = Standard deviation of series X
Y = Standard deviation of series Y
OR
Actual Mean Method
xy
r
x 2 y 2
Where x = (X - X)
y = (Y - Y)
Illustration 1:
Find Karl Pearson correlation coefficient using actual mean method:
X 3 4 5 6 7 8 9
Y 5 8 9 10 11 15 18
Solution
xy
r ; x= x ( X - X); y (Y - Y);
x 2 y 2
Substituting
58 58
r 0.96
28 X 130 60.332
Calculate the coefficient of correlation between X and Y using short cut method
X 5 10 15 20 25
Y 12 20 25 30 35
dx dy
dxdy -
r N
(dx) 2 (dy ) 2
dx 2
dy
2
N N
Substituting
5 2
545 -
r 5
543
0.99
(5) 2
(2) 2 544.87
675 444
5 5
Price 10 15 20 25 30
Demand 50 45 40 35 30
Solution
Multiple
Square Square
Deviation dx'=dx/C1 Dema Deviation dy'=dy/ C2 of
Price Deviation Deviation
(dx=X-A) C1= 5 nd (dy=Y-A) C2= 5 deviations
(dx’2) (dy’2)
(dx’dy’)
10 -10 -2 4 50 10 2 4 -4
15 -5 -1 1 45 5 1 1 -1
20 0 0 0 40 0 0 0 0
25 5 1 1 35 -5 -1 1 -1
30 10 2 4 30 -10 -2 4 -4
35 15 3 9 25 -15 -3 9 -9
N=6 ∑dx'=3 ∑dx2=19 N=6 ∑dy'=-3 ∑dy2=19 ∑dxdy=-19
Substituting
3 * (3)
19 -
6
r 1
(3) 2
(3) 2
19 19
6 6
Spearman’s rank correlation was developed by the British psychologist C.E. Spearman.
It is used when the variables cannot be measured exactly. In some situations it may not
be possible to give precise values to some variables. In such cases Spearman Rank
Correlation Coefficient can be used.
Under this method the observations are ranked in ascending or descending order using
numbers and measure the degree of relationship between the ranks instead of actual
numerical values.
6D 2
rk = 1 – 3
N N
rk = Spearman’s rank correlation
D2 = Sum of squares of difference of ranks
N = Number of observation
Illustration 4:
The following are the ranks given by 2 judges to 10 contestants in a beauty contest. Find
the correlation between the two judges.
Judge 1 1 2 3 4 5 6 7 8 9 10
Judge 2 1 4 2 5 3 9 7 10 6 8
Solution:
X Y X-Y=D D2
1 1 -0 0
2 4 -2 4
3 2 1 1
4 5 -1 1
5 3 2 4
6 9 -3 9
7 7 0 0
8 10 -2 4
9 6 3 9
10 8 2 4
36
6D 2
rk = 1 –
N3 N
6 X 36
=1- = 1-0.219 = 0.781
10 X 99
First 92 89 87 86 68 67
Term
Second 86 83 91 77 72 70
Term
Solution:
First Term Second Term R1 R2 R1 - R2= D2
D
92 86 1 2 -1 1
89 83 2 3 1 1
87 91 3 1 2 4
86 77 4 4 0 0
68 72 5 5 0 0
67 70 6 6 0 0
6D 2
rk = 1 – 3
N N
6X 6 36
=1- = 1-
6 6
3
210
= 1- 0.1714 = 0.823
In some cases it may be possible that two values in the series acquire same rank. In such
cases the rank to be assigned to individual observations is an average of the ranks which
these two individual observations would have acquired if they were different.
In case of repeated ranks the formula is given as:-
1 1
6(D 2 (m 3 m) (m 3 m))
rk = 1 – 12 12
N N
3
Judge 4 5 7 8 6 3 5
A
Judge 11 9 13 7 13 8 8
B
Solution:
A 4 11 2 5 -3 9
D 8 7 7 1 6 36
G 5 8 3.5 2.5 1 1
D 2
=51
1 1
6(D 2 (m 3 m) (m 3 m))
rk = 1 – 12 12
N3 N
1 1 1
6(51 (2 3 2) (2 3 2) (2 3 2)
=1- 12 12 12
73 7
6(51 1.5) 315
=1- =1 -
7 7
3
336
= 1 -0.9375
= 0.0625
Exercises
Very Short Answer Type Questions (One Mark Questions)
Y 9 4 1 1 4 9
Ans: r=0
Y 2 6 8 10 14 16
Ans: r= +1
5. Plot the following data as a scatter diagram and comment over the result:
X 11 10 15 13 10 16 13 8 17 14
Y 6 7 9 9 7 11 9 6 12 11
X 15 18 21 24 27 30 36 39 42 48
Y 25 25 27 27 31 33 35 41 41 45
Ans : r= 0.98
7. From the following data, compute the product movement correlation between x and y
:
X series Y series
1. Discuss Karl Pearson’s method of calculating coefficient of correlation. Give its merits
and limitations.
Judge II 3 5 8 4 7 10 2 1 6 9
Judge III 6 4 4 8 1 2 3 10 5 7
Find out by Spearman’s Rank difference method which pair of judges has a common taste
in respect of beauty.
(Ans. rk I & II = - 0.21; rk II & III = -0.29; rk I & III = + 0.64)
3. What are the advantages of Spearman’s rank correlation coefficient over Karl
Pearson’s correlation coefficient? Explain the method of calculating Spearman’s rank
correlation coefficient.
Weight 65 54 55 61 60 54 50 63 65
(in kg.)
Marks (in 50 54 56 59 60 62 61 65 67 71 71 74
Accountancy)
Marks (in 22 25 34 28 26 30 32 30 28 34 36 40
Economics)
(Ans : r = + 0.78)
6. Calculate Spearman’s rank correlation from the following data:
X 10 12 8 15 20 25 40
Y 15 10 6 25 16 12 8
(Ans. rk = + 0.14)
7. Two judges in a beauty competition rank the twelve entries as follows. Calculate rank
correlation coefficient.
X 1 2 3 4 5 6 7 8 9 10 11 12
Y 12 9 6 10 3 5 4 7 8 2 11 1
(Ans r = - 0.45)
X: 68 75 90 75 50 62 40 35
Y: 10 12 14 10 10 13 9 8
(Ans. r = + 0.76)