Correlation Analysis
Regression analysis is used primarily for prediction.
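[Figure: scatter plot illustrating no relationship between the variables]

$$Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$$

Y_i: dependent (response) variable
X_i: independent (explanatory) variable
\beta_0: Y intercept
\beta_1: slope
\varepsilon_i: random error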
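Population Linear Regression Model

Observed value: $$Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i$$, where \varepsilon_i = random error
Population regression line: $$\mu_{Y|X} = \beta_0 + \beta_1 X_i$$
Sample regression line: $$\hat{Y}_i = b_0 + b_1 X_i$$

[Figure: observed values Y_i plotted against X_i around the regression line, with intercept b_0 marked on the Y axis]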
REGRESSION COEFFICIENTS
To calculate the estimates of the coefficients that minimize the sum of squared differences between the data points and the line, use the formulas:

$$b_1 = \frac{\operatorname{cov}(X, Y)}{s_x^2} = \frac{n\sum X_i Y_i - (\sum X_i)(\sum Y_i)}{n\sum X_i^2 - (\sum X_i)^2}$$

and

$$b_0 = \bar{Y} - b_1 \bar{X}$$

The fitted line is $\hat{y} = b_0 + b_1 x$.
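The computational formula for the slope and intercept can be sketched in a few lines of Python. This is only an illustration: the function name `least_squares` is hypothetical, and the data are the seven stores (square feet, annual sales in $000) used as the example in this deck.

```python
# Seven-store example data from this deck.
square_feet = [1726, 1542, 2816, 5555, 1292, 2208, 1313]
annual_sales = [3681, 3395, 6653, 9543, 3318, 5563, 3760]  # $000


def least_squares(x, y):
    """Return (b0, b1) from the computational least-squares formulas."""
    n = len(x)
    sum_x = sum(x)
    sum_y = sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_x2 = sum(xi * xi for xi in x)
    # b1 = (n*sum(XY) - sum(X)*sum(Y)) / (n*sum(X^2) - (sum(X))^2)
    b1 = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # b0 = Y-bar - b1 * X-bar
    b0 = sum_y / n - b1 * sum_x / n
    return b0, b1


b0, b1 = least_squares(square_feet, annual_sales)
print(f"b0 = {b0:.3f}, b1 = {b1:.3f}")
```

Rounded to three decimals, the result should agree with the coefficients reported in the Excel output for this example.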
Store   Square Feet   Annual Sales ($000)
1       1,726         3,681
2       1,542         3,395
3       2,816         6,653
4       5,555         9,543
5       1,292         3,318
6       2,208         5,563
7       1,313         3,760
[Figure: scatter diagram of Annual Sales ($000) against Square Feet]

Excel Output

Intercept       1636.414726
X Variable 1    1.486633657

$$\hat{Y}_i = b_0 + b_1 X_i = 1636.415 + 1.487 X_i$$

[Figure: scatter diagram of Annual Sales ($000) against Square Feet with the fitted regression line]
Test Statistic:

$$t = \frac{b_1 - \beta_1}{S_{b_1}}, \qquad \text{where } S_{b_1} = \frac{S_{YX}}{\sqrt{\sum_{i=1}^{n} (X_i - \bar{X})^2}}$$

and df = n - 2
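Standard error of the estimate:

$$S_{YX} = \sqrt{\frac{SSE}{n-2}} = \sqrt{\frac{\sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2}{n-2}}$$

[Figure: scatter diagram of Annual Sales ($000) against Square Feet showing the scatter of points around the regression line]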
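The t statistic for the slope and the standard error of the estimate can be checked numerically. The sketch below applies the two formulas to the seven-store data from this deck; the critical value 2.5706 is the one quoted in the deck for df = 5 rather than something computed here, and the variable names are illustrative.

```python
import math

# Seven-store example data from this deck.
square_feet = [1726, 1542, 2816, 5555, 1292, 2208, 1313]
annual_sales = [3681, 3395, 6653, 9543, 3318, 5563, 3760]  # $000
T_CRITICAL = 2.5706  # two-tailed, alpha = .05, df = 5 (quoted in the deck)

n = len(square_feet)
x_bar = sum(square_feet) / n
y_bar = sum(annual_sales) / n
sxx = sum((x - x_bar) ** 2 for x in square_feet)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(square_feet, annual_sales))

b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

# S_YX = sqrt(SSE / (n - 2))
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(square_feet, annual_sales))
s_yx = math.sqrt(sse / (n - 2))

# S_b1 = S_YX / sqrt(sum((X_i - X-bar)^2)); t tests H0: beta1 = 0
s_b1 = s_yx / math.sqrt(sxx)
t_stat = (b1 - 0) / s_b1
reject_h0 = abs(t_stat) > T_CRITICAL

print(f"S_YX = {s_yx:.2f}, S_b1 = {s_b1:.4f}, t = {t_stat:.3f}, reject H0: {reject_h0}")
```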
Regression Model Obtained:

$$\hat{Y}_i = 1636.415 + 1.487 X_i$$

The slope of this model is 1.487.
Is there a linear relationship between the square footage of a store and its annual sales?

H0: \beta_1 = 0
H1: \beta_1 \neq 0
\alpha = .05
df = 7 - 2 = 5
Critical Value(s): -2.5706 and 2.5706 (.025 in each tail; reject H0 if t < -2.5706 or t > 2.5706)

Test Statistic (from the Excel output):

Intercept:      t Stat = 3.6244333, P-value = 0.0151488, Upper 95% = 2797.01853
X Variable 1:   t Stat = 9.009944, P-value = 0.0002812, 95% interval = 1.06249037 to 1.91077694

$$t = \frac{b_1}{S_{b_1}} = 9.009944 > 2.5706$$

Decision:
Reject H0
Conclusion:
There is evidence of a linear relationship.

Measures of Variation: The Sum of Squares

$$SST = \sum (Y_i - \bar{Y})^2, \qquad SSR = \sum (\hat{Y}_i - \bar{Y})^2, \qquad SSE = \sum (Y_i - \hat{Y}_i)^2$$

[Figure: for a given X_i, the distances from Y_i to \bar{Y} (SST), from \hat{Y}_i to \bar{Y} (SSR), and from Y_i to \hat{Y}_i (SSE)]

Measures of Variation: The Sum of Squares: Example

              SS
Regression    30380456.12   (SSR)
Residual      1871199.595   (SSE)
Total         32251655.71   (SST)
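The three sums of squares for the store example can be recomputed directly from the data. This is a minimal sketch assuming the usual definitions (SST around the mean of Y, SSR for fitted values around the mean, SSE for residuals); the variable names are illustrative.

```python
# Seven-store example data from this deck.
square_feet = [1726, 1542, 2816, 5555, 1292, 2208, 1313]
annual_sales = [3681, 3395, 6653, 9543, 3318, 5563, 3760]  # $000

n = len(square_feet)
x_bar = sum(square_feet) / n
y_bar = sum(annual_sales) / n
sxx = sum((x - x_bar) ** 2 for x in square_feet)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(square_feet, annual_sales))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

fitted = [b0 + b1 * x for x in square_feet]

sst = sum((y - y_bar) ** 2 for y in annual_sales)              # total variation
ssr = sum((f - y_bar) ** 2 for f in fitted)                    # explained variation
sse = sum((y - f) ** 2 for y, f in zip(annual_sales, fitted))  # unexplained variation

print(f"SSR = {ssr:.2f}")
print(f"SSE = {sse:.2f}")
print(f"SST = {sst:.2f}  (SSR + SSE = {ssr + sse:.2f})")
```

Up to floating-point rounding, the printed SST should equal SSR + SSE, which is the decomposition the ANOVA table relies on.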
Source of Variation    df       Sum of Squares     Mean Square (Variance)    F Test Statistic
Explained (Factor)     k - 1    SSR                MSR = SSR/(k - 1)         F = MSR/MSE
Within (Error)         n - k    SSE                MSE = SSE/(n - k)
Total                  n - 1    SST = SSR + SSE
F Test

$$F = \frac{MSR}{MSE}, \qquad MSR = \frac{SSR}{k-1}, \qquad MSE = \frac{SSE}{n-k}$$

[Variation in y] = SSR + SSE.
A large F results from a large SSR. Then much of the variation in y is explained by the regression model.

Rejection region: $F > F_{\alpha,\,k-1,\,n-k}$

When F falls in the rejection region, the null hypothesis should be rejected; thus, the model is valid.

The Coefficient of Determination

$$r^2 = \frac{SSR}{SST}$$
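As a worked illustration of the F test and the coefficient of determination, the sketch below plugs the sums of squares reported in the Excel output for the store example into the formulas. Two assumptions are made here: k = 2 counts the estimated parameters (intercept and slope), chosen so that n - k matches the deck's df = n - 2, and the critical F value is obtained by squaring the deck's critical t, which is valid only when the numerator has 1 degree of freedom.

```python
# Sums of squares reported in the Excel output for the store example.
SSR = 30380456.12
SSE = 1871199.595
SST = 32251655.71

n = 7  # observations (stores)
k = 2  # assumption: estimated parameters (intercept and slope)

msr = SSR / (k - 1)
mse = SSE / (n - k)
f_stat = msr / mse

# With 1 numerator df, the F critical value equals the squared
# two-tailed t critical value (2.5706 is quoted in the deck for df = 5).
f_critical = 2.5706 ** 2
reject_h0 = f_stat > f_critical

r_squared = SSR / SST

print(f"MSR = {msr:.2f}, MSE = {mse:.2f}")
print(f"F = {f_stat:.2f}, critical F = {f_critical:.2f}, reject H0: {reject_h0}")
print(f"r^2 = {r_squared:.4f}")
```

In simple regression F is the square of the slope's t statistic, so the two tests lead to the same decision.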
Coefficients of Determination (r2) and Correlation (r)

[Figure: four scatter plots of Y against X, each with the fitted line $\hat{Y}_i = b_0 + b_1 X_i$]
Panels: r2 = 1, r = +1; r2 = 1, r = -1; r2 = .8, r = +0.9; r2 = 0, r = 0

Measures of Variation: Example
Excel Output for Produce Stores

Regression Statistics
Multiple R           0.9705572
R Square             0.94198129
Adjusted R Square    0.93037754
Standard Error       611.751517   (S_YX)
Observations         7

r2 = .94: about 94% of the variation in annual sales is explained by square footage.

Required conditions must be satisfied.
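The entries of Excel's Regression Statistics block can be reproduced from the store data. The sketch below is illustrative only; the adjusted R Square line uses the standard adjustment for one explanatory variable, a formula that is not stated in the deck itself.

```python
import math

# Seven-store example data from this deck.
square_feet = [1726, 1542, 2816, 5555, 1292, 2208, 1313]
annual_sales = [3681, 3395, 6653, 9543, 3318, 5563, 3760]  # $000

n = len(square_feet)
x_bar = sum(square_feet) / n
y_bar = sum(annual_sales) / n
sxx = sum((x - x_bar) ** 2 for x in square_feet)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(square_feet, annual_sales))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

sst = sum((y - y_bar) ** 2 for y in annual_sales)
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(square_feet, annual_sales))

r_square = 1 - sse / sst                  # equals SSR / SST
multiple_r = math.sqrt(r_square)          # Excel reports the unsigned value
r = math.copysign(multiple_r, b1)         # correlation takes the sign of the slope
# Standard adjustment with one explanatory variable (not given in the deck).
adjusted_r_square = 1 - (1 - r_square) * (n - 1) / (n - 2)
standard_error = math.sqrt(sse / (n - 2))  # S_YX

print(f"Multiple R        {multiple_r:.7f}")
print(f"R Square          {r_square:.8f}")
print(f"Adjusted R Square {adjusted_r_square:.8f}")
print(f"Standard Error    {standard_error:.6f}")
print(f"Observations      {n}")
```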
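Estimation of Predicted Values

Confidence Interval Estimate for $\mu_{Y|X}$ (confidence interval for the mean of Y):

$$\hat{Y}_i \pm t_{n-2}\, S_{YX} \sqrt{\frac{1}{n} + \frac{(X_i - \bar{X})^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}}$$

where $S_{YX}$ is the standard error of the estimate and $t_{n-2}$ is the t value from the table with df = n - 2.

Confidence Interval for an individual $Y_i$:

$$\hat{Y}_i \pm t_{n-2}\, S_{YX} \sqrt{1 + \frac{1}{n} + \frac{(X_i - \bar{X})^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}}$$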
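Estimation of Predicted Values: Example
Confidence Interval Estimate for $\mu_{Y|X}$

Find the 95% confidence interval for the average annual sales of stores of 2,000 square feet (a given X).

Predicted sales: $\hat{Y}_i = 1636.415 + 1.487(2000) = 4610.42$ ($000)
$\bar{X} = 2350.29$, $S_{YX} = 611.75$, $t_{n-2} = t_5 = 2.5706$

$$\hat{Y}_i \pm t_{n-2}\, S_{YX} \sqrt{\frac{1}{n} + \frac{(X_i - \bar{X})^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}} = 4610.42 \pm 612.66$$

Confidence interval for mean Y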
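Estimation of Predicted Values: Example
Confidence Interval Estimate for an Individual $Y_i$

Find the 95% confidence interval for annual sales of one particular store of 2,000 square feet.

Predicted sales: $\hat{Y}_i = 1636.415 + 1.487(2000) = 4610.42$ ($000)
$\bar{X} = 2350.29$, $S_{YX} = 611.75$, $t_{n-2} = t_5 = 2.5706$

$$\hat{Y}_i \pm t_{n-2}\, S_{YX} \sqrt{1 + \frac{1}{n} + \frac{(X_i - \bar{X})^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}} = 4610.42 \pm 1687.69$$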
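Both interval formulas can be evaluated side by side for a store of 2,000 square feet. The sketch below takes the rounded coefficients, S_YX and the t value as quoted in the deck and computes only the mean and the sum of squared deviations of X from the data; the function name `interval_half_widths` is hypothetical. With these inputs the half-widths come out to roughly 613 for the mean and 1,688 for an individual store.

```python
import math

# Square footage of the seven stores in this deck's example.
square_feet = [1726, 1542, 2816, 5555, 1292, 2208, 1313]

# Values quoted in the deck (rounded as they appear there).
B0, B1 = 1636.415, 1.487  # fitted line
S_YX = 611.75             # standard error of the estimate
T_VALUE = 2.5706          # t value for 95% confidence, df = n - 2 = 5


def interval_half_widths(x_given, x_values):
    """Return (half-width for the mean of Y, half-width for an individual Y)."""
    n = len(x_values)
    x_bar = sum(x_values) / n
    sxx = sum((x - x_bar) ** 2 for x in x_values)
    h = 1 / n + (x_given - x_bar) ** 2 / sxx
    mean_half_width = T_VALUE * S_YX * math.sqrt(h)
    individual_half_width = T_VALUE * S_YX * math.sqrt(1 + h)
    return mean_half_width, individual_half_width


x_given = 2000
y_hat = B0 + B1 * x_given
ci_margin, pi_margin = interval_half_widths(x_given, square_feet)

print(f"Predicted sales: {y_hat:.2f} ($000)")
print(f"Mean of Y:    {y_hat:.2f} +/- {ci_margin:.2f}")
print(f"Individual Y: {y_hat:.2f} +/- {pi_margin:.2f}")
```

The interval for an individual store is always the wider of the two, because the extra 1 under the square root accounts for the scatter of single observations around the mean.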