Académique Documents
Professionnel Documents
Culture Documents
Common estimate of 2 is
n
1 X
Se2 = (yi (0 + 1 xi ))2
n 2 i=1
n
1 X 2
= e
n 2 i=1 i
SSE
= = MSerror
DFerror
1
Dividing by n2 makes Se2 an unbiased estimator for 2 .
(n 2) follows general d.f. rule:
Estimate 2 parameters in the model.
The residuals satisfy two constraints by LSE method.
1 / 17
Inference for 1
Discuss 1 in detail
Pn Pn
i=1 (xi x)(yi y ) (xi x)yi
1 = Pn 2
= Pi=1
n 2
i=1 (xi x) i=1 (xi x)
E(1 ) = 1
2 2
21 = Var(1 ) = Pn =
i=1 (xi x)
2 (n 1)SX2
2 / 17
Inference for 1
3 / 17
Inference for 0
Point estimate of 0 is 0 = y 1 x.
x 2
2 2 1
E(0 ) = 0 , 0 = Var(0 ) = +
n (n 1)SX2
0 N(0 , 2 )
0
q
1 x 2
0 has sample standard error S0 = Se n
+ (n1)SX2
4 / 17
Inference for Regression Line (or Conditional Means)
Inference for E(Y |X = x) = 0 + 1 x
For a chosen x0 ,
estimate is y0 = 0 + 1 x0 = y + 1 (x0 x).
E(y0 ) = Y |X =x
0 = 0 + 1x0
1 (x0 x)2
Var(y0 ) = 2 n
+ (n1)SX2
q
1 (x0 x)2
Sample standard error is Sy0 = Se n
+ (n1)SX2
.
5 / 17
Prediction
Standard error is
s
q 1 (x0 x)2
Sy ,pred = Se2 + Sy20 = Se 1+ +
n (n 1)SX2
(0 + 1 x0 ) tn2,1/2 Sy ,pred .
6 / 17
Example
Forbes Data
7 / 17
BOILING POINT BARAMETRIC NATURAL LOG OF
OF WATER PRESSURE BARAMETRIC
Obs (degrees F) (inches Hg) PRESSURE
1 194.3 20.79 3.034472
2 194.5 20.79 3.034472
3 197.9 22.40 3.109061
4 198.4 22.67 3.121042
5 199.4 23.15 3.141995
6 199.9 23.35 3.150597
7 200.9 23.89 3.173460
8 201.1 23.89 3.173460
9 201.3 24.01 3.178470
10 201.4 24.02 3.178887
11 203.6 25.14 3.224460
12 204.6 26.57 3.279783
13 208.6 27.76 3.323596
14 209.5 28.49 3.349553
15 210.7 29.04 3.368674
16 211.9 29.88 3.397189
17 212.2 30.06 3.403195
8 / 17
Forbes Data
3.5
3.4
Log Pressure
3.3
3.2
3.1
3.0
9 / 17
Analysis of Forbes Data
Proposed regression model
yi = 0 + 1 xi + ei
i.i.d
where ei N(0, 2 ), i = 1, , 17.
Yi = log(pressure)
Xi = boiling point ( F)
10 / 17
Analysis of Forbes Data
Residuals: ei = yi yi , i = 1, , 17.
11 / 17
Analysis of Forbes Data
Inference on 1 :
1 0 0.0206220
Evaluate T = S
= 0.000379
= 54.42. p-value <<
1
0.0001. Reject H0 and conclude that the slope is positive.
A 95% C.I. for the slope indicates that the slope is very
well estimated from these data
1 t15,0.975 S1
0.020622 (2.131)(0.00037895)
(0.0198, 0.0214)
12 / 17
Analysis of Forbes Data
Inference on 0
0 0 0.9710
Evaluate T = S
= 0.0769
= 12.6. p-value
0
<< 0.0001. Reject H0 and conclude that the intercept is
negative. (Is there a practical motivation to do this test?)
0 t15,0.975 S0
0.971 (2.131)(0.0769)
(1.135, 0.807)
13 / 17
Analysis of Forbes Data
Estimated mean is
y = 0 + 1 x = 0.9710 + (0.0206)(209) = 3.339
14 / 17
Analysis of Forbes Data
For every point x, compute 95% C.I. will get us a C.I. band for
the regression line.
3.5
Regression Line
95 percent C.I.
3.4
Log Pressure
3.3
3.2
3.1
3.0
15 / 17
Analysis of Forbes Data
Inference for prediction:
16 / 17
Analysis of Forbes Data
17 / 17