Académique Documents
Professionnel Documents
Culture Documents
FINAL EXAMINATION
(Semester 2: AY 2008–2009)
INSTRUCTIONS TO CANDIDATES
1. This examination paper contains THREE (3) questions and comprises FIFTEEN
(15) printed pages.
3. Read the questions CAREFULLY, and label some important quantities clearly at
your own convenience.
Matriculation No:
Question 1 2 3 Total
Max. marks 17 25 18 60
Marks scored
1
ST3241
1. [17 marks] Suppose that whether there are insect problems in grass fields being
sprayed by 4 different insecticides, T = k, for k = A, B, C, D, is of interest. It is
believed that size S of the grass fields, either small or large, may also affect this
response. The following logit model M is fitted.
_____________________________________________________________________________
T A 1 0 0
B 0 1 0
C 0 0 1
D -1 -1 -1
S large 1
small -1
Deviance 7.9645
Pearson 7.9348
2
Model Fit Statistics
Intercept
Intercept and
Criterion Only Covariates
Wald
Effect DF Chi-Square Pr > ChiSq
T 833.2472
S 50.0683 <.0001
Standard Wald
Parameter DF Estimate Error Chi-Square Pr > ChiSq
3
ST3241
4
ST3241
(b) Estimate the probability that there is insect problem in a small grass field treated
by insecticide D.
(c) Test at 1% significance level whether the response and different kinds of insecti-
cides are conditional independent. What is your conclusion?
5
ST3241
(d) Write the saturated model and find its maximized log-likelihood.
6
ST3241
2. [25 marks] Based on the “horseshoe crab” data displayed in P.76-77 of textbook or
P.64 of lecture notes, we are interested in understanding the probability (π) of pres-
ence of satellites residing nearby a female horseshoe crab by using two explanatory
variables, namely, spine condition (S) and weight in kg (Wt). There are 3 different
levels for S, namely, “both good” (S = 1), “one worn or broken” (S = 2), and “both
worn or broken” (S = 3). Given below is part of the SAS output by proc genmod for
fitting the logistic regression model
logit(π) = α + β1 Wt + β2 s2 + β3 s3 (1)
( (
1, if S = 2 1, if S = 3
where π = π(Wt, s2, s3), s2 = , and s3 = .
0, otherwise 0, otherwise
______________________________________________________________________________
Likelihood Ratio
Standard 95% Confidence Chi-
Parameter DF Estimate Error Limits Square Pr > ChiSq
7
ST3241
(a) Suppose that the prediction equation of model (1) says that with equal weight,
the estimated odds of presence of satellites for a crab with one spine worn or
broken is 0.741 times that for a crab with both spines in good condition. State
the equation.
(b) Construct a 95% confidence interval for the effect of weight on presence of satel-
lites. Interpret.
8
ST3241
(c) (i) Based on the available SAS output, is it possible that one can carry out the
likelihood-ratio test on the effect of spine condition (S) given the presence of
Wt in model (1)? Give your answer as “Yes” or “No”.
(d) Show that the asymptotic standard error of the sample logit, logit(π̂(2.55, 0, 0))
in model (1), is given by 0.4018 (up to 4 decimal places).
9
ST3241
(e) Construct a 95% confidence interval for the probability of presence of satellites
residing nearby a female crab with both spines in good condition and weight
2.55kg to test whether the chance is equal to one-half.
(f) Among a group of 5 female crabs, all with both spines in good condition and
the same weight wkg, estimate the chance that there are 3 out of 5 of them with
no satellite in terms of w. [Hint: Assume independence between all the crabs.]
10
ST3241
(g) Is it valid to use the deviance of value 195.3327 from the SAS output to perform an
overall goodness-of-fit test of the model (1) against the saturated model? Explain.
(h) In the likelihood-ratio test for H0 : β3 = 0, there is not enough evidence to reject
H0 at 1% significance level.
(i) write the resulting model, and
(ii) describe the association between spine condition and presence of satellites in
terms of the parameters in your written model for (h)(i).
11
ST3241
3. [18 marks] Suppose that whether there are imperfections in wafers being treated by 4
different lubricants, A, B, C and D, is of interest. Given the number of imperfections in
a sample of 4000 wafers, denoted by y1 , y2 , . . . , y4000 , such that the sample proportions
of “good” wafers (i.e., without imperfections) in 1000 wafers treated by A is pA = 0.554,
in 1000 wafers treated by B is pB = 0.546, in 1000 wafers treated by C is pC = 0.105,
and in 1000 wafers treated by D is pD , respectively. Suppose that the following logit
model is fitted.
logit(π) = βC TC + βD TD , (2)
and TD is a similarly defined indicator variable for the event whether a wafer is treated
by lubricant D.
Wafer
Lubricant Good Bad Total
A
B
C
D
Total 1304
12
ST3241
Suppose that an alternative model is also of interest,
where (
1 if a wafer is treated by lubricants A or B
TAB = .
0 otherwise
(d) Show that the likelihood function of the data under model (3) is given by
e1304α+1100β
.
[(1 + eα )(1 + eα+β )]2000
13
ST3241
(e) Is it possible that you can find the fitted equation? Why? If “Yes”, find it.
(f) Suppose that the maximized log-likelihoods of both models (2) and (3), respec-
tively denoted by L(2) and L(3) , are given. Describe how to choose between the
two models.
14
ST3241
[END OF PAPER]
15