Vous êtes sur la page 1sur 3

22/11/14 12:51

Published on STAT 503 - Design of Experiments


(https://onlinecourses.science.psu.edu/stat503)
Home > 2.2 - Sample Size Determination

2.2 - Sample Size Determination


The estimation approach to determining sample size addresses the question: "How accurate
do you want your estimate to be?" In this case we are estimating the difference in means.
This approach requires us to specify how large a difference we are interested in detecting, say
B for the Bound on the margin of error, and then to specify how certain we want to be that we
can detect a difference that large. Recall that when we assume equal sample sizes of n, a
confidence interval for 1- 2 is given by:

2
Y 1 Y 2 t(1 /2; df ) s
n }
{
Where n is the sample size for each group, and df = n + n - 2 = 2(n - 1) and s is the pooled
standard deviation. Therefore, we first specify B and then solve this equation:

2
B = t(1 /2; df ) s
n
for n. Therefore,
2

t 2 (1 /2; df ) s 2 2
2
n = t(1 /2; df ) s
=
B ]
B2
[
[
]
Since in practice, we don't know what s will be, prior to collecting the data, we will need a
guesstimate of to substitute into this equation. To do this by hand and we use z rather than
t since we don't know the df if we don't know the sample size n - the computer will iteratively
update the d.f. as it computes the sample size, giving a slightly larger sample size when n is
small.
So we need to have an estimate of 2, a desired margin of error bound B, that we want to
detect, and a confidence level 1-. With this we can determine sample size in this
comparative type of experiment. We may or may not have direct control over 2, but by using
different experimental designs we do have some control over this and we will address this
later in this course. In most cases an estimate of 2 is needed in order to determine the
sample size.

https://onlinecourses.science.psu.edu/stat503/print/book/export/html/10

Pgina 1 de 3

22/11/14 12:51

One special extension of this method is when we have a binomial situation. In this
case where we are estimating proportions rather than some quantitative mean
level, we know that the worst-case variance, p(1-p), is where p (the true
proportion) is equal to 0.5 and then we would have an approximate sample size
formula that is simpler, namely n = 2/B2 for = 0.05.

Another Two-Sample Example Paired Samples


In the paired sample situation, we have a group of subjects where each subject has two
measurements taken. For example, blood pressure was measured before and after a
treatment was administered for five subjects. These are not independent samples, since for
each subject, two measurements are taken, which are typically correlated hence we call this
paired data. If we perform a two sample independent t-test, ignoring the pairing for the
moment we lose the benefit of the pairing, and the variability among subjects is part of the
error. By using a paired t-test, the analysis is based on the differences (after before) and
thus any variation among subjects is eliminated.
In our Minitab output we show the example with Blood Pressure on five subjects.

[1]

By viewing the output, we see that the different patients' blood pressures seem to vary a lot
(standard deviation about 12) but the treatment seems to make a small but consistent
difference with each subject. Clearly we have a nuisance factor involved - the subject - which
is causing much of this variation. This is a stereotypical situation where because the
observations are correlated and paired and we should do a paired t-test.
These results show that by using a paired design and taking into account the pairing of the
data we have reduced the variance. Hence our test gives a more powerful conclusion
regarding the significance of the difference in means.
The paired t-test is our first example of a blocking design. In this context the subject is used
https://onlinecourses.science.psu.edu/stat503/print/book/export/html/10

Pgina 2 de 3

22/11/14 12:51

as a block, and the results from the paired t-test are identical to what we will find when we
analyze this as a Randomize Complete Block Design from lesson 4.
2014 The Pennsylvania State University. All rights reserved.
Source URL: https://onlinecourses.science.psu.edu/stat503/node/10
Links:
[1] https://onlinecourses.science.psu.edu/stat503/javascript:popup_window(
'/stat503/sites/onlinecourses.science.psu.edu.stat503/files/lesson02/L02_2sample_viewlet_swf.html', 'l02_2sample',
704, 652 );

https://onlinecourses.science.psu.edu/stat503/print/book/export/html/10

Pgina 3 de 3

Vous aimerez peut-être aussi