Académique Documents
Professionnel Documents
Culture Documents
com
Chapter 536
Binary Diagnostic
Tests – Paired Samples
Introduction
An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by
comparing summary measures of diagnostic accuracy such as sensitivity or specificity using a statistical test.
Often, you want to show that a new test is similar to another test, in which case you use an equivalence test. Or,
you may wish to show that a new diagnostic test is not inferior to the existing test, so you use a noninferiority test.
All of these hypothesis tests are available in this procedure for the important case when the diagnostic tests
provide a binary (yes or no) result.
Experimental Design
Suppose you are interested in comparing the sensitivities of two diagnostic tests for a particular disease (or
condition). Each test provides a binary (yes or no) response. Further suppose you draw a random sample of
subjects from the population with the disease and administered both diagnostic tests to each subject in random
order. Assume that Test 1 is a new (experimental or treatment) test that will replace Test 2, the existing (standard
or reference) test, if it is found to be better.
The results of such a study can be displayed in a 2-by-2 table in which the Test 1 result is shown as the rows and
the Test 2 result is shown as the columns.
Test 2 Result
Test 1 Result Positive Negative Total
Positive X11 X10 m1
Negative X01 X00 m0
Total n1 n0 N
Data such as this can be analyzed using standard techniques for comparing two correlated proportions which are
presented in the chapter on Two Correlated Proportions. Such a table was originally analyzed using McNemar’s
Test. However, procedures with better statistical properties have recently been proposed. See for example Nam
and Blackwelder (2002).
Sensitivity
Sensitivity is the proportion of those that have the condition for which the diagnostic test is positive. Since this
design assumes that the subjects come from the population of individuals with the disease, the sensitivity can be
calculated.
536-1
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
Specificity
Specificity is the proportion of those that do not have the condition for which the diagnostic test is negative. To
study specificity, a separate study would have to be conducted in which subjects were drawn from the population
of individuals without the disease. The data from a such a study could be analyzed with this procedure by
changing the meaning of positive and negative. Instead of positive meaning that the person had the disease,
positive would mean that the diagnostic test result matched the true condition of the subject. Likewise, negative
would mean that the diagnostic test result did not match the true condition. In the procedure printouts, you would
substitute specificity for sensitivity.
Test 2 Result
Test 1 Result Positive Negative Total
Positive X11 X10 m1
Negative X01 X00 m0
Total n1 n0 N
536-2
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
Data Structure
This procedure does not use data from a dataset. Instead, you enter the values directly into the 2-by-2 table on the
panel.
Procedure Options
This section describes the options available in this procedure.
Data Tab
Enter the data values directly on this panel.
Data Values
X11
This is the number of patients that responded positively to both diagnostic tests. The value entered must be a non-
negative number.
X10
This is the number of patients that tested positive using Test 1, but negative using Test 2. The value entered must
be a non-negative number.
X01
This is the number of patients that tested negative using Test 1, but positive using Test 2. The value entered must
be a non-negative number.
X00
This is the number of patients that responded negatively to both diagnostic tests. The value entered must be a non-
negative number.
Report Options
Alpha – Confidence Intervals
The confidence coefficient to use for calculating the confidence limits in proportions. 100 x (1 - alpha)%
confidence limits will be calculated. This must be a value between 0 and 0.5. The most common choice is 0.05.
536-3
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
Test 2 Result
Test 1 Result Positive Negative Total
Positive 31 5 36
Negative 4 10 14
Total 35 15 50
You may follow along here by making the appropriate entries or load the completed template Example 1 by
clicking on Open Example Template from the File menu of the Binary Diagnostic Tests – Paired Samples
window.
536-4
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
These reports display the counts that were entered along with various proportions that make interpreting the table
easier. Note that Test 1’s sensitivity of 0.7200 and Test 2’s sensitivity of 0.7000 are displayed in the margins of
the Table Proportions table.
Notes:
Sensitivity: proportion of those that actually have the condition for which the diagnostic test is positive.
Difference confidence limits based on Nam's RMLE method.
Ratio confidence limits based on Blackwelder and Nam's method.
This report displays the sensitivity for each test as well as corresponding confidence interval. It also shows the
value and confidence interval for the difference and ratio of the sensitivity. Note that for a perfect diagnostic test,
the sensitivity would be one. Hence, the larger the values the better.
Note that the type of confidence interval for the difference and ratio is specified on the Data panel.
536-5
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
Notes:
Odds Ratio = Odds(True Condition = +) / Odds(True Condition = -)
where
Odds(Condition) = P(Positive Test | Condition) / P(Negative Test | Condition)
This report displays estimates of the odds ratio as well as its confidence interval.
This report displays the results of hypothesis tests comparing the sensitivities of the two diagnostic tests using
Nam’s test. Note that for this test, identical test results are obtained from either the test of differences or test of
ratios.
Tests of Equivalence
Reject H0
Lower Upper and Conclude
90.0% 90.0% Lower Upper Equivalence
Prob Conf. Conf. Equiv. Equiv. at the 5.0%
Statistic Level Limit Limit Bound Bound Significance Level
Difference (Se1-Se2) 0.0051 -0.0859 0.1274 -0.2000 0.2000 Yes
Ratio (Se1/Se2) 0.0247 0.8833 1.2039 0.8000 1.2500 Yes
Notes:
Equivalence is concluded when the confidence limits fall completely inside the equivalence bounds.
Difference confidence limits based on Nam's RMLE method.
Ratio confidence limits based on Blackwelder and Nam's method.
This report displays the results of the equivalence tests of sensitivity, one based on the difference and the other
based on the ratio. Equivalence is concluded if the confidence limits are inside the equivalence bounds.
Prob Level
The probability level is the smallest value of alpha that would result in rejection of the null hypothesis. It is
interpreted as any other significance level. That is, reject the null hypothesis when this value is less than the
desired significance level.
Note that for many types of confidence limits, a closed form solution for this value does not exist and it must be
searched for.
Confidence Limits
These are the lower and upper confidence limits calculated using the method you specified. Note that for
equivalence tests, these intervals use twice the alpha. Hence, for a 5% equivalence test, the confidence coefficient
is 0.90, not 0.95.
536-6
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Binary Diagnostic Tests – Paired Samples
Notes:
H0: The Sensitivity of Test2 is inferior to Test1.
Ha: The Sensitivity of Test2 is noninferior to Test1.
The noninferiority of Test2 compared to Test1 is concluded when the upper c.l. < upper bound.
Difference confidence limits based on Nam's RMLE method.
Ratio confidence limits based on Blackwelder and Nam's method.
This report displays the results of two noninferiority tests of sensitivity, one based on the difference and the other
based on the ratio. Report definitions are identical with those above for equivalence.
536-7
© NCSS, LLC. All Rights Reserved.