Académique Documents
Professionnel Documents
Culture Documents
Note: D : Discrimination Index
UG : total number of students who answered correctly in the upper
group
LG : total number of students who answered correctly in the lower
group
N : total number of students
Upper Group of the students are 10 students, can be calculated as follows:
Upper group = 36 27% = 9.72 = 10 students
Lower Group of the students are 10 students, can be calculated as follows:
Lower group = 36 27% = 9.72 = 10 students
No UG LG UG-LG D
1 50 30 20 2.22
2 38 14 24 2.67
3 40 15 25 2.78
4 41 14 27 3
5 40 26 14 1.56
6 38 27 11 1.22
7 40 27 13 1.44
8 38 25 13 1.44
9 40 21 19 2.11
10 43 23 20 2.22
11 40 23 17 1.89
12 36 25 11 1.22
13 33 25 8 0.89
14 34 16 18 2
15 32 14 18 2
6. Content Validity
Validity concerns the extent to which the scale really measures the attitude
construct of interest. It is often difficult to locate criteria to be used in obtaining
evidence for the validity of attitude scales. Some researchers have used
observations of actual behavior as the criterion for the attitude being measured.
This procedure is not often used because it is often difficult to determine what
behavior would be the best criterion for the attitude and also because it is
expensive.
One of the easiest ways to gather validity evidence is to determine the
extent to which the scale is capable of discriminating between two groups whose
members are known to have different attitudes. To validate a scale that measures
attitudes toward organized religion, a researcher would determine if the scale
discriminated between active church members and people who do not attend
church or have no church affiliation. A scale measuring attitudes toward abortion
should discriminate between members of pro-life groups and members of pro-
choice groups. By discriminate, we mean that the two groups would be
expected to have significantly different mean scores on the scale. Another method
of assessing validity is to correlate scores on the attitude scale with those obtained
on another attitude scale measuring the same construct and whose validity is well
established.
The content validity of the items are first analyzed using expert judgments.
In this case, there are two judges to judge the validity by using the following
table.
Criteria for content validity are as follows.
0.8 - 1 Very high
0.6 - 0.79 High
0.4 - 0.59 Moderate
Below 0.4 Low
The expert judgments of relevant and irrelevant test items made to assess
reading comprehension of seventh grade students of junior high school can be
seen in the following table.
Judge I
Judge II
Irrelevant Relevant
Irrelevant
(A) (B)
- -
Relevant
(C) (D)
- 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12,
13, 14, 15
Then, the result of content validity testing is analyzed by the formula of Gregory
and below is the calculation.
ontent alidity
.00
As the index of content validity is 1.00 and considered as very high, the multiple
choice test items made in this paper to assess reading comprehension of seventh
grade students of junior high school can be said as valid.
7. Empirical Validity
The instrument with non-dichotomy-scaled data is the test which the data
are in the gradation scale (e.g. a questionnaire with the scale of 1 5). Pearsons
Product Moment Correlation is used to measure the empirical validity of an
instrument with non-dichotomy-scaled data. r critical value (r
cv
) is used to draw
conclusion with (df = n-2). If r
obs
exceeds r
cv
(r
obs
> r
cv
), the item is valid.
The statements used in the present study are then analyzed empirically in
terms of the validity of the test using the formula below.
r
n(xy) (x) (y)
[n x
(x)
] [n y
(y)
]
Note:
x : score of respondent on the discovered item
y : total score of respondent
n : number of respondent
X r hitung
r tabel
(db= 34)
Decision
1 0.890657 0.3291 V
2 0.873144 0.3291 V
3 0.858828 0.3291 V
4 0.904195 0.3291 V
5 0.751701 0.3291 V
6 0.713502 0.3291 V
7 0.787648 0.3291 V
8 0.722137 0.3291 V
9 0.765035 0.3291 V
10 0.795436 0.3291 V
11 0.754931 0.3291 V
12 0.656499 0.3291 V
13 0.556737 0.3291 V
14 0.81815 0.3291 V
15 0.781806 0.3291 V
From the calculation above, it can concluded that the 15 statements used in
questionnaire to measure students motivation are considered valid.
8. Reliability
Reliability is a consistency of an instrument. An instrument with high
reliability tends to give a relatively same result at any given time the instrument is
used. Some statistical formula of reliability are Spearman-Brown Formula,
Kuder-Richadson Formula (KR), Hoyt Formula. Those formulas are specific to
the characteristic of the data. The more general formula was developed by
Cronbach called Alpha Cronbach Formula which can be used to both dichotomy
and politomy (non-dichotomy) type data.
The reliability of the test items used in this paper is analyzed using Alpha
Cronbach formula as the following because the data gathered is classified into
politomy (non-dichotomy) type data.
)
Note:
r : Reliability coefficient of instrument (cronbach alpha)
k : The number of test items
(
.
.
)
. ( .)
. (.)
.
The instrument is said having high reliability if the coefficient is > 0.60 (Ghozali,
2002). Hence, the structure test items were considered to be reliable enough to
gather the data for the present study.
References:
Rosen, Jeffrey A.; Glennie, Elizabeth J.; Dalton, Ben W.; Lennon, Jean M.;
Bozick, Robert N. 2010. Noncognitive Skills in the Classroom: New
Perspectives on Educational Research. USA: RTI International.
Ary, Donald; Jacobs, Lucy Cheser; Sorensen, Chris; Razavieh, Asghar. 2010.
Introduction to Research in Education. Eighth Edition. Wadsworth:
Cengage Learning.
Motivation and Interest Research Project - Student Survey, the University of
Sydney.