Académique Documents
Professionnel Documents
Culture Documents
CONTENTS
Measuring Reliability 2
Indicators Quality 3
Kinds of Validity 5
Content Validity 6
Table Of Specification 8
Validity | 1
MEASURING RELIABILITY
TEST – RETEST
- Give the same test twice to the same group with any time interval between tests.
- Give two forms of the test to the same group in close succession.
SPLIT-HALF
- Test has two equivalent halves. Give test once, score two equivalent halves (odd
items vs. even items).
Validity | 2
INDICATORS OF QUALITY
- Validity
- Reliability
- Utility
- Fairness
Validity | 3
VALIDITY
- Test length
- Item type
Validity | 4
KINDS OF VALIDITY
Face Validity
Content Validity
Construct Validity
Criterion Validity
Predictive
Discriminant Validity
FACE VALIDITY
- If the items seem to be reasonably related to the perceived purpose of the test
- This is not really a type of validity, because it does not offer evidence to support
conclusions drawn from test scores.
Validity | 5
CONTENT VALIDITY
‘ You assigned Chapters 1 - 5, but nearly all of the items came from
Chapters 1-2 — How can you evaluate whether we know anything
about the other material we were supposed to read?
- Can you test students on things they have not been taught?
Validity | 6
CONSTRUCT-IRRELEVANT VARIANCE
- Table of Specification
E.g.
At the end of the chapter, the student will be able to do the following:
4. Differentiate between our stars, the sun, and all other stars
Validity | 7
TABLE OF SPECIFICATION
Categories of Performance
(Mental Skills)
Content Areas
3. Constellations
4. Galaxies
Total Grand
Total
CRITERION VALIDITY
Validity | 8
- It is the degree to which content on a test (predictor) correlates with
performance on relevant criterion measures (concrete criterion in the
"real" world?).
- If they do correlate highly, it means that the test (predictor) is a valid one!
- E.g. test of dictionary skills can estimate students’ current skills in the
actual use of dictionary – observation
Validity | 9
CONSTRUCT VALIDITY
- Interest
- Attention span
- Hours spent
CONVERGENT EVIDENCE
- When a measure correlates well with other tests believed to measure the
same construct
DIVERGENT EVIDENCE
Validity | 10
FACTORS THAT CAN LOWER VALIDITY
- Unclear directions
- Ambiguity in statements
- Teaching
- Nature of criterion
Validity | 11