Académique Documents
Professionnel Documents
Culture Documents
Learning I
LECTURE
Courtesy: Sir Von Anthony
Torio – PNU Manila
Assessment of Learning I
BASIC TERMS AND CONCEPT
Measurement
`
- quantification of what students learned through the use of
test, questionnaires, rating scales and the like (Garcia,
2008)
- answers the question, how much does a student learn or
know
Evaluation
`
- is a process of making judgments, assigning value or
deciding on the worth of student’s performance
- It answers the question how good, adequate or desirable is
the performance of the students?
Evaluation
`
- is a process of making judgments, assigning value or
deciding on the worth of student’s performance
- It answers the question how good, adequate or desirable is
the performance of the students?
Reliability `
- it is the accuracy with which test consistently measure
that which it does measure
A test is reliable if it produces similar results
Make a comment
- reliable test are valid, valid test are reliable
Properties of Measurement
Identity
`
Magnitude
Equal Intervals
Absolute Zero
SCALES OF MEASUREMENT
Nominal, Ordinal, Interval, Ratio
Properties of Measurement
`
Identity (each value has a unique meaning)
Magnitude (ordered relationship)
Equal Intervals (scale units are equal to each other)
Absolute Zero (has a true zero point)
SCALES OF MEASUREMENT
Nominal, Ordinal, Interval, Ratio
`
Traditional
Performance
Portfolio
`
Placement
Summative
Formative
Diagnostic
`
(1) Availability – Evaluation should be readily accessible for
students’ improvement
(2) Significance – Evaluation should be indicative as to
what is currently needed
(3) Scope – Evaluation should be defined by the objectives of
an undertaking
`
(4) Objectivity – Evaluation should not show any form of
bias in its results or even during the process
(5) Compatibility – Evaluation should match with the
objectives
(6) Reliability - Evaluation should reflect consistency of
results
(7) Validity – Evaluation should reflect what is intended to be
measured.
`
(8) Collaborative – Evaluation should be a concerted effort.
(9) Variety – Evaluation should not be limited to only a single
form, it should be of different forms possible and availbe.
(10). Fairness – Evaluation should reflect objective judgment
regardless of the evaluated student.
(11) Continuity - Evaluation is deemed to take place before,
during and after instruction
ACCORDING TO PURPOSE `
1. Psychological – aim is to measure student’s intelligence or
mental ability in a large degree without reference to what the
students has learned. It serves to measure intangible
characteristics of an individual (e.g. Aptitude, Personality,
Intelligence)
2. Educational - Aims to measure the result of teaching and
learning (e.g. Achievement, Performance)
ACCORDING TO SCOPE
`
Survey – covers broad range of objectives. Measures general
achievement in certain subjects. Made by experts
(professionals)
Mastery – covers specific objectives. Measures fundamental
skills and abilities. Teacher made
ACCORDING TO CONSTRUCTION
`
Standardized – Constructed by an expert. Covers a broad
content. Usually in Multiple Choice format. Items are
screened. May be scored by a machine. Usually norm-
referenced when interpreted.
Informal – Teacher-made. May cover a narrow range of
content. May include various types of test. Items are picked
by a teacher. Scored by a teacher. Usually criterion-
referenced when scores are interpreted.
`
ACCORDING TO MANNER OF ADMINSITRATION
Individual – Maybe oral or may require actual
demonstration.
`
ACCORDING TO REFERENCE OF COMPARISON
Norm-Referenced – compares the result of the exam with
respect to other examinees. A group of students will certainly
pass. Performance of students is compared among them
`
ACCORDING TO MODE OF LANGUAGE
Verbal – uses words in meaning-making or responding to
test questions
Non-Verbal - words are not used in responding to test items
(e.g. number sequence, 3-D figures, graphs)
Selective Type
`
a. Multiple Choice – consists of a stem which describes the
problem and 3 or more alternatives which give the suggested
solutions. The incorrect alternatives are the distractors.
b. True-False or Alternative Response – consists of
declarative statement that one has to mark true or false,
right or wrong, correct or incorrect, yes or no, fact or
opinion, and the like.
`
c. Matching Type – consists of two parallel columns:
Column A, the column of premises from which a match is
sought; Column B, the column of responses from which the
selection is made
`
Supply Test
a. Short Answer – uses a direct question that can be
answered by a word, phrase, a number, or a symbol
b. Completion Test – it consists of an incomplete statement
`
Supply Test
a. Short Answer – uses a direct question that can be
answered by a word, phrase, a number, or a symbol
b. Completion Test – it consists of an incomplete statement
Essay Test `
a. Restricted Response – limits the content of the response
by restricting the scope of the topic
b. Extended Response – allows the students to select any
factual information that they think is pertinent, to organize
their answers in accordance with their best judgment
Essay Test `
a. Restricted Response – limits the content of the response
by restricting the scope of the topic
b. Extended Response – allows the students to select any
factual information that they think is pertinent, to organize
their answers in accordance with their best judgment
`
Refers to the process of examining the student’s response
to each item in the test.
Three criteria in determining the desirability and
undesirability of an item.
A. Difficulty of an item
B. Discriminating power of an item (Positive, Negative,
Zero)
C. Measures of attractiveness
`
Refers to the process of examining the student’s response
to each item in the test.
Three criteria in determining the desirability and
undesirability of an item.
A. Difficulty of an item
B. Discriminating power of an item (Positive, Negative,
Zero)
C. Measures of attractiveness
`
1. Score the test. Arrange from lowest to highest.
2. Get the top 27% (T27) and below 27% (B27) of the
examinees.
3. Get the proportion of the Top and Below who got each
item correct.
4. Compute for the Difficulty Index.
5. Compute for the Index of Discrimination
Df = NSC/ TNS
Where:
Df - difficulty index
`
NSC- number of students who got the answers correctly
TNS- total number of students in the Upper and Lower Class
INTERPRETATION
Difficulty Index
.81 – 1.0 = easy (revise)
.21 – .80 = average (accept)
0 – .20 = very difficult (reject)
Di = (Uc-Lc )/ NSC
Where `
Uc- students in the upper class who got the answer correctly
Lc- students in the lower class who got the answer correctly
NSG- number of students in each class
Discrimination Index
.31 – above = good (accept)
.20 – .30 = moderate (revise)
.19 and below = poor (reject)
`
First-impression effect: Judgment is based on early opinions
rather than on a complete picture (e.g., opening paragraph)
Contrast effect: Judging by comparing student against other
students instead of established criteria and standards
Rater drift: Unintentionally redefining criteria and standards
over time or across a series of scorings (e.g., getting tired and
cranky and therefore more severe, getting tired and reading
more quickly/leniently to get the job done)
Frequency Distribution `
- is a tabular arrangement of data into
appropriate categories showing the number of
observations in each category or group
- It encompasses the size of the table and makes
the data more interpretive (Gabuyo,2012)
`
Parts of Frequency Table
1. Class limit
- is the groupings or categories defined by the
lower and upper limits
`
Lower Class limit (LL)
- represents the smallest number in each group
Upper Class limit (UL)
- represents the highest number in each group
Example:
LL `- UL
10 - 14
15 - 19
20 - 24
3. Class Boundaries `
- are the numbers used to separate each category
in the frequency distribution but without gaps
created by class limits
- the scores of the students are discrete
`
Example of Class Boundaries
LL – UL LCB - UCB
10 - 14 9.5 - 14.5
15 - 19 14.5 – 19.5
20 - 24 19.5 – 24.5
`
4. Class marks
- are the midpoint of the upper and lower class limit
XM = LL + UL
2
Example: `
XM = LL + UL
2
LL UL XM
10 14 12
15 19 17
20 24 22
17 25 30 33 25 45 23 19
`
27 35 45 48 20 38 39 18
44 22 46 26 36 29 15 - 21
LS
50 - 47 34 26 37 25 33 49
HS
22 33 44 38 46 41 37 32
Normal/Bell-Shaped/Symmetrical
Positively Skewed `
Negatively Skewed
Leptokurtic
Mesokurtic
Platykurtic
Bimodal curve
Polymodal curve