Académique Documents
Professionnel Documents
Culture Documents
140247
8
Construct validity: the extent to which the content of the test/assessment reflects
current theoretical understandings of the skill(s) being assessed;
Content validity: whether it represents an adequate sample of ability; and
Criterion-related validity: the extent to which the results correlate with other
independent measures of ability.
PURPOSES
Assessment is carried out to collect information on learners' language proficiency and/or
achievement that can be used by the stakeholders in language learning program for
various purposes. These purposes include:
SUMMARY #26
DEASTIKA BAYUNING SUDJASMARA
140247
8
Background
In a series of research studies, John Oiler and his colleagues found strong relationships
between testers' performance in integrative tests and in the sub-components of various
other
test batteries testing other language skills, such as writing and speaking. On the basis of
these
findings, Oiler hypothesized that there was a single general proficiency factor which
underlay
test
performance. This became known as the 'unitary competence hypothesis' (Oiler 1976;
Oiler
and
Hinofotis 1980). However, in the face of critiques of the methodology used in the studies,
Oiler
(1983: 353) subsequently modified the hypothesis, acknowledging that language
proficiency
was
made up of multiple components. It is now generally accepted that a single test of
overall
ability,
such as a cloze passage, does not give an accurate picture of an individual's proficiency
and
that
a
range of different assessment procedures are necessary (Cohen 1994: 196).
Many language tests and assessments used nowadays often contain tasks which
resemble the kinds of language-use situations that test takers would encounter in using
the language for communicative purposes in everyday life. The kinds of tasks used in
communicative assessments of proficiency and achievement thus typically include
activities such as oral interviews, listening to and reading extracts from
the media and various kinds of 'authentic' writing tasks which reflect real-life demands
(for
a
range of examples, see Weir 1990, 1993).
Research
There has been an enormous amount of research activity in language assessment in
recent
years;
for a comprehensive overview, see Clapham and Corson (1997). This volume contains
state-of-heart surveys of a wide range of current issues in language assessment. A
summary of trends in language testing is also provided by Douglas (1995), while
Shohamy (1995) discusses the particular issues and problems involved in the
assessment of language performance. Hamayan (1995) describes a variety of
assessment procedures not involving the use of formal tests. Kunnan (1997) categorizes
over a hundred language testing research studies in terms of the framework for
language test validation proposed by Messick (1980). Current developments and
research in the assessment of the skills of listening, speaking, reading and writing,
respectively, are provided by Brindley (1998b), Turner (1998), Perkins (1998) and Kroll
(1998).
SUMMARY #26
DEASTIKA BAYUNING SUDJASMARA
140247
8
The brief review of research below focuses on two important issues in language
assessment:
(1) the key question of how to define language ability and (2) self-assessment of
language
ability.
Among the many important research topics that would merit attention in a longer review
include
the relationship between test-taker characteristics and test performance (e.g. Kunnan
1997), testtaker strategies (e.g. Storey 1998), test-taker discourse (e.g. O'Loughlin
1997),
factors
influencing
task difficulty (e.g. Fulcher 1996a), rater and interviewer behavior (e.g. Weigle 1994;
McNamara
1996; Morton et al. 1997) and applications of measurement theory to test analysis (e.g.
Lynch
and
McNamara 1998). The important question of the impact of assessment and testing on
teaching
and learning (known as washback) is also beginning to receive a good deal of attention
in
the
language assessment literature (Alderson and Wall 1993) as are issues of ethics and
fairness
(Hamp-Lyons 1998)
RESEARCH INTO THE NATURE OF COMMUNICATIVE LANGUAGE ABILITY
Many assessment specialists would argue that the fundamental issue in language
assessment
is
that
of construct validity. In other words, as Spolsky (1985) asks: 'What does it mean to know
how
to
use a language?' To answer this question, it is necessary to describe the nature of the
abilities
being
assessed; this is known as construct definition. Thus, if we are developing a test of
'speaking',
we
need to be able to specify what we mean by 'speaking ability', i.e. what its components
are
and
how these components are drawn on by different kinds of speaking tasks.
In recent years, two approaches to construct definition have been adopted (McNamara
1996). The first approach focuses on compiling detailed specifications of the features of
target language performances which learners have to carry out, often on the basis of an
analysis of communicative needs (Shohamy 1995). These features form the criteria for
assessment and are built into assessment instruments such as proficiency rating scales;
e.g. see the well-known scale used by the American Council on the Teaching of Foreign
Languages (ACTFL 1986) to assess language ability of foreign language teachers.
The second approach, rather than starting with an analysis of the language that the
learner needs to use, employs a theoretical model of language ability as a basis for
constructing tests and assessment tools; see Canale and Swain 1980; Bachman 1990
(the latter updated by Bachman and Palmer 1996). Such models provide a detailed and
SUMMARY #26
DEASTIKA BAYUNING SUDJASMARA
140247
8