Assesing Listening
Assesing Listening
Assesing Listening
Before focusing on listening itself, think about the two interacting concepts of
performance and observation. All language users perform the acts of listening, speaking,
reading, and writing. They of course rely on their uderlying competence in order to
accomplish these performances. When you propose to asses someones ability in one or a
combination of the four skills, you asses that persons competence, but you obseve the
persons performance. Sometimes the performance does not indicate true competence: a bad
nights rest, illness, an emotional distraction, test anxiety, a memory block, or other student-
related reliability factors could affect performace, thereby providing an unreliable measure of
acyual competence.
So, one one important principle for assessing a learners competence is to consider the
fallibility of the results of a single performance, such as that produced in a test. As with any
attempt at measurement, it is your obligation as a teacher to triangulate your measurements:
consider at least two (or more) performance and/or contexts before drawing a conclusion.
That could take the form of one or more of the following designs:
Several tests that are combined to form an assessment.
A single test with multiple test tasks to account for learning style and performance
variables.
In-class and extra-class graded work.
Alternative forms of assessment (e.g., journal, portofolio, conference, observation, self-
assessment, peer-assessment).
Multiple measures will always give you a more relianble and valid assessment than a single
measure.
A second principle is one that teachers often forget. We must rely as much as possible
onobservable performance in our assessments of students. Observable means being able to
see or hear performance of the learner (the sense of touch, taste, and smell dont apply very
often to language testing!). Receptive skills, then are clearly the more enigmatic of the two
modes of performance. You cannot observe the actual act of listening or reading, nor can
you see or hear an actual product! You can obeserve learners only while they are listening or
reading. The upshot is that all assessment of listening and reading must be made on the basis
of observing the test-takers speaking or writing (or nonverbal response), and not on the
listening or reading itself. So, all assessment of receptive performance must be made by
inference!
Listening has often played second fiddle to its counterpart, speking. In the standardized
testing industry, a number of separate oral production test are available, but it is rare to find
just a listening test. One reason for this emphasis is that listening is often implied as a
component of speaking. How could you speak a language without also listening? In addition,
the overtly observable nature of speaking renders it more empirically measurable then
listening. But, perhaps a deeper cause lies in universal biases toward speaking. A good
speaker is often (unwisely) valued more highly than a good listener. To determine if someone
is proficient user of a language, people customarily asks, Do you speak Spanish? people
rarely ask, Do you understand and speak Spanish?.
Every teacher of language knows thats one oral production ability-other than
monologues, speeches, reading aloud, and the like-is only as good as ones listening
comprehension ability. But of even further impact is the likelihood that input in the aural-oral
mode accounts for a large propotion of successful language acquisition. I n a typical day, we
do measurably more listening than speaking. Whether in the workplace, educational, or home
contexts, aural comprehension far outstrips oral production in quantifiable terms of time,
number of words, effort, and attenstion. We therefore need to pay close attention to listening
as a mode of performance for assessment in the classroom.
As with all effectives tests, designing appropriate assesment task in listening begin with
the specifications of objectives, or criteria. Those objective may be clasified in term of
several types of listening performance, Think about what you do when you listen. Literally in
nanoseconds, the folllowing processes flash through your brain.
1. You recognize specch sounds and hold a temporary imprint of them in short term
memory.
2. You simultancously determine the type of speech event ( monologue, interpersonal
dialogue transactional dialogue) that is being processed and attend to its context (who
the speaker is,location,purpose) and the contect of the message.
3. You use (bottom-up) linguistic decoding skills and/or (top-down) background
schemata to bring a plausible interpretation to the message,and assign a literal and
intended meaning to the utterance.
4. In most cases (except for repetition tasks,which involve short-term memory only)
You delete the exact linguistic form in which the message was originally received in
favor of conceptually retaining improntant of relevant information in long-term
memory.
From these stages we can derive four cammonly indentified types of listening
performance,each of which comprises a category within which tolconsider assessment tasks
and procedures.
For full comprehension, test-takers may at the extensive level need to invoke interactive skills
(perhaps note-taking, questioning, discussion): listening that includes all four of the above
types as test-takers actively participate in discussions, debates, conversations, role plays, and
pair group work. Their listening perfomance must be intricately integrated with speaking (and
perhaps other skills) in the authentic give-and-take of communicative interchange.
A useful way of synthesizing the above two list is two consider a finite number of micro
And macroskills implied in the performance of listening comprehension. Richard (1983) list
of microskills has proven useful in the domain of specifying objectives for learning and may
be even more useful in forcing test makers to carefully identify spesific assessment
obectives. In the following box, the skills are subdivided into what I prefer to think of as
microskills (attending to the smaller bits and chunk of language, in more of a bottom up
process) and macroskills (focus on the large r element involved in a top down approach to the
listening task). The, micro adn macroskills provide 17 different objectives to asses in
listening.
Microskills
1. Discriminate among the distinctive sounds of english.
2. Retain chunk of language of different lengths in short term memory.
3. Recognize English stress patterns, word in stressed and unstressed positions, rytmic
structure , intonation contour, and their role in signaling information.
4. Recognize reduced forms of word.
5. Distinguish word boundaries, recognize a core of word, and intepret word order
patterns and their significance.
6. Process speech at different rates of delivery.
7. Process speech containing pauses, error, corrections and other performance variables.
8. Recognize grammatical word classes (nouns, verbs,etc) System (e.g tense, agreement,
pluralization), pattern,rules and elliptical form.
9. Detect sentence constituents and distinguish between major and minor constituents.
10. Recognize that a particular meaning may be expressed in different grammatical form.
11. Recognize cohesive devices in spoken discourse.
Macroskills
12. Recognize the communication functions of utterances, according to situations,
participants, goals.
13. Infer situations, participants, gooals using real word knowledge.
14. From events,ideas, adn so on.describe, predict outcomes, infer links and connections
between events, deduce causes and effect, and detect such relations as main idea,
suppporting idea,new information,given information,generalization , axemplifications.
15. Distinguish between literal and implieds meanings.
16. Use facial, kinesic, body language and other nonverbal clues to decipher meanings.
17. Develop and use a battery of listening strategies, such as detecting key word, guessing
the meaning of word from context, appealing for help, and signaling comprehension
or lack thereof.
Implied in the taxonomy above is a notion of what makes many aspect s of listening
difficult, or why listening is not simply a linear process of recording strings, of language as
they are transmitted into our brains. Developing a sense of which aspects of listening
performance are predictably difficult will help you to challange your students appropriately
and to assign weight to items. Consider the following list of what makes listening difficult (
adapted from Richard, 1983; Ur, 1984;Dunkel, 1991)
Once you have determined objective, your next step is to design the tasks, including
making decision about how you will elicit performance and how you will expect the test-
taker to respond. We will look at tasks that range from intensive llistening performance, such
as minimal phonemic pair recognition, to extensive comprehension of language in
communicative contexts. The focus in this section is on the microskills of intensive listening.
(b) Is he living?
In both cases above, minimal phonemic distinctions are the target. If you are testing
recognition of morphology, you can use the same format:
Hearing the past tense morpheme in this sentence challenges even advanced learners,
especially if no context is provided. Stressed and unstressed words may also be tested with
the same rubric. In the following example, the reduced form (contraction) of can not is tested:
Because they are decontextualized, these kinds of tasks leave something to be desired in their
authenticity. But they are a step better than items that simply provide a one-word stimulus:
One-word stimulus
(b) wine
Paraphrase Recognition
The next step up on the scale of listening comprehension microskills in words, phrases, and
sentences, which are frequently assessed by providing a stimulus sentence and asking the
test-taker to choose the correct paraphrase from a number of choices.
Sentence paraphrase
Dialogue paraphrase
Here, the criterion is recognition of the adjective form used to indicate country of origin:
Canadian, American, Brazilian, Italian, etc.
Test-takers hear: How much time did you take to do your homework?
The objective of this item is recognition of the wh-question how much and its appropriate
response. Distractors are chosen to represent common learner errors: (a) (b) responding to
how much vs. how much longer, (c) confusing how much in reference to time vs. the more
frequent reference to money; (d) confusing a wh-question with a yes/no question.
None of the tasks so far discussed have to be framed in a multiple-choice format.
They can be offered in a more open-ended framework in which test-takers write or speak the
response. The above item would then look like this:
Test-takers hear: How much time did you take to do your homework?
If open-ended response formats gain a small amount of authenticity and creativity, they of
course suffer some in their practicality, as teachers must then read students responses and
judge their appropriateness, which takes time.
A third type of listening performance is selective listening, in which the task taker
Listens to limited quantity of aural input and must discern within it some specific
information. A number of techniques have been used that require selective listening
Listening Cloze
Listening cloze tasks (sometimes called cloze dictations or partial dictation) require the
tasktaker to listen to a story: monologue or conversation and simultaneously read the written
text in which selective word or phrases have been deleted. Cloze procedure is most
commonly associated with reading only. In its generic from, the test consist of passage in
which every nth word (typically every seventh word) is deleted and the test taker is asked to
supply and appropriate word in a listening cloze task, task takers see transcript of the passage
that they are listening to and fill in the blank with the word or phrase that they hear.
Information Transfer
Selective listening can also be assessed through in information transfer technique in which
aurally process information must be transfer visual representation, such as labeling, diagram,
identifying in element in picture, completing a form, showing routes on a map.
Sentence Repetition
The task of simply repeating a sentence or a partial sentence repetition, is also used as an
assessment of listening comprehension. As in dictation (discussed below), the task taker
retain a stretch of language long enough to reproduce it and then must respond with an oral
repetition of that stimulus. Incorrect listening comprehension, whether at the phonemic or
discourse level, may be manifested in the correctness in the repetition. A miscue in repetition
is scored as a miscue in listening. In the case of somewhat longer sentences, one could argue
that the ability to recognize to retain chunk of language as well as treads of meaning might be
assessed though repetition.
Drawing a clear distinction between any two of the categoriesof listening reffered to here
is problematic, but perhaps the fuzziest division is between selective and extensive listening.
As we gradually move along the continuum from smaller to larger stretchets of language, and
from micro to macroskills of listening, the probability of using more extensive listening tasks
increases. Some important question about designing assessment at this level emerge.
Dictation
Ideally, the language assessment filed would have a stockpile of listening test types that are
cognitively demanding. Communicative, and authentic, not to mention interactive by means of an
integration with speaking. The nature of a test as a simple of performance and a set of tasks with
limited time frames implies an equally limited capacity to mirror all the real world contexts of
listening performance.
1. Note-taking in the academic world, classroom lectures by professors are common features of a
none-native English-users experience.
One among several response formats includes note-taking by the test-takers. These notes are
evaluated by the teacher on a 30-point system, as follow:
0-15 points
Visual representation: are your notes clear and easy to read? Can you easily find and retrieve
information from them? Do you use the space on the paper to visually represent ideas? Do
you use indentation,headers,numbers,etc.?
0-10 points
Accuracy: Do you accurately indicate main ideas from lectures? Do you note important
details and supporting information and examples? Do you leave out unimportant information
and tangents?
0-5 points
Symbols and abbreviations: Do you use symbols and abbreviations as much as possible to
save time? Do you avoid writing out whole words, and do you avoid writing down every
single word the lecture says?
2. Editing. Another authentic task provides both a written and a spoken stimulus, and requires the
test-taker to listen for discrepancies. Scoring achieves relatively high reliability as there are
usually a small number of specific differences that must be identified. Here is the way the task
proceeds.
Test-takers read: the written stimulus material (a news report, an email from a friend, notes
from a lectures, or an editorial in a newspaper)
Test-takers hear: a spoken version of the stimulus that deviates, in a finite number of facts or
opinions, from the original written from.
Test-takers mark: the written stimulus by circling any words, phrases, facts,opinions that
show a discrepancy between the two versions.
3. Interpretive tasks. One of the intensive listening tasks described above was paraphrasing a story
or conversation an interpretive task extends the stimulus material to a longer stretch of discourse
and forces the test-taker to infer a response.
Fotentialstimull include
Song lyrics.
[recited] poetry.
Radio/television news reports, and
An oral account of an experience
Test-takers are then directed to interpret the stimulus by answering a few questions (in open-
ended from). Question might be:
This kind of task moves us away from what might traditionally be considered a test toward an
informal assessment, or possibly even a pedagog:caltechnique or activity.
4. Retelling. In a related task, test-takers listen to story or news event and simply retell it, or
summarize it, either orally (on an audiotape) or in writing. In so doing, test-takers must identify
the gist, main idea, purpose, supporting points, and/or conclusion to show full comprehension.
A fifth category of listening. Comprehension was hinted at earlier in the chapter: interactive listening.
Because such interaction presupposes a process of speaking in concert with listening, the interactive
nature of listening will be addressed in the next chapter.