ASSESSING LISTENING
A. Observing The Performance of Four Skills
Before focusing on listening itself, think about the two interacting concepts of
performance and observation. All language users perform the acts of listening, speaking,
reading, and writing. They of course rely on their underlying competence in order to
accomplish these performances. When you propose to assess someone's ability in one or a
combination of the four skills, you assess that person's competence, but you observe the
person's performance. Sometimes the performance does not indicate true competence: a bad
night's rest, illness, an emotional distraction, test anxiety, a memory block, or other student-
related reliability factors could affect performance, thereby providing an unreliable measure of
actual competence.
So, one important principle for assessing a learner's competence is to consider the
fallibility of the results of a single performance, such as that produced in a test. As with any
attempt at measurement, it is your obligation as a teacher to triangulate your measurements:
consider at least two (or more) performances and/or contexts before drawing a conclusion.
That could take the form of one or more of the following designs:
Several tests that are combined to form an assessment.
A single test with multiple test tasks to account for learning styles and performance
variables.
In-class and extra-class graded work.
Alternative forms of assessment (e.g., journal, portfolio, conference, observation, self-
assessment, peer-assessment).
Multiple measures will always give you a more reliable and valid assessment than a single
measure.
A second principle is one that teachers often forget. We must rely as much as possible
on observable performance in our assessments of students. Observable means being able to
see or hear the performance of the learner (the senses of touch, taste, and smell don't apply very
often to language testing!). Receptive skills, then, are clearly the more enigmatic of the two
modes of performance. You cannot observe the actual act of listening or reading, nor can
you see or hear an actual product! You can observe learners only while they are listening or
reading. The upshot is that all assessment of listening and reading must be made on the basis
of observing the test-taker's speaking or writing (or nonverbal response), and not on the
listening or reading itself. So, all assessment of receptive performance must be made by
inference!
B. The Importance of Listening
Listening has often played second fiddle to its counterpart, speaking. In the standardized
testing industry, a number of separate oral production tests are available, but it is rare to find
just a listening test. One reason for this emphasis is that listening is often implied as a
component of speaking. How could you speak a language without also listening? In addition,
the overtly observable nature of speaking renders it more empirically measurable than
listening. But perhaps a deeper cause lies in universal biases toward speaking. A good
speaker is often (unwisely) valued more highly than a good listener. To determine if someone
is a proficient user of a language, people customarily ask, "Do you speak Spanish?" People
rarely ask, "Do you understand and speak Spanish?"
Every teacher of language knows that one's oral production ability, other than
monologues, speeches, reading aloud, and the like, is only as good as one's listening
comprehension ability. But of even further impact is the likelihood that input in the aural-oral
mode accounts for a large proportion of successful language acquisition. In a typical day, we
do measurably more listening than speaking. Whether in the workplace, educational, or home
contexts, aural comprehension far outstrips oral production in quantifiable terms of time,
number of words, effort, and attention. We therefore need to pay close attention to listening
as a mode of performance for assessment in the classroom.
C. Basic Types of Listening
As with all effective tests, designing appropriate assessment tasks in listening begins with
the specification of objectives, or criteria. Those objectives may be classified in terms of
several types of listening performance. Think about what you do when you listen. Literally in
nanoseconds, the following processes flash through your brain.
1. You recognize speech sounds and hold a temporary imprint of them in short-term
memory.
2. You simultaneously determine the type of speech event (monologue, interpersonal
dialogue, transactional dialogue) that is being processed and attend to its context (who
the speaker is, location, purpose) and the content of the message.
3. You use (bottom-up) linguistic decoding skills and/or (top-down) background
schemata to bring a plausible interpretation to the message, and assign a literal and
intended meaning to the utterance.
4. In most cases (except for repetition tasks, which involve short-term memory only),
you delete the exact linguistic form in which the message was originally received in
favor of conceptually retaining important or relevant information in long-term
memory.
Each of these stages represents a potential assessment objective:
Comprehending surface structure elements such as phonemes, words, intonation, or a
grammatical category.
Understanding pragmatic context.
Determining the meaning of auditory input.
Developing the gist, a global or comprehensive understanding.
From these stages we can derive four commonly identified types of listening
performance, each of which comprises a category within which to consider assessment tasks
and procedures.
1. Intensive. Listening for perception of the components (phonemes, words, intonation,
discourse markers, etc.) of a larger stretch of language.
2. Responsive. Listening to a relatively short stretch of language (a greeting, question,
command, comprehension check, etc.) in order to make an equally short response.
3. Selective. Processing stretches of discourse such as short monologues for several
minutes in order to scan for certain information. The purpose of such performance is
not necessarily to look for global or general meanings, but to be able to comprehend
designated information in a context of longer stretches of spoken language (such as
classroom directions from a teacher, TV or radio news items, or stories). Assessment
tasks in selective listening could ask students, for example, to listen for
names, numbers, a grammatical category, directions (in a map exercise), or certain facts
and events.
4. Extensive. Listening to develop a top-down, global understanding of spoken
language. Extensive performance ranges from listening to lengthy lectures to listening
to a conversation and deriving a comprehensive message or purpose. Listening for the
gist, for the main idea, and making inferences are all part of extensive listening.
For full comprehension, test-takers may at the extensive level need to invoke interactive skills
(perhaps note-taking, questioning, discussion): listening that includes all four of the above
types as test-takers actively participate in discussions, debates, conversations, role plays, and
pair and group work. Their listening performance must be intricately integrated with speaking (and
perhaps other skills) in the authentic give-and-take of communicative interchange.
D. Micro- and Macroskills of Listening
A useful way of synthesizing the above two lists is to consider a finite number of micro-
and macroskills implied in the performance of listening comprehension. Richards' (1983) list
of microskills has proven useful in the domain of specifying objectives for learning and may
be even more useful in forcing test makers to carefully identify specific assessment
objectives. In the following box, the skills are subdivided into what I prefer to think of as
microskills (attending to the smaller bits and chunks of language, in more of a bottom-up
process) and macroskills (focusing on the larger elements involved in a top-down approach to a
listening task). The micro- and macroskills provide 17 different objectives to assess in
listening.
Micro- and macroskills of listening (adapted from Richards, 1983)
Microskills
1. Discriminate among the distinctive sounds of English.
2. Retain chunks of language of different lengths in short-term memory.
3. Recognize English stress patterns, words in stressed and unstressed positions, rhythmic
structure, intonation contours, and their role in signaling information.
4. Recognize reduced forms of words.
5. Distinguish word boundaries, recognize a core of words, and interpret word order
patterns and their significance.
6. Process speech at different rates of delivery.
7. Process speech containing pauses, errors, corrections, and other performance variables.
8. Recognize grammatical word classes (nouns, verbs, etc.), systems (e.g., tense, agreement,
pluralization), patterns, rules, and elliptical forms.
9. Detect sentence constituents and distinguish between major and minor constituents.
10. Recognize that a particular meaning may be expressed in different grammatical forms.
11. Recognize cohesive devices in spoken discourse.
Macroskills
12. Recognize the communicative functions of utterances, according to situations,
participants, goals.
13. Infer situations, participants, goals using real-world knowledge.
14. From events, ideas, and so on described, predict outcomes, infer links and connections
between events, deduce causes and effects, and detect such relations as main idea,
supporting idea, new information, given information, generalization, and exemplification.
15. Distinguish between literal and implied meanings.
16. Use facial, kinesic, body language, and other nonverbal clues to decipher meanings.
17. Develop and use a battery of listening strategies, such as detecting key words, guessing
the meaning of words from context, appealing for help, and signaling comprehension
or lack thereof.
Implied in the taxonomy above is a notion of what makes many aspects of listening
difficult, or why listening is not simply a linear process of recording strings of language as
they are transmitted into our brains. Developing a sense of which aspects of listening
performance are predictably difficult will help you to challenge your students appropriately
and to assign weights to items. Consider the following list of what makes listening difficult
(adapted from Richards, 1983; Ur, 1984; Dunkel, 1991):
1. Clustering: attending to appropriate chunks of language (phrases, clauses, constituents).
2. Redundancy: recognizing the kinds of repetitions, rephrasing, elaborations, and insertions
that unrehearsed spoken language often contains, and benefiting from that recognition.
3. Reduced forms: understanding the reduced forms that may not have been a part of an
English learner's past learning experiences in classes where only formal textbook
language has been presented.
4. Performance variables: being able to weed out hesitations, false starts, pauses, and
corrections in natural speech.
5. Colloquial language: comprehending idioms, slang, reduced forms, shared cultural
knowledge.
6. Rate of delivery: keeping up with the speed of delivery, processing automatically as the
speaker continues.
7. Stress, rhythm, and intonation: correctly understanding prosodic elements of spoken
language, which is almost always much more difficult than understanding the smaller
phonological bits and pieces.
8. Interaction: managing the interactive flow of language from listening to speaking to
listening, etc.
E. Designing Assessment Tasks: Intensive Listening
Once you have determined objectives, your next step is to design the tasks, including
making decisions about how you will elicit performance and how you will expect the test-
taker to respond. We will look at tasks that range from intensive listening performance, such
as minimal phonemic pair recognition, to extensive comprehension of language in
communicative contexts. The focus in this section is on the microskills of intensive listening.
Recognizing Phonological and Morphological Elements

A typical form of intensive listening at this level is the assessment of recognition of
phonological and morphological elements of language. A classic test task gives a spoken
stimulus and asks test-takers to identify the stimulus from two or more choices, as in the
following two examples:
Phonemic pair, consonants
Test-takers hear: He's from California.
Test-takers read: (a) He's from California.
(b) She's from California.
Phonemic pair, vowels
Test-takers hear: Is he living?
Test-takers read: (a) Is he leaving?
(b) Is he living?
In both cases above, minimal phonemic distinctions are the target. If you are testing
recognition of morphology, you can use the same format:
Morphological pair, -ed ending
Test-takers hear: I missed you very much.
Test-takers read: (a) I missed you very much.
(b) I miss you very much.
Hearing the past tense morpheme in this sentence challenges even advanced learners,
especially if no context is provided. Stressed and unstressed words may also be tested with
the same rubric. In the following example, the reduced form (contraction) of cannot is tested:
Stress pattern in can't
Test-takers hear: My girlfriend can't go to the party.
Test-takers read: (a) My girlfriend can't go to the party.
(b) My girlfriend can go to the party.
Because they are decontextualized, these kinds of tasks leave something to be desired in their
authenticity. But they are a step better than items that simply provide a one-word stimulus:
One-word stimulus
Test-takers hear: vine
Test-takers read: (a) vine
(b) wine
Paraphrase Recognition
The next step up on the scale of listening comprehension microskills is words, phrases, and
sentences, which are frequently assessed by providing a stimulus sentence and asking the
test-taker to choose the correct paraphrase from a number of choices.
Sentence paraphrase
Test-takers hear: Hello, my name's Keiko. I come from Japan.
Test-takers read: (a) Keiko is comfortable in Japan.
(b) Keiko wants to come to Japan.
(c) Keiko is Japanese.
(d) Keiko likes Japan.
In the above item, the idiomatic come from is the phrase being tested. To add a little context,
a conversation can be the stimulus task to which test-takers must respond with the correct
paraphrase:
Dialogue paraphrase
Test-takers hear: Man: Hi, Maria, my name's George.
Woman: Nice to meet you, George. Are you American?
Man: No, I'm Canadian.
Test-takers read: (a) George lives in the United States.
(b) George is American.
(c) George comes from Canada.
(d) Maria is Canadian.
Here, the criterion is recognition of the adjective form used to indicate country of origin:
Canadian, American, Brazilian, Italian, etc.
F. Designing Assessment Tasks: Responsive Listening
A question-and-answer format can provide some interactivity in these lower-end
listening tasks. The test-taker's response is the appropriate answer to a question.
Appropriate response to a question
Test-takers hear: How much time did you take to do your homework?
Test-takers read: (a) In about an hour.
(b) About an hour.
(c) About $10.
(d) Yes, I did.
The objective of this item is recognition of the wh-question how much and its appropriate
response. Distractors are chosen to represent common learner errors: (a) responding to
how much vs. how much longer; (c) confusing how much in reference to time vs. the more
frequent reference to money; (d) confusing a wh-question with a yes/no question.
None of the tasks so far discussed have to be framed in a multiple-choice format.
They can be offered in a more open-ended framework in which test-takers write or speak the
response. The above item would then look like this:
Open-ended response to a question
Test-takers hear: How much time did you take to do your homework?
Test-takers write or speak:
If open-ended response formats gain a small amount of authenticity and creativity, they of
course lose some practicality, as teachers must then read students' responses and
judge their appropriateness, which takes time.
G. Designing Assessment Tasks: Selective Listening
A third type of listening performance is selective listening, in which the test-taker
listens to a limited quantity of aural input and must discern within it some specific
information. A number of techniques have been used that require selective listening.
Listening Cloze
Listening cloze tasks (sometimes called cloze dictations or partial dictations) require the
test-taker to listen to a story, monologue, or conversation and simultaneously read the written
text in which selected words or phrases have been deleted. Cloze procedure is most
commonly associated with reading only. In its generic form, the test consists of a passage in
which every nth word (typically every seventh word) is deleted and the test-taker is asked to
supply an appropriate word. In a listening cloze task, test-takers see a transcript of the passage
that they are listening to and fill in the blanks with the words or phrases that they hear.
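The generic every-nth-word deletion described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name make_cloze, the blank format, and the sample transcript are invented for this example and do not come from any published procedure.

```python
def make_cloze(passage: str, n: int = 7, blank: str = "_____"):
    """Delete every nth word of a passage (hypothetical helper).

    Returns the gapped text plus the list of deleted words, which
    can serve as the answer key.
    """
    if n < 2:
        raise ValueError("n must be at least 2")
    words = passage.split()
    gapped, key = [], []
    for position, word in enumerate(words, start=1):
        if position % n == 0:
            # Every nth word goes to the answer key and is replaced by a blank.
            key.append(word)
            gapped.append(blank)
        else:
            gapped.append(word)
    return " ".join(gapped), key


# Invented sample transcript, used only for illustration.
transcript = (
    "The train to the city leaves at nine every morning "
    "and returns to the village late in the evening"
)
gapped_text, answer_key = make_cloze(transcript, n=7)
print(gapped_text)
print(answer_key)
```

For a listening cloze, a teacher would more likely pick the deleted words or phrases by hand, since the deletions are selective; the mechanical version only shows the generic form. Note that splitting on whitespace leaves punctuation attached to words.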
Information Transfer
Selective listening can also be assessed through an information transfer technique in which
aurally processed information must be transferred to a visual representation, such as labeling a diagram,
identifying an element in a picture, completing a form, or showing routes on a map.
Sentence Repetition
The task of simply repeating a sentence or a partial sentence, or sentence repetition, is also used as an
assessment of listening comprehension. As in a dictation (discussed below), the test-taker must
retain a stretch of language long enough to reproduce it, and then must respond with an oral
repetition of that stimulus. Incorrect listening comprehension, whether at the phonemic or
discourse level, may be manifested in the correctness of the repetition. A miscue in repetition
is scored as a miscue in listening. In the case of somewhat longer sentences, one could argue
that the ability to recognize and retain chunks of language as well as threads of meaning might be
assessed through repetition.
H. Designing Assessment Tasks: Extensive Listening
Drawing a clear distinction between any two of the categories of listening referred to here
is problematic, but perhaps the fuzziest division is between selective and extensive listening.
As we gradually move along the continuum from smaller to larger stretches of language, and
from micro- to macroskills of listening, the probability of using more extensive listening tasks
increases. Some important questions about designing assessments at this level emerge.
1. Can listening performance be distinguished from cognitive processing factors such as
memory, association, storage, and recall?
2. As assessment procedures become more communicative, does the task take into account
test-takers' ability to use grammatical expectancies, lexical collocations, semantic
interpretations, and pragmatic competence?
3. Are test tasks themselves correspondingly content valid and authentic, that is, do they
mirror real-world language and context?
4. As assessment tasks become more and more open-ended, they more closely resemble
pedagogical tasks, which leads one to ask what the difference is between assessment and
teaching tasks. The answer is scoring: the former imply specified scoring procedures,
while the latter do not.
Dictation
Dictation is a widely researched genre of assessing listening comprehension. In a dictation,
test-takers hear a passage, typically of 50 to 100 words, recited three times: first, at normal
speed; then, with long pauses between phrases or natural word groups, during which time test-
takers write down what they have just heard; and finally, at normal speed once more so
they can check their work and proofread.
Communicative Stimulus-Response Tasks
Test-takers listen to a monologue or conversation and respond to a set of comprehension questions.
Disadvantages: some of the multiple-choice questions don't mirror communicative real-life
situations. A stimulus conversation may itself be authentic, but listening in on a conversation
between a doctor and a patient, for example, is rarely done in real life.
Authentic Listening Tasks
Ideally, the language assessment field would have a stockpile of listening test types that are
cognitively demanding, communicative, and authentic, not to mention interactive by means of an
integration with speaking. However, the nature of a test as a sample of performance and a set of tasks with
limited time frames implies an equally limited capacity to mirror all the real-world contexts of
listening performance.
Here are some possibilities:
1. Note-taking. In the academic world, classroom lectures by professors are common features of a
non-native English user's experience.
One among several response formats includes note-taking by the test-takers. These notes are
evaluated by the teacher on a 30-point system, as follows:
Scoring system for lecture notes
0-15 points
Visual representation: Are your notes clear and easy to read? Can you easily find and retrieve
information from them? Do you use the space on the paper to visually represent ideas? Do
you use indentation, headers, numbers, etc.?
0-10 points
Accuracy: Do you accurately indicate main ideas from lectures? Do you note important
details and supporting information and examples? Do you leave out unimportant information
and tangents?
0-5 points
Symbols and abbreviations: Do you use symbols and abbreviations as much as possible to
save time? Do you avoid writing out whole words, and do you avoid writing down every
single word the lecturer says?
2. Editing. Another authentic task provides both a written and a spoken stimulus, and requires the
test-taker to listen for discrepancies. Scoring achieves relatively high reliability as there are
usually a small number of specific differences that must be identified. Here is the way the task
proceeds.
Editing a written version of an aural stimulus
Test-takers read: the written stimulus material (a news report, an email from a friend, notes
from a lecture, or an editorial in a newspaper).
Test-takers hear: a spoken version of the stimulus that deviates, in a finite number of facts or
opinions, from the original written form.
Test-takers mark: the written stimulus by circling any words, phrases, facts, or opinions that
show a discrepancy between the two versions.
3. Interpretive tasks. One of the intensive listening tasks described above was paraphrasing a story
or conversation. An interpretive task extends the stimulus material to a longer stretch of discourse
and forces the test-taker to infer a response.
Potential stimuli include
song lyrics,
[recited] poetry,
radio/television news reports, and
an oral account of an experience.
Test-takers are then directed to interpret the stimulus by answering a few questions (in open-
ended form). Questions might be:
Why was the singer feeling sad?
What events might have led up to the reciting of this poem?
What do you think the political activists might do next, and why?
What do you think the storyteller felt about the mysterious disappearance of her
necklace?
This kind of task moves us away from what might traditionally be considered a test toward an
informal assessment, or possibly even a pedagogical technique or activity.
4. Retelling. In a related task, test-takers listen to a story or news event and simply retell it, or
summarize it, either orally (on an audiotape) or in writing. In so doing, test-takers must identify
the gist, main idea, purpose, supporting points, and/or conclusion to show full comprehension.
A fifth category of listening comprehension was hinted at earlier in the chapter: interactive listening.
Because such interaction presupposes a process of speaking in concert with listening, the interactive
nature of listening will be addressed in the next chapter.
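Returning to the note-taking task, the 30-point scoring system for lecture notes can be expressed as a small tally. The sketch below assumes the teacher has already judged each criterion by hand; the names MAX_POINTS and score_notes and the sample ratings are hypothetical, and only the three maxima (15, 10, 5) come from the scoring system given earlier.

```python
# Maximum points per criterion, taken from the lecture-notes scoring system.
MAX_POINTS = {
    "visual_representation": 15,
    "accuracy": 10,
    "symbols_and_abbreviations": 5,
}


def score_notes(ratings: dict) -> int:
    """Sum a teacher's ratings after checking each against its maximum."""
    if set(ratings) != set(MAX_POINTS):
        raise ValueError("ratings must cover exactly the three criteria")
    for criterion, points in ratings.items():
        limit = MAX_POINTS[criterion]
        if not 0 <= points <= limit:
            raise ValueError(f"{criterion} must be between 0 and {limit}")
    return sum(ratings.values())


# Invented ratings for one student's notes.
total = score_notes(
    {"visual_representation": 12, "accuracy": 8, "symbols_and_abbreviations": 3}
)
print(f"{total} / {sum(MAX_POINTS.values())}")  # 23 / 30
```

Keeping the three sub-scores separate until the final sum lets the teacher report back on each criterion rather than on the total alone.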
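As a practical aside on the editing task described earlier, a teacher preparing the written and spoken versions may want a quick way to list every point where the two differ, so that the answer key is complete. The sketch below is one possible approach using Python's standard difflib module; the function name discrepancies and the two sample sentences are invented for illustration.

```python
import difflib


def discrepancies(written: str, spoken: str):
    """List word-level differences between the two versions.

    Each entry pairs the words in the written text with the words
    that replace them in the spoken script (either side may be empty).
    """
    written_words = written.split()
    spoken_words = spoken.split()
    matcher = difflib.SequenceMatcher(a=written_words, b=spoken_words, autojunk=False)
    key = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            key.append((" ".join(written_words[i1:i2]), " ".join(spoken_words[j1:j2])))
    return key


# Invented sample stimulus, used only for illustration.
written = "The meeting starts at nine on Monday in the library"
spoken = "The meeting starts at ten on Monday in the cafeteria"
for written_part, spoken_part in discrepancies(written, spoken):
    print(f"written: {written_part!r}  spoken: {spoken_part!r}")
```

The first element of each pair is what a test-taker would be expected to circle in the written stimulus. A word-level comparison like this is only a drafting aid; it cannot tell whether a difference is a fact or an opinion.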
