Using Nawmman Uage Processing: E? Uce
Using Nawmman Uage Processing: E? Uce
Using Nawmman Uage Processing: E? Uce
~ Processing f m Proe?uce
Eli Goldberg and Norbert Driedger, Atmospheric Environment Service, Environment Canada
1 Richard 1. Kittredge, University of Montreal
46
--_I_-- - ._
!
points. For marine forecasts, the primary cri-
!
teria are wind speeds and directions; for pub- I
lic forecasts, cloud cover and precipitation
are more significant. If the time series are I I
sufficiently similar, then FOG merges the
areas represented by the sample points. We
consider the merged area to have multiple
I I
sample points, and at the end of space merg- I
ing, we obtain representative time series I
from the multiple samples. FOG then classi-
fies this data and time-merges it to identify
significant changes in weather conditions.
Since this event data captures the fore-
cast’s physical content without the wording, English text French text
it can serve other purposes. Quality control ........._ _..._
. ....... ......._.
..._..
. ._...
classified as southwest, westerly, or Text Winds southwest 15 to 20 knots diminishing to light late this evening
southerly. The choice should lead to the
fewest transitions, and thus the fewest con-
cepts, in the final results.
The forecast text must give wind speeds ucts. The expert system will therefore return ily control a text product’s content without
to the nearest 5 knots, so a 13-knot wind is different conceptual results for different tar- recourse to the weather maps. In Table 1,
expressed as 15 knots, and a 21-knot wind as get products, even when the input sample the input sample data consists of 19 wind
20 knots. Wind speed ranges can be used, but data is the same. directions and 19 wind speeds (hourly val-
the practice is restricted. For instance, a 5- ues from 6 a.m. to midnight). The expert
knot wind-speed range is permissible only Graphical bulletin editor. Although the system reduced these to two wind events,
during the forecast’s first 18 to 24 hours. Be- weather maps represent the forecasters’ best which the forecaster can adjust easily
yond that time, the unpredictability of efforts, they must occasionally overrule that through the editor.
weather mandates that wind speeds be re- objective data. For example, when forecast- But this approach presents the meteoro-
garded as accurate only to k5 knots. Conse- ers are uncertain about issuing or canceling logical events out of context. To properly as-
quently, a statement of “15 to 20 knots” for weather warnings, they often must wait for sess the combinations of events that go into
the forecast’s second day would imply a pos- confirmation before changing the warning a forecast, the forecaster uses geographical
sible range of 10 to 25 knots. That much vari- status. They can do this using FOG’Sgraph- displays and applies pattem recognition skills
ability is not permitted, so the 15- to 20-knot ical bulletin editor; if necessary, they can also and knowledge of atmospheric physics. The
range in Table 1 could occur only during the use the editor to overrule the expert system FPA’s long-term goal is to refine the con-
early part of the forecast. Once the winds fall that performs time and space merging. ceptual expert system’s expertise so the edi-
below 13 knots, the wind speed is classified The graphical bulletin editor displays the tor is rarely (if ever) required. Until then, it
as light, making direction insignificant (“di- events produced by the conceptual expert is not always practical to change weather
minishing to light late this evening”). This system as interactive histograms. The fore- maps when a rapid response is required (as
rule does not apply to all forecast text prod- caster can adjust these with a mouse to eas- for forecast amendments).
APRIL 1994 ~
47
The alternative to the editor would be to We first performed a distributional analy- the grammatical and conceptual statements
manually edit the forecast text, but we want sis of word use in a large corpus of forecast for each textual forecast product and check
to avoid this. Since the text products must be texts to group words into similarity classes. them against weather service standards,
issued in French and English, manual text We then used the sublanguage word classes to which document some of the accepted norms
editing would require editing in both lan- state the sentence patterns and relationships and preferred usages for forecasts. At this
guages. It would also interrupt the flow of needed for a sentence grammar. A secondary stage we also simplified the task by engi-
meteorological information and reintroduce analysis of the sentence types in whole fore- neering some rare events out of the design;
the forecast consistency problems that the casts led to a preliminary text grammar. this also promoted consistent and under-
FPA is designed to eliminate. The editor, We based the initial description of marine standable final texts.
however, can control consistency when the forecasts on an analysis of more than a mil- There was no corresponding corpus of
data is adjusted. lion words of archived (English) marine fore- French texts. In Canada, most weather fore-
casts. As part of this analysis, linguists rep- casts are produced in English, and a ma-
Linguistic processing. Once FOG has de- resented the syntactic structures of forecasts chine-translation system produces the French
termined the meteorological concepts, its lin- with Backus Naur Form (BNF) grammars text. Research and development for that sys-
guistic processor takes the unordered and un- used for programming language syntax. tem in the 1970s found that the information
structured concept set and outputs fluent They did not intend the BNF grammars to content of the forecasts in both languages
English and French texts according to pro- capture the context sensitivity of forecasts, was close enough to be considered identical
fessional style. There are two major process- for most purposes. Moreover, human trans-
ing stages: text planning and text realization. lations typically break the text into sentences
Text planning enriches the unstructured at the same points in both languages. We de-
concept set with additional derived concepts, cided to exploit these findings by deriving
and then organizes it into a sequence of sen- TO MEET THE NATURAL- texts in both languages from the same con-
tence-sized chunks. This determines the final LANGUAGE REQUlREMENTS tent representation, and by identically scop-
sentence order, but provides only an abstract ing English and French sentences. This
specification of each sentence’s content. The OF FOG’S LINGUISTIC helped reduce the requirement for an exten-
specification is an interlingual structure, sive corpus of French text, although we used
PROCESSOR, WE FIRST HAD
which is input for text realization. some professional French forecasts to clar-
During text realization, grammatical and TO DETERMhW THE TYPE ify specific issues.
lexical modules operate on each sentence
specification to derive a final written text for
OF TEXT THAT OUR SYSTEM Designing the text planner. While devel-
each language. The English and French re- WOULD OUTPUT. oping FOG for different types of forecasts,
alizers use the same general grammatical for- we have implemented two different text plan-
malism and processing substages, but act in- but only to delimit the set of permissible ner designs. The text planner for marine fore-
dependently. structures so forecasters could verify hy- casts relies on data salience to control the
potheses made by linguists about what is or order of sentences and help determine sen-
is not “sayable” in the forecasting domain. tence boundaries. Data salience is the rela-
Designing the linguistic (As a formal language, forecasts are not con- tive significance of different information
prwessor text-free. For example, in French sentences, types to the intended user. In Figure IC,the
nouns and their adjectives-which can eas- content follows a schema whose data
To meet the natural language requirements ily be noncontiguous-must agree in num- salience (starting with the most significant)
of FOG’S linguistic processor, we first had ber and gender.) is: forecast areas, waming status, wind fore-
to determine the type of text that our system We repeated the linguistic analysis for cast, weather forecast (for example, sky
would output. To generate texts that meet op- each forecast product to identify the lexical cover and precipitation), visibility, freezing
erational standards in a domain such as stock and inventory of syntactic structures spray, air temperature, and outlook winds. A
weather forecasting, we must describe the used naturally in human-composed texts. It text schema specifies the text’s global struc-
sublanguage (of English or another natural was easy to identify differences between ture, yet it allows variability in the syntactic
language) that experts use spontaneously to products and between regional variants of structures of the output sentences. It also al-
communicate in that domain.2 As Figure 1 the same product, and to state distinct gram- lows context sensitivity between the
shows, weather forecasts use a distinctive matical and lexical descriptions of those dif- schema’s linguistic structures. For example,
telegraphic style that is quite different from ferences. The analysis also provided an em- forecast sentences often have long-distance
standard English and French. The sentences pirical basis for building a conceptual syntactic and semantic dependencies: When
do not use tensed verbs, articles, or many apparatus that could mediate between raw the end of a paragraph mentions freezing
other words needed to communicate the meteorological facts and texts. This was due spray, as in Figure IC, the beginning must
same information in a more conversational to the fortunate fact that, in a situationally re- have a warning statement. Also, marine vis-
style. Consequently, existing English gram- stricted sublanguage, word classes and word- ibility and the conditions that reduce it are
mars are of little practical use, and we had to class syntax more clearly reflect domain se- normally mentioned in different sentences.
build a specialized sublanguage grammar for mantics than the whole language does. Data salience works well for planning the
forecasts in each language. Expert forecasters still needed to validate text of marine forecasts, but temporal order
~ ~
48 IEEE EXPERT
change-in-precipitation ::=
-
APRIL 1994 49
“southwest‘ A practical example of
linguistic processing
Let’s now look at how the planner and re-
“late”
1 0 ‘this“ alizer transform the conceptual content in
“light“ Table 1 into the output sentence “Winds
southwest 15to 20 knots diminishing to light
II late this evening.”
50 IEEE EXPERT
I/*“. ATTR
‘
,\DIMINISH v, change
SOUTHWEST OIoc
ISH can replace “diminish if the interlin- cn
gual node has “wind” or “gale” as dependent
I, and “wind,” “gale,” or the null node
“omega” as dependent II. However, if the de-
adv preposed
pendency tree involved temperatures, then
the above rule would not be satisfied, and the
“diminish” node would become the verb
LOWER (for example, “Temperatures low- II ’
ering to minus lo...’’), using the rules for that
lexical entry. 1
O*’num plur
The deep syntactic trees in Figures 5 and
6 are similar to the interlingua in Figure 4, ATTR ’
but their nodes are labeled with full lexemes,
and each lexeme has its syntactic class and 1
‘”’‘meas unit
morphosyntactic features (cn for common
~- -~
noun, prep for preposition, and so on). ~~~ ~ ~~ ~
APRIL 1994 51
Meaningedext Theory
Meaning<->Text Theory is based on a large 4,5, and 6 in the article). Aconstituent of the mappings that facilitate software design and
structured dictionary (lexicon) and a set of sentence is a word and all the words that hang maintenance. Second, the systemic lexicon
correspondence rules that provide mappings from it. Because the arc labels carry additional does not provide M l T s features for organiz-
between several representation levels. information, dependency trees require ahout ing lexical relations. In MTT, a variety of lexi-
Some linguistic theories emphasize the lexi- half the nodes of phrase structure trees. h4lT cal functions let general rules rephrase a given
con’s role as a linguistic knowledge repository, uses dependency trees whose nodes have no sentence in many ways (semantic paraphrases)
while others emphasize information in the horizontal ordering. It represents linear order when stylistic variation is desirable.
grammatical rules at the lexicon’s expense. between words only at the morphological level.
During generation, where much of the seman- This direct relational linkage between Applying MTT to another domain. The LFS
tic information partially determines lexical words favors a lexically oriented grammar; system for providing bilingual statistical
choices before determining the major syntactic that is, one in which small lexical sets or even reports on labor force surveys also uses MTT.6
choices, a lexically oriented model is more ad- single words closely control syntactic rules. LFS uses a semantic net representation of sen-
vantageous than a syntactically oriented This is particularly important for representing tences as input for realization, and theme and
model. MTT gives the lexicon enough priority restricted lexical combinations, including id- rheme constraints help optimize lexical and
to allow powerful synthesis techniques. iomatic expressions. MTT’s explanatory- syntactic choices. The LFS interlingua is much
To describe the generation of written text, combinatory dictionary has special provisions deeper than FOG’Sdeep-syntactic interlingua,
MTT provides seven representation levels: for handling word collocations. so it can introduce more subtle and natural-
one semantic, two syntactic, two morphologi- sounding semantic differences between Eng-
cal, and two phonological levels. (MTT mod- MTT and other language models. Text gen- lish and French sentences, which are required
els for English and French typically use five: eration in a single language (not to mention for the more complex LFS sublanguages.
semantic, deep syntactic, surface syntactic, two) places heavy demands on a language Eventually, we may adopt such an interlingua
morphological, and the final written form.) model. The model must account for all the as we extend FOG to cover new languages and
MTT is oriented toward text synthesis (mean- steps in the complex mapping from meaning new text varieties.
ing-to-text) rather than text analysis (text-to- representations to text. Traditional transforma-
meaning), although it was formulated to pro- tional grammars cannot do this. As of the late
vide bidirectional (reversible) models. 1980s, only three kinds of linguistic models 1. LA. Mel’cuk, “Meaning-Text Models: A
M’lT’s strata allow for expansion and gen- had been implemented extensively for genera- Recent Trend in Soviet Linguistics,” Ann.
eralization while maintaining clarity and sim- tion: Meaning-Text models,’,’ tree-adjoining Rev. ofAnthropology, Vol. 10, 1981, pp.
plicity for individual rules. Also, the two syn- grammars (TAGs) as implemented in 27-62.
tactic strata (the deep syntactic and surface Mumble,3 and systemic grammars!.’ 2. L. Iordanskaja, R. Kittredge, and A. Pol-
syntactic levels) facilitate the treatment of dif- MTT and other Meaning-Text models have gukre, “Lexical Selection and Paraphrase
ferences between French and English. We did several advantages over TAGs. MTT treats the in a Meaning-Text Generation Model,” in
not initially know if we needed a full language full multistage mapping from semantics to Natural-Language Generation in Artifi-
theory with several representation levels to phonology, whereas TAGs mostly handle a full cial Intelligence and Computational Lin-
generate the limited number of forms in fore- range of syntactic phenomena and relate them guistics, C. Paris, W. Swartout, and W.
casts. But we decided early in the project to to a morphological or textual level. MTT in- Mann, eds., Kluwer Academic Publish-
implement a well-developed theory to provide cludes a semantic layer (for more complex ers, Boston, 1991, pp. 293-312.
a solid reference model for later extensions forecasts and dissimilar languages), theme and 3. M. Meteer et al., “Mumble-86: Design
and modifications. rheme modeling (for speech synthesis), and and Implementation,” Tech. Report 87-
Linguistic models typically represent syn- lexical functions (for paraphrasing); TAGs do 87, Dept. of Computer and Information
Science, Univ. of Mass., Amherst, Mass.,
tax as phrase structure trees or dependency not. Although FOG currently uses lexical 1987.
trees. Phrase structure trees represent relations functions in relatively minor ways, future ex-
between sentence constituents. Each node is tensions (such as for the generation of weather 4. M.A.K. Halliday, An Introduction to
Functional Grammar, Edward Amold,
labeled with a constituent name (for example, synopses) will make heavier demands. London, 1985.
a noun phrase) and is linked vertically by un- Like MTT, the systemic grammar of the
labeled arcs to its immediate constituents (de- Penman text-generation system covers the full 5. C.W. Mann and C. Matthiessen, “Nigel:
A Systemic Grammar for Text
terminer, adjective, noun, and so on); lexemes transition from semantics to ph~nology.~ It Generation,” Tech. Report RR-83-105,
(dictionary words) appear at the leaves. Hori- goes beyond MTT by integrating pragmatic Information Sciences Inst., Univ. of
zontal ordering of the tree’s node labels issues such as social dimensions of language Southem Calif., Los Angeles, 1983.
reflects linear order among co-constituents. use and nonliteral meaning. However, this po-
6. L. Iordanskaja et al., “Generation of Ex-
In contrast, all dependency tree nodes and tential advantage is of little use in the weather tended Bilingual Statistical Reports,”
leaves are lexical items, linked by oriented de- domain, and systemic approaches are deficient Proc. Coling 92, 15th Int’l Con$ Compu-
pendency relations. The grammatical relations for FOG in two important areas. First, a sys- tational Linguistics, Assoc. for Computa-
between words are explicitly expressed by la- temic grammar does not lend itself to the sim- tional Linguistics, Morristown, N.J.,
bels on the arcs linking the words (see Figures ple, intuitive modularization of level-to-level 1992, pp. 1019-1023.
prosody of spoken output (the pitch, accent, Recent experiments with PHOG, an ex- additions to existing rules, lets PHOG gener-
and timing), thus allowing more economical tended phonological model for FOG, have ate enriched output strings that can be fed to
automatic telephone answering systems to shown that we may achieve good rule-driven a speech synthesizer. To fully realize this po-
handle the millions of calls received each prosody in the near future. The addition of a tential we need a synthesizer that accepts
year for weather forecast information. phonological representation level, plus small high-level prosodic specifications. Commer-
51 IEEE EXPERT
limitations to previous approaches
More than twenty years ago, advances in computer-generated text so forecaster inter- were written in Fortran, and some are now
numerical weather prediction stimulated inter- vention would be unnecessary. However, being rewritten in C. Experience has shown
est in computer-worded weather forecasts. NWP still has difficulty predicting actual sur- that this software is difficult to maintain. This
Since then, a number of difficulties have pre- face weather conditions, so this type of com- has hampered the testing and implementation
vented widescale application of techniques for puter-worded text is used most often as guid- of the software and has made it difficult to up-
wording forecasts. ance or advice for the meteorologist instead date the programs for changing user require-
One problem has been the human of information for the public. Consequently ments. This is a critical factor. Although
interface. Initial forecast data usually comes the computer-worded forecast adds to the in- Canadian textual forecast products fall into
from central NWP atmosphere simulations. formation overload instead of solving the several common broad categories (marine
Until recently, workstations with the graphi- problem. forecasts, public forecasts, and so on), each
cal interfaces needed to conveniently visual- The complexity and brittleness of software category contains many regional variations.
ize and modify meteorological fields were developed to produce computer-worded fore- Also, content, structure, and terminology tend
not available. Consequently, to use computer- casts is a more severe impediment. Earlier to vary with time, albeit slowly. To succeed, a
worded forecast systems, meteorologists Canadian and American schemes resemble system must address variations between fore-
were forced to work with large tables of nu- “slot-filler” techniques, which closely link cast types, variations between geographical
merical data that were neither efficient nor words and phrases to data values and the rules regions in a forecast type, and gradually
effective. This motivated attempts to perfect that govem their use. Most of these programs changing requirements.
cia1 synthesizers are only now beginning to References meteorological systems from the University of
accept good prosody specifications. In the Guelph in 1981, his MSc in theoretical physics
1. R. Paterson et al., “The Forecast Production from the University of Western Ontario in 1970,
meantime, we need a more complete linguis- Assistant,” Proc. Fourth AESKMOS Work- and his BSc in physics from the University of Cal-
tic description of the prosody rules required shop on Operational Meteorology, Canadian gary. He can be reached at goldberge@aestor.
for voice output of each text forecast product. Meteorological and Oceanographic Soc., dots.doe.ca
Toronto, 1992, pp. 262-269.
2. R. Kittredge and J. Lerhberger, eds., Sublan-
T H E WORKPLACE AUTOMATION guaget Studies of Language in Restricted Se-
mantic Domains, de Gruyter, Berlin, 1982. Richard Kittredge is profes-
made possible by the FPA and FOG lets fore- sor of linguistics at the
casters exchange the task of text composi- 3. R. Kittredge, T. Korelsky, and 0. Rambow, University of Montreal,
tion for that of preparing more detailed “On the Need for Domain Communication where he previously di-
graphical weather depictions. The result is Knowledge,” Computational Intelligence, rected the TAUM ma-
Vol. 7, No. 4,Nov. 1991, pp. 305-314. chine-translation project,
that forecasters can accomplish more with
4. A. Polgukre, “Grammatical and Lexical For- and is president of CoGen-
the same effort. One site has added an addi- Tex, a business specializ-
tional forecast product to its production suite: malisms in FOG-89,” intemal report, Atmos-
pheric Environment Service, Environment ing in text generation. His
it now routinely produces three text forecasts technical interests are mul-
Canada, Downsview, Ontario, Canada, 1989.
instead of two. Another has maintained the tilingual text generation,
5. R. Kittredge and A. Polguere, “Dependency sublanguage analysis, and machine language. He
same set of products, but can extend the fore- received his PhD in formal linguistics at the Uni-
Grammars for Bilingual Text Generation: In-
cast’s period to include the 36- to 60-hour side FOG’s Stratificational Models,” Proc. versity of Pennsylvania in 1969, and his BA in
time frame. Manually produced forecasts Int’l Con$ Current Issues in Computational mathematics from Swarthmore College in 1963.
provided weather conditions for today with Linguistics, Universiti Sains Malaysia, He is a member of the Association for Computa-
an outlook for tomorrow; forecasts produced Penang, Malaysia, 1991, pp. 318-330. tional Linguistics, AAAI, CSCSI, ACM SIG Art,
and IFIP WG 12.4. He can be reached at the Dept.
on the FPA provide information on today and 6. L. Iordanskaja, R. Kittredge, and A. Polguere, of Linguistics and Translation, Univ. of Montreal,
tomorrow with an outlook for the next day. “Lexical Selection and Paraphrase in a Mean- c.p. 6128, station A, Montreal, Canada H3C 3J7.
We anticipate more significant savings as ing-Text Generation Model,” in Natural-
Language Generation in Artificial Intelli-
soon as FOG can produce other varieties of gence and Computational Linguistics, C .
text. To this end, we are making FOG’s pub- Paris, W. Swartout, and W. Mann, eds.,
lic forecast implementation operational as Kluwer Academic Publishers, Boston, 1991, Norbert Driedger is a com-
pp. 293-3 12. puter programmer with the
quickly as possible. Meteorological Research
Branch of the Atmos-
Eli Goldberg is a research pheric Environment Ser-
meteorologist with the vice of Canada. His tech-
Ac knowledgments Meteorological Research nical interests include
Branch of the Atmospheric pattern recognition, liter-
We thank Alain Polgukre for his energy and cre- Environment Service of ate programming, and AI-
ativityin implementingFOG, and Myung-Hee Kim Canada, where he devel- related languages. He re-
for her valuable technical assistance in maintain- ops computer applications ceived his BSc in applied
ing and upgradingit. We also thankTanya Korelsky, to support weather fore- physics from the University of Waterloo, Canada.
Owen Rambow, and Ehud Reiter for their substan- casting operations. His He can be reached at King Radar Facility, Envi-
tive comments on this work. Finally, we thank three primary focus is the appli- ronment Canada, 14780 Jane St., R.R. #1, King
anonymous reviewers whose comments and sug- cation of AI to weather City, Ontario, Canada LOG 1KO; Internet
gestions helped focus and clarify this article. forecasting. He obtained his PhD in bydro- ndriedger @rpn.aes.doe.ca
APRIL 1994 53