Adaptations of ROUGE and BLEU to Better Evaluate Machine Reading Comprehension Task

Yang, An; Liu, Kai; Liu, Jing; Lyu, Yajuan; Li, Sujian

arXiv:1806.03578 (cs)

[Submitted on 10 Jun 2018]

Abstract: Current evaluation metrics to question answering based machine reading comprehension (MRC) systems generally focus on the lexical overlap between the candidate and reference answers, such as ROUGE and BLEU. However, bias may appear when these metrics are used for specific question types, especially questions inquiring yes-no opinions and entity lists. In this paper, we make adaptations on the metrics to better correlate n-gram overlap with the human judgment for answers to these two question types. Statistical analysis proves the effectiveness of our approach. Our adaptations may provide positive guidance for the development of real-scene MRC systems.

Comments: 7 pages, 2 figures, ACL 2018 MRQA Workshop camera-ready version
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1806.03578 [cs.CL] (or arXiv:1806.03578v1 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.1806.03578

Submission history

From: An Yang
[v1] Sun, 10 Jun 2018 03:50:10 UTC (27 KB)
