Google Scholar

TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks

D Jiang, Y Li, G Zhang, W Huang, BY Lin… - … on Machine Learning …, 2023 - openreview.net

We present TIGERScore, a Trained metric that follows Instruction Guidance
to perform Explainable and Reference-free evaluation over a wide
spectrum of text generation tasks. Different from other automatic evaluation methods that
only provide arcane scores, TIGERScore is guided by natural language instruction to
provide error analysis to pinpoint the mistakes in the generated text. Our metric is based on
LLaMA-2, trained on our meticulously curated instruction-tuning dataset MetricInstruct which …

Cited by 23

TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks. ArXiv, Vol. abs/2310.00752 (2023)

D Jiang, Y Li, G Zhang, W Huang, BY Lin, W Chen - 2023

Cited by 2
