Anticipating Future with Large Language Model for Simultaneous Machine Translation

S Ouyang, O Hrinchuk, Z Chen, V Lavrukhin… - arXiv preprint arXiv …, 2024 - arxiv.org
arXiv preprint arXiv:2410.22499, 2024arxiv.org
Simultaneous machine translation (SMT) takes streaming input utterances and incrementally
produces target text. Existing SMT methods only use the partial utterance that has already
arrived at the input and the generated hypothesis. Motivated by human interpreters'
technique to forecast future words before hearing them, we propose $\textbf {T} $ ranslation
by $\textbf {A} $ nticipating $\textbf {F} $ uture (TAF), a method to improve translation quality
while retraining low latency. Its core idea is to use a large language model (LLM) to predict …
Simultaneous machine translation (SMT) takes streaming input utterances and incrementally produces target text. Existing SMT methods only use the partial utterance that has already arrived at the input and the generated hypothesis. Motivated by human interpreters' technique to forecast future words before hearing them, we propose ranslation by nticipating uture (TAF), a method to improve translation quality while retraining low latency. Its core idea is to use a large language model (LLM) to predict future source words and opportunistically translate without introducing too much risk. We evaluate our TAF and multiple baselines of SMT on four language directions. Experiments show that TAF achieves the best translation quality-latency trade-off and outperforms the baselines by up to 5 BLEU points at the same latency (three words).
arxiv.org
Showing the best result for this search. See all results