Does Neural Machine Translation Benefit From Larger Context? | Awesome LLM Papers Contribute to Awesome LLM Papers

Does Neural Machine Translation Benefit From Larger Context?

Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho . Arxiv 2017 – 139 citations

[Paper]   Search on Google Scholar   Search on Semantic Scholar
Uncategorized

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when trained on small corpora, although this improvement largely disappears when trained with a larger corpus. We also discover that attention-based neural machine translation is well suited for pronoun prediction and compares favorably with other approaches that were specifically designed for this task.

Similar Work