
Incorporating Structural Alignment Biases Into An Attentional Neural Translation Model

Trevor Cohn, Cong Duy Vu Hoang, Ekaterina Vymolova, Kaisheng Yao, Chris Dyer, Gholamreza Haffari. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2016) – 181 citations


Neural encoder-decoder models of machine translation have achieved impressive results, rivalling traditional translation models. However, their modelling formulation is overly simplistic and omits several key inductive biases built into traditional models. In this paper we extend the attentional neural translation model to include structural biases from word-based alignment models, including positional bias, Markov conditioning, fertility, and agreement over translation directions. We show improvements over a baseline attentional model and a standard phrase-based model across several language pairs, evaluating on difficult languages in a low-resource setting.
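To make the idea concrete, here is a minimal sketch, in plain NumPy, of how structural biases can be folded into attention weights at decoding step i. It is illustrative only: the Gaussian positional term, the linear coverage penalty, and the names `lam_pos` and `lam_cov` are simplifications of my own, whereas the paper itself uses learned log-domain positional features, Markov conditioning on the previous attention vector, a trained fertility model, and bidirectional agreement.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - x.max())
    return e / e.sum()

def biased_attention(energies, i, src_len, tgt_len, coverage,
                     lam_pos=1.0, lam_cov=1.0):
    """Attention over source positions with two illustrative structural biases:
    - positional bias: favour source positions j near the diagonal j/src_len ~ i/tgt_len
    - fertility/coverage: penalise source words that have already been attended to heavily
    `energies` stands in for the learned attention scores of the base model.
    """
    j = np.arange(src_len)
    diagonal = (i / tgt_len) * src_len
    pos_bias = -lam_pos * ((j - diagonal) / src_len) ** 2  # peaks on the diagonal
    cov_penalty = -lam_cov * coverage                      # discourages over-use
    return softmax(energies + pos_bias + cov_penalty)

# Toy decoding loop: accumulate attention mass per source word as a
# running fertility estimate (hypothetical sizes, random stand-in scores).
src_len, tgt_len = 6, 5
coverage = np.zeros(src_len)
rng = np.random.default_rng(0)
for i in range(tgt_len):
    energies = rng.normal(size=src_len)
    alpha = biased_attention(energies, i, src_len, tgt_len, coverage)
    coverage += alpha
```

The design point the sketch captures is that all of these biases enter as additive terms on the attention energies before the softmax, so the base attentional model is recovered when the bias weights are zero.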
