
Does Multimodality Help Human And Machine For Translation And Image Captioning?

Ozan Caglayan, Walid Aransa, Yaxing Wang, Marc Masana, Mercedes García-Martínez, Fethi Bougares, Loïc Barrault, Joost van de Weijer. Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers, 2016 – 77 citations

[Paper]
Tags: WMT

This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge. We explored various comparative methods, namely phrase-based systems and attentional recurrent neural network models trained using monomodal or multimodal data. We also performed a human evaluation in order to estimate the usefulness of multimodal data for human machine translation and image description generation. Our systems obtained the best results for both tasks according to the automatic evaluation metrics BLEU and METEOR.
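The automatic evaluation mentioned above relies on BLEU and METEOR. As a minimal sketch (not from the paper), the snippet below shows how these two metrics might be computed for a set of translation or captioning hypotheses, using the sacrebleu and NLTK libraries; the example sentences are hypothetical.

```python
# Minimal sketch: corpus-level BLEU via sacrebleu and averaged sentence-level
# METEOR via NLTK. The hypothesis/reference strings here are made-up examples.
import nltk
import sacrebleu
from nltk.translate.meteor_score import meteor_score

nltk.download("wordnet", quiet=True)  # METEOR needs WordNet data

hypotheses = ["a man is riding a bicycle on the street"]
references = ["a man rides a bike down the street"]

# sacrebleu expects a list of hypotheses and a list of reference streams.
bleu = sacrebleu.corpus_bleu(hypotheses, [references])
print(f"BLEU: {bleu.score:.2f}")

# NLTK's meteor_score works on pre-tokenized sentences; average over the corpus.
meteor = sum(
    meteor_score([ref.split()], hyp.split())
    for hyp, ref in zip(hypotheses, references)
) / len(hypotheses)
print(f"METEOR: {meteor:.3f}")
```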

Similar Work