Insertion-based Decoding With Automatically Inferred Generation Order · Awesome LLM Papers Contribute to LLM-Bible

Insertion-based Decoding With Automatically Inferred Generation Order

Jiatao Gu, Qi Liu, Kyunghyun Cho. Transactions of the Association for Computational Linguistics 2019 – 17 citations

[Paper]    
Transformer Reinforcement Learning GPT Model Architecture

Conventional neural autoregressive decoding commonly assumes a fixed left-to-right generation order, which may be sub-optimal. In this work, we propose a novel decoding algorithm – InDIGO – which supports flexible sequence generation in arbitrary orders through insertion operations. We extend Transformer, a state-of-the-art sequence generation model, to efficiently implement the proposed approach, enabling it to be trained with either a pre-defined generation order or adaptive orders obtained from beam-search. Experiments on four real-world tasks, including word order recovery, machine translation, image caption and code generation, demonstrate that our algorithm can generate sequences following arbitrary orders, while achieving competitive or even better performance compared to the conventional left-to-right generation. The generated sequences show that InDIGO adopts adaptive generation orders based on input information.

Similar Work