LSTM Based Conversation Models | Awesome LLM Papers Contribute to Awesome LLM Papers

LSTM Based Conversation Models

Yi Luan, Yangfeng Ji, Mari Ostendorf . Arxiv 2016 – 60 citations

[Paper]   Search on Google Scholar   Search on Semantic Scholar
Uncategorized

In this paper, we present a conversational model that incorporates both context and participant role for two-party conversations. Different architectures are explored for integrating participant role and context information into a Long Short-term Memory (LSTM) language model. The conversational model can function as a language model or a language generation model. Experiments on the Ubuntu Dialog Corpus show that our model can capture multiple turn interaction between participants. The proposed method outperforms a traditional LSTM model as measured by language model perplexity and response ranking. Generated responses show characteristic differences between the two participant roles.

Similar Work