Attentive Memory Networks: Efficient Machine Reading For Conversational Search

Tom Kenter, Maarten de Rijke. Proceedings of the 1st International Workshop on Conversational Approaches to Information Retrieval (CAIR17), Tokyo, Japan, August 11, 2017 – 40 citations

Datasets Efficiency Productivity Enhancement RAG

Recent advances in conversational systems have changed the search paradigm. Traditionally, a user poses a query to a search engine that returns an answer based on its index, possibly leveraging external knowledge bases and conditioning the response on earlier interactions in the search session. In a natural conversation, there is an additional source of information to take into account: utterances produced earlier in the conversation can also be referred to, and a conversational IR system has to keep track of information conveyed by the user during the conversation, even if it is implicit. We argue that the process of building a representation of the conversation can be framed as a machine reading task, where an automated system is presented with a number of statements about which it should answer questions. The questions should be answered solely by referring to the statements provided, without consulting external knowledge. The time is right for the information retrieval community to embrace this task, both as a stand-alone task and integrated in a broader conversational search setting. In this paper, we focus on machine reading as a stand-alone task and present the Attentive Memory Network (AMN), an end-to-end trainable machine reading algorithm. Its key contribution is efficiency, achieved by a hierarchical input encoder that iterates over the input only once. Speed is an important requirement in the setting of conversational search, as gaps between conversational turns have a detrimental effect on naturalness. On 20 datasets commonly used for evaluating machine reading algorithms, we show that the AMN achieves performance comparable to state-of-the-art models while using considerably fewer computations.
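The "read once, then answer" idea in the abstract can be illustrated with a toy sketch. This is not the authors' architecture: the abstract does not specify the encoder, so the code below assumes a hypothetical two-level scheme in which word vectors are averaged into one vector per statement, and those statement vectors are cached as a memory that every later question attends over. The `embed` function is a stand-in for learned embeddings, and all function names are illustrative.

```python
import math

DIM = 8  # illustrative vector size, not taken from the paper


def embed(word):
    # Hypothetical toy embedding: a deterministic pseudo-vector per word.
    # A trained model would use learned embeddings instead.
    seed = sum(ord(c) for c in word)
    return [math.sin((i + 1) * seed) for i in range(DIM)]


def encode_sentence(sentence):
    # Word level: average the word vectors of a single statement.
    vecs = [embed(w) for w in sentence.lower().split()]
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def encode_story(statements):
    # Sentence level: a single pass over the input, one vector per statement.
    return [encode_sentence(s) for s in statements]


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(memory, question):
    # Dot-product attention over the cached memory; the statements
    # themselves are not re-read when a new question arrives.
    q = encode_sentence(question)
    scores = [sum(a * b for a, b in zip(m, q)) for m in memory]
    return softmax(scores)


statements = ["Mary went to the kitchen", "John picked up the ball"]
memory = encode_story(statements)  # the input is encoded exactly once

for question in ["Where is Mary", "Who has the ball"]:
    weights = attend(memory, question)
    print(question, [round(w, 3) for w in weights])
```

Because the memory is built once and reused, the cost of each additional question depends on the number of stored statements rather than on the number of input words, which is the kind of saving the abstract points to. With untrained toy vectors the attention weights carry no meaning; the sketch only shows the data flow.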
