LSTM-based Deep Learning Models for Non-factoid Answer Selection

About

In this paper, we apply a general deep learning (DL) framework for the answer selection task, which does not depend on manually defined features or linguistic tools. The basic framework is to build the embeddings of questions and answers based on bidirectional long short-term memory (biLSTM) models, and measure their closeness by cosine similarity. We further extend this basic model in two directions. One direction is to define a more composite representation for questions and answers by combining convolutional neural network with the basic framework. The other direction is to utilize a simple but efficient attention mechanism in order to generate the answer representation according to the question context. Several variations of models are provided. The models are examined by two datasets, including TREC-QA and InsuranceQA. Experimental results demonstrate that the proposed models substantially outperform several strong baselines.

Ming Tan, Cicero dos Santos, Bing Xiang, Bowen Zhou• 2015

Related benchmarks

Task	Dataset	Result
Answer Selection	WikiQA (test)	MAP0.6557	149
Multi-turn Response Selection	E-commerce Dialogue Corpus (test)	R@1 (Top 10 Set)40.1	70
Multi-turn Response Selection	Douban Conversation Corpus	MAP0.495	67
Multi-turn Response Selection	Ubuntu Corpus	Recall@1 (R10)63.3	65
QA Answer Selection	TREC Answer Selection (test)	MAP0.675	33
Fake News Stance Detection	FNC 1 (test)	Agree58.74	30
Multi-turn Response Selection	E-commerce	R@140.1	14
Answer Sentence Selection	TREC-QA clean-version	MAP72.8	14
Answer Ranking	Ubuntu v2 (test)	Recall@1 (1/2 Pool)90.3	11
Answer Selection	InsuranceQA (dev)	Accuracy68.4	6

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord