Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attention Boosted Sequential Inference Model

About

Attention mechanism has been proven effective on natural language processing. This paper proposes an attention boosted natural language inference model named aESIM by adding word attention and adaptive direction-oriented attention mechanisms to the traditional Bi-LSTM layer of natural language inference models, e.g. ESIM. This makes the inference model aESIM has the ability to effectively learn the representation of words and model the local subsentential inference between pairs of premise and hypothesis. The empirical studies on the SNLI, MultiNLI and Quora benchmarks manifest that aESIM is superior to the original ESIM model.

Guanyu Li, Pengfei Zhang, Caiyan Jia• 2018

Related benchmarks

TaskDatasetResultRank
Natural Language InferenceSNLI (test)
Accuracy88.1
681
Paraphrase IdentificationQuora Question Pairs (test)
Accuracy88.01
72
Natural Language InferenceMultiNLI Mismatched
Accuracy73.9
60
Natural Language InferenceMultiNLI Matched
Accuracy73.9
49
Showing 4 of 4 rows

Other info

Follow for update