Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
About
Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure explored so far, however, is a linear chain, whereas natural language exhibits syntactic properties that naturally combine words into phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).
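The core idea can be illustrated with the Child-Sum variant of the Tree-LSTM: each node sums its children's hidden states for the input, output, and update gates, but computes a separate forget gate per child, so it can selectively keep or discard each subtree's memory. The sketch below (NumPy; names and initialization are illustrative, not the authors' implementation) shows one such cell applied to a tiny two-leaf tree:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class ChildSumTreeLSTMCell:
    """Illustrative Child-Sum Tree-LSTM cell.

    Unlike a chain LSTM, a node receives a *set* of children:
    hidden states are summed for gating, and each child gets
    its own forget gate over its memory cell.
    """

    def __init__(self, x_dim, h_dim, seed=0):
        rng = np.random.default_rng(seed)
        # One (W, U, b) triple per gate: input, forget, output, update.
        self.W = {g: rng.normal(0.0, 0.1, (h_dim, x_dim)) for g in "ifou"}
        self.U = {g: rng.normal(0.0, 0.1, (h_dim, h_dim)) for g in "ifou"}
        self.b = {g: np.zeros(h_dim) for g in "ifou"}
        self.h_dim = h_dim

    def __call__(self, x, children):
        """x: input vector at this node; children: list of (h_k, c_k)."""
        h_sum = sum((h for h, _ in children), np.zeros(self.h_dim))
        i = sigmoid(self.W["i"] @ x + self.U["i"] @ h_sum + self.b["i"])
        o = sigmoid(self.W["o"] @ x + self.U["o"] @ h_sum + self.b["o"])
        u = np.tanh(self.W["u"] @ x + self.U["u"] @ h_sum + self.b["u"])
        c = i * u
        # Per-child forget gate: each subtree's memory is weighted separately.
        for h_k, c_k in children:
            f_k = sigmoid(self.W["f"] @ x + self.U["f"] @ h_k + self.b["f"])
            c = c + f_k * c_k
        h = o * np.tanh(c)
        return h, c

# Toy usage: two leaves (no children) combined at a parent node.
cell = ChildSumTreeLSTMCell(x_dim=4, h_dim=3)
leaf1 = cell(np.ones(4), [])
leaf2 = cell(-np.ones(4), [])
h, c = cell(np.zeros(4), [leaf1, leaf2])
```

With a single child and that child's forget gate, this reduces to a standard chain LSTM step, which is why the Tree-LSTM is a strict generalization.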
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Node Classification | Pubmed | -- | -- | 742 |
| Subjectivity Classification | Subj | Accuracy | 92.1 | 266 |
| Text Classification | AG News (test) | Accuracy | 86.06 | 210 |
| Question Classification | TREC | Accuracy | 93 | 205 |
| Relation Extraction | TACRED (test) | F1 Score | 62.4 | 194 |
| Text Classification | TREC | Accuracy | 91.8 | 179 |
| Sentiment Classification | SST-2 | Accuracy | 88 | 174 |
| Natural Language Inference | SNLI | Accuracy | 80.6 | 174 |
| Sentiment Analysis | SST-5 (test) | Accuracy | 51 | 173 |
| Sentiment Classification | MR | Accuracy | 80 | 148 |