Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Natural Language Inference on MultiNLI mismatched (test)

81.4Accuracy

LM-Transformer

63.82468.38772.9577.513Jun 1, 2017Aug 17, 2017Nov 2, 2017Jan 18, 2018Apr 5, 2018Jun 21, 2018Sep 7, 2018
Updated 1mo ago

Evaluation Results

MethodLinks
2018.05
81.4
2018.05
81.4
2018.05
79.5
2018.05
79
2017.11
78.8
2017.09
78.7
2018.05
78.7
2018.05
78.4
2018.05
77.9
2017.11
77.9
2017.09
77.8
2018.05
77.8
2017.11
76.4
2017.11
75.8
2017.09
74.9
74
2017.06
73.92
2018.09
73.7
2017.12
73.6
2017.09
73.6
2017.09
73.6
73.6
2018.09
73.6
2018.09
73.6
73.6
2017.11
73.6
2017.12
73.5
73.5
2018.09
73.4
2018.09
73.3
2018.09
73.1
2018.08
73
2017.12
72.9
2017.09
72.8
2017.12
72.1
2017.09
72.1
2017.09
72.1
2018.05
72.1
72.1
2017.12
71.4
2017.11
71.4
70.8
70.8
2017.06
70.7
2017.06
69.76
2017.06
68.57
68.2
2017.09
67.6
2017.12
67.1
2018.08
67.1
2018.09
66.9
2017.11
66.9
2017.12
64.6
2018.08
64.6
2018.09
64.5
2017.11
64.5