
Natural Language Inference on HANS (test)

Accuracy: 78.65

Roberta-large w/ Z-Aug

[Chart: accuracy over time, Jan 28, 2022 to Jun 29, 2022]
Updated 1mo ago

Evaluation Results

Date     Accuracy
2022.03  78.65
2022.03  75.74
2022.01  73.16
2022.03  71.2
2022.06  71.2
2022.06  70.99
2022.06  70.92
2022.06  70.77
2022.03  70.5
2022.06  69.82
2022.06  69.75
2022.06  69.73
2022.03  69.26
2022.06  69.2
2022.06  69.15
2022.06  69.11
2022.03  69.1
2022.06  69.1
2022.03  68.75
2022.06  68.72
2022.06  68.57
2022.06  68.57
2022.03  67.9
2022.03  67.69
2022.03  66.87
2022.06  66.87
2022.06  66.43
2022.06  66.42
2022.03  66.31
2022.03  66.15
2022.06  66.02
2022.06  65.57
2022.06  65.57
2022.06  65.35
2022.06  65.35
2022.03  65.32
2022.06  65.16
2022.03  65.11
2022.06  64.88
2022.06  64.88
2022.06  64.25
2022.03  64
2022.03  63.4
2022.06  63.22
2022.06  62.72
2022.03  62.57
2022.01  60.01
2022.03  59.6
2022.03  58.42
2022.01  54.79
2022.03  54.36
2022.01  51.61
2022.01  49.92
2022.01  49.79