Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reading Comprehension on DROP (dev)

88.1F1 Score

QDGATp

-2.088821.325644.7468.1544Mar 1, 2019Aug 17, 2019Feb 3, 2020Jul 22, 2020Jan 7, 2021Jun 26, 2021Dec 13, 2021
Updated 1mo ago

Evaluation Results

MethodLinks
2020.09
88.185.31
87.880.5
87.179.7
2020.09
87.0584.07
2019.09
86.878.4
2020.09
85.8582.74
2020.09
85.5982.63
2019.09
85.577.9
2019.09
8576.2
2020.09
84.4281.07
2020.09
83.9880.22
2019.09
83.976.4
2019.09
81.373.2
2019.08
80.5476.68
2019.09
80.270.6
2019.09
79.969.7
2019.09
78.969.1
2019.08
72.8168.17
2019.10
68.3164.92
2020.09
68.3164.92
2019.08
67.3564.61
2019.10
64.8561.47
2019.08
58.7555.82
2021.12
58.6-
2021.12
57.8-
2021.12
57.3-
2019.08
49.2446.2
2019.10
49.2446.2
2020.09
49.2446.2
2019.03
49.2446.2
2019.03
45.7143.07
2021.12
36.5-
2021.12
34.3-
2019.03
33.9230.09
2019.08
33.3630.1
2019.10
33.3630.1
2020.09
33.3630.1
2019.03
33.3630.1
2019.10
30.4427.5
2020.09
30.4427.5
2019.03
30.4427.5
2019.08
30.3327.71
2019.03
30.3327.71
2019.03
29.1725.94
2019.08
28.8526.06
2019.10
28.8526.06
2020.09
28.8526.06
2019.03
28.8526.06
2021.12
23.6-
13.6711.03
2019.10
11.729.38
2020.09
11.729.28
2019.03
11.729.28
2019.10
11.649.38
2020.09
11.649.38
2019.03
11.649.38
2019.10
11.318.8
2020.09
11.318.8
2019.03
11.318.8
8.074.28
2019.03
8.074.28
2019.03
2.270.13
2019.03
1.380.09