Question Answering on CommonsenseQA (CSQA)

Accuracy: 91.2

DeBERTaV3-large + KEAR

[Chart: accuracy over time, May 2, 2020 to Feb 7, 2024]
Updated 1mo ago

Evaluation Results

Accuracy    Date
91.2        2023.05
90.4        2023.05
80.7        2020.05
79.1        2020.05
78.1        2020.05
72.2        2024.02
68.3        2024.02
67.5        2024.02
67.4        2024.02
67.4        2024.02
67.4        2024.02
67.4        2024.02
67.2        2024.02
66.8        2024.02
66.5        2024.02
66.4        2024.02
66.2        2024.02
66.1        2024.02
65.8        2024.02
65.6        2024.02
65.2        2024.02
65.1        2024.02
65.1        2024.02
65.1        2024.02
65          2024.02
64.6        2024.02
64.5        2024.02
64.3        2024.02
64.3        2020.05
64          2024.02
63.8        2024.02
63.6        2024.02
63.5        2024.02
63.5        2024.02
63.4        2024.02
63.4        2024.02
63.4        2024.02
63.3        2024.02
63.2        2024.02
63.1        2024.02
63.1        2024.02
63.1        2024.02
63          2024.02
62.7        2024.02
62.5        2020.05
62.5        2024.02
62.3        2024.02
62.3        2024.02
61.9        2024.02
61.8        2024.02
61.7        2024.02
61.7        2024.02
61.7        2024.02
61.6        2024.02
61.4        2024.02
61.3        2024.02
61.3        2024.02
61.3        2024.02
61.2        2024.02
61.2        2024.02
61.1        2024.02
61          2024.02
60.7        2024.02
59.9        2024.02
59.1        2024.02
58.9        2024.02
58.7        2024.02
58.7        2024.02
58.5        2024.02
57.9        2024.02
57.6        2024.02
57.6        2024.02
57.4        2024.02
56.6        2024.02
56.5        2024.02
56.3        2024.02
55.9        2024.02
55.9        2024.02
55.7        2024.02
55          2024.02
54.8        2024.02
54.8        2024.02
54.8        2024.02
54.6        2024.02
54.5        2024.02
54.4        2024.02
54.2        2024.02
54.1        2024.02
54          2024.02
54          2024.02
54          2024.02
53.5        2024.02
53.4        2024.02
53.2        2024.02
53.2        2024.02
52.2        2024.02
52.2        2024.02
51.5        2024.02
51.5        2024.02
51
Showing 100 of 124 rows