Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Commonsense Reasoning on StoryCloze

97.6Accuracy

Traditional MTL

64.517673.106381.69590.2837Mar 30, 2023Sep 19, 2023Mar 11, 2024Sep 1, 2024Feb 22, 2025Aug 15, 2025Feb 5, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
97.6-
2024.05
95.3-
2024.05
94.9-
2024.05
94.1-
2024.05
93.6-
2024.05
92.5-
2024.05
92.5-
2024.05
91.1-
2024.05
90.3-
2023.05
87.4-
2023.05
86.7-
2023.05
86.1-
2023.05
85.6-
2025.10
84.9-2
2023.03
84.7-
2025.10
82.70.2
2023.03
81.83-
2025.10
81.617.2
2024.05
81.5-
2025.10
81.1-0.2
2025.10
80.97.4
2023.03
80.87-
2025.10
80.54.5
2023.03
80.28-
2023.03
78.3-
2026.02
67.72-
2026.02
67.34-
2026.02
67.13-
2026.02
67.13-
2026.02
66.86-
2026.02
66.76-
2026.02
66.7-
2026.02
66.38-
2026.02
65.79-