
Date Understanding

Benchmarks

Task Name                 | Dataset Name                   | SOTA Result    | Trend
Temporal Reasoning        | Date Understanding             | Accuracy 89.53 | 14
Symbolic Reasoning        | Date Understanding (DU)        | Accuracy 87.2  | 10
Multiple-Choice Reasoning | Date Understanding (test)      | Accuracy 78.2  | 8
Symbolic Reasoning        | Date Understanding (DU) (test) | Accuracy 67.52 | 4
Logical Reasoning         | Date Understanding             | Accuracy 80.8  | 4
Commonsense Reasoning     | Date Understanding             | Accuracy 16.3  | 3