
Date Understanding on BIG-bench Hard (test)

Test Accuracy: 75.2

DLN-2

[Chart: test accuracy over time, Jun 21, 2023 to Nov 9, 2023]
Updated 1mo ago

Evaluation Results

Date      Test Accuracy
2023.06   75.2
2023.06   72.4
2023.06   56.4
2023.11   56
2023.06   55.7
2023.11   54.4
2023.11   48
2023.11   48
2023.11   46.7
2023.11   45
2023.11   39.1
2023.11   36
2023.06   32.1
2023.06   23.5