Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Temporal Question Answering on TimeQA Hard

77.7EM

DeepSeek-V3-AdapTime

-1.3419.1839.760.22Nov 16, 2023Apr 12, 2024Sep 8, 2024Feb 4, 2025Jul 3, 2025Nov 29, 2025Apr 27, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
77.7--79.2
2026.04
76.4--77.9
2026.04
76.4--78.4
2026.04
75.6--77
2026.04
68.8--71.6
2026.04
66.5--68.2
2026.04
63.1--66.4
2026.04
62.9--64.7
2026.04
60.5--69.3
2026.04
59.5--68.1
2026.04
58.2--67.3
2026.04
56.9--60.1
2026.04
55.6--64.1
2026.04
54.6--57.1
2023.11
52.747.349.861
2023.11
50.545.147.659.8
2023.11
4641.143.354.7
2023.11
44.339.441.753.2
2026.04
39.6--49.3
2023.11
3934.236.448.4
2023.11
37.332.934.946.8
2026.04
33.3--38.6
2026.04
31.6--33.8
2023.11
10.3--19.7
2026.04
1.7--3.6