Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Temporal Question Answering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Temporal Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
LoCoMo
ShardMemo
F1
0.6634
85
8d ago
TimeQA Hard
DeepSeek-V3-AdapTime
EM
77.7
25
1mo ago
LoCoMo Temporal (test)
Reproduced Baselines
F1 Score
44.09
24
3mo ago
TransientTables (test)
Gemini-2.0-Flash
EM
80.39
24
3mo ago
TimeQA Easy
o4-mini
R-1
63.4
20
3mo ago
TimeQA Easy-mode
DeepSeek-V3-AdapTime
Exact Match (EM)
85.4
18
1mo ago
TIME QUESTIONS 1.0 (test)
QUASAR
P@1
75.4
18
3mo ago
TempReason OBQA-L3
DeepSeek-V3-AdapTime
Exact Match (EM)
49.8
17
1mo ago
TempReason OBQA-L2
DeepSeek-V3-AdapTime
EM
48
17
1mo ago
LoCoMo Temporal
LightMem
F1
59.76
12
2mo ago
TimeQA Hard v1
CoT+RL pipeline
R-1
0.504
12
3mo ago
TimeQA Easy v1
CoT+RL pipeline
R-1 Score
58
12
3mo ago
TIQ 1.0 (test)
FAITH
P@1
0.491
10
3mo ago
TimeQA
Hybrid-Passage
Nugget R@20
49.7
9
1mo ago
ReasonQA Multi-hop
T5-large PIT-SFT
Set Accuracy
85
7
3mo ago
ReasonQA Single-hop
T5-large PIT-SFT
Set Accuracy
95.1
7
3mo ago
ActivityNet RTL
LITA
Score
44
5
3mo ago
TRACIE Unstructured
Neuro-symbolic (w/o PIS)
Accuracy
50.3
4
27d ago
TimeX-NLI Semi-structured
PIS
Accuracy
75.1
4
27d ago
TempReason Structured
Symbolic
Accuracy
100
4
27d ago
Synthetic Structured
Symbolic
Accuracy
100
4
27d ago
ArchivalQA
AdapTime
Accuracy
32.2
4
1mo ago
TIMEQUESTIONS (test)
EXAQT
P@1 (Overall)
56.5
4
3mo ago
FreshQA
B1 entropy
AUROC
0.657
2
2mo ago
MultiTQ
-
-
0
3mo ago
Showing 25 of 25 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs