Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Temporal Question Answering on TimeQA Hard
Loading...
52.7
EM
T5-L-FiD-PIT
8.604
20.052
31.5
42.948
Nov 16, 2023
EM
Set Accuracy
Answer F1
Token F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
EM
Set Accuracy
Answer F1
Token F1
T5-L-FiD-PIT
Backbone=T5-Large, Fra...
2023.11
52.7
47.3
49.8
61
T5-L-FiD
Backbone=T5-Large, Fra...
2023.11
50.5
45.1
47.6
59.8
T5-B-FiD-PIT
Backbone=T5-Base, Fram...
2023.11
46
41.1
43.3
54.7
T5-B-FiD
Backbone=T5-Base, Fram...
2023.11
44.3
39.4
41.7
53.2
T5-B-PIT
Backbone=T5-Base, Stra...
2023.11
39
34.2
36.4
48.4
T5-B
Backbone=T5-Base
2023.11
37.3
32.9
34.9
46.8
T5-B-FiD
Backbone=T5-Base, Fram...
2023.11
10.3
-
-
19.7
Feedback
Search any
task
Search any
task