Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Time-Time Temporal Reasoning (L1) on TEMPREASON 1.0 (test)
Loading...
100
EM
T5-SFT
-4
23
50
77
Jun 15, 2023
EM
F1 Score
Delta F1
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
F1 Score
Delta F1
T5-SFT
Setting=CBQA
2023.06
100
100
-
TempT5
Setting=CBQA
2023.06
100
100
0
ChatGPT
Setting=CBQA
2023.06
30.5
56.7
-
FLAN-T5-L
Setting=CBQA
2023.06
0
2.9
-
Feedback
Search any
task
Search any
task