Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Temporal Commonsense Reasoning on MCTACO (test)
Loading...
79.5
F1 Score
ALICE
68.788
71.569
74.35
77.131
Dec 30, 2020
F1 Score
Exact Match
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
Exact Match
ALICE
Protocol=Best single m...
2020.12
79.5
56.5
ECONET
Protocol=Best single m...
2020.12
76.8
54.7
RoBERTa-Large + ECONET
Protocol=Average of 3...
2020.12
76.3
52.8
RoBERTa-Large
Protocol=Average of 3...
2020.12
75.5
50.4
RoBERTa-Large + Generator
Protocol=Average of 3...
2020.12
75.1
50.2
BERT-Large
Protocol=Average of 3...
2020.12
70.3
43.2
TacoLM
Protocol=Average of 3...
2020.12
69.3
40.5
BERT-Large + ECONET
Protocol=Average of 3...
2020.12
69.2
42.3
Feedback
Search any
task
Search any
task