Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multiple Choice Question on CICERO
Loading...
70.66
Macro F1
DIALECT-Large
69.984
70.1595
70.335
70.5105
Oct 6, 2022
Macro F1
Exact Match (Cause)
Exact Match (Subseq)
Exact Match (Prereq)
Exact Match (Motiv)
Exact Match (Reaction)
Exact Match (Average)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Exact Match (Cause)
Exact Match (Subseq)
Exact Match (Prereq)
Exact Match (Motiv)
Exact Match (Reaction)
Exact Match (Average)
DIALECT-Large
Model=DIALECT-Large, F...
2022.10
70.66
27.36
25.6
24.57
35.39
34.2
27.54
T5-Large
Model=T5-Large, Finetu...
2022.10
70.01
25.21
23.58
24.3
32.58
32.58
25.66
Feedback
Search any
task
Search any
task