Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multiple Choice Question on CICERO v2
Loading...
88.63
Macro F1
DIALECT-Large
87.9228
88.1064
88.29
88.4736
Oct 6, 2022
Macro F1
Exact Match (Cause)
Exact Match (Subseq)
Exact Match (Prereq)
Exact Match (Motiv)
Exact Match (Reaction)
Exact Match (Average)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Macro F1
Exact Match (Cause)
Exact Match (Subseq)
Exact Match (Prereq)
Exact Match (Motiv)
Exact Match (Reaction)
Exact Match (Average)
DIALECT-Large
Model=DIALECT-Large, F...
2022.10
88.63
69.05
73.88
-
75.37
76.14
73.8
T5-Large
Model=T5-Large, Finetu...
2022.10
87.95
65.52
71.48
-
75.87
72.43
71.95
Feedback
Search any
task
Search any
task