Discriminative tasks on RA-QA
[Chart: Accuracy on RA-QA discriminative tasks. Leading method: RAMoEA-QA, Accuracy 72 (Mar 6, 2026). Selectable metrics: Accuracy, Macro F1, Token F1, Exact Match.]
Updated 1mo ago
Evaluation Results
| Method | Date | Accuracy | Macro F1 | Token F1 | Exact Match |
|---|---|---|---|---|---|
| RAMoEA-QA | 2026.03 | 72 | 67 | 88 | 60 |
| CareAQA-operaGT | 2026.03 | 67 | 59 | 86 | 55 |
| CareAQA-operaCT | 2026.03 | 61 | 53 | 83 | 49 |
| PENGI | 2026.03 | 22 | 21 | 2 | 0 |