Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

String-level response similarity on RA-QA Global, Discriminative tasks

0.9BERTScore

RAMoEA-QA

-0.12960.13770.4050.6723Mar 6, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
0.988.38
2026.03
0.8987.05
2026.03
0.8784.89
2026.03
0.8783.22
2026.03
0.8683.15
2026.03
0.8381.02
2026.03
-0.012.61
2026.03
-0.090