Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

String-level response similarity on RA-QA Single-Verify, Discriminative tasks

94BERTScore

RAMoEA-QA

-0.6423.9348.573.07Mar 6, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
9492.64
2026.03
9189.95
2026.03
8786.08
2026.03
34.11