Long narrative understanding QA on Prelude

62.96 Accuracy

HGMEM

Dec 30, 2025
Updated 3mo ago

Evaluation Results

Date      Accuracy
2025.12   62.96
2025.12   62.22
2025.12   61.48
2025.12   60.74
2025.12   60
2025.12   59.26
2025.12   56.3
2025.12   54.81
2025.12   54.07
2025.12   52.59
2025.12   51.85
2025.12   51.11
2025.12   50.37
2025.12   50.37