Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-entity Reasoning on MEBench Set2 (11-100)

95.2Comparison Accuracy

DocSage

36.54451.7726782.228Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
95.292.381.890.6
2026.03
79.360.165.767.6
2026.03
77.761.366.767.9
2026.03
71.458.970.765.9
2026.03
38.850.552.547.3