Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-entity Reasoning on MEBench Set3 (>100)

94.6Comparison Accuracy

DocSage

12.12833.53954.9576.361Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
94.688.479.187.9
2026.03
50.83541.341.5
2026.03
49.237.435.940.6
2026.03
4534.441.739.6
2026.03
15.321.430.621.9