Fact Consolidation on MAB FC-SH (262K context) v3 (full)

Accuracy (SubEM): 93

SH-conflict (fact + Python max)

Updated 1mo ago

Evaluation Results

Method	Links	Accuracy (SubEM)	Difference from top score
SH-conflict (fact + Python max) 2026.05		93	-
SH-conflict (fact + Python max) 2026.05		82	-
SH-conflict (chunk4096 + Python max) 2026.05		73	-
GPT-4o 2026.05		60	-33
HippoRAG-v2 2026.05		54	-39
BM25 2026.05		48	-45
GPT-4o-mini 2026.05		45	-48
Claude-3.7-Sonnet 2026.05		43	-50
GPT-4.1-mini 2026.05		36	-57