Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Open Question Answering on LegalMC4 (test)

77.2LLM Factual Correctness

GPT-5 (min. reasoning)

33.72845.01456.367.586Jan 20, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
77.2
2026.01
70.1
2026.01
55.4
2026.01
54.5
2026.01
43
2026.01
41.8
2026.01
38.7
2026.01
35.4