Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on FinanceBench N=150

98.7Accuracy

Mafin 2.5 (Vectify AI)

Updated 2mo ago

Evaluation Results

Method	Links
Mafin 2.5 (Vectify AI) 2026.05		98.7
Golden Evidence + GPT-5-mini 2026.05		94
Our Oracle (evidence pages) 2026.05		93.3
AgenticRAG 2026.05		92
AgenticRAG 2026.05		91.78
Oracle (evidence pages) 2026.05		85
OODA 2026.05		82
Full filing in context 2026.05		79
Databricks 2026.05		75
Single vector store per filing 2026.05		50
Agentic w. keyword search tools 2026.05		32.71
Traditional RAG 2026.05		24.24
Shared vector store 2026.05		19
Closed book 2026.05		9