Multi-step Reasoning over Code Dependencies on FinQA hard
[Figure: Accuracy chart. Top entry: SGKR, 65.56 Accuracy (Apr 12, 2026).]
Updated 5d ago
Evaluation Results
Method                 Details          Links    Accuracy
SGKR                   Avg. Nodes=1.50  2026.04  65.56
BGE-LARGE-EN-V1.5-RAG  Avg. Nodes=2     2026.04  63.61
GRAPHSAGE-RAG          Avg. Nodes=2     2026.04  62.78
CODEBERT-RAG           Avg. Nodes=2     2026.04  61.94
few-shot               -                2026.04  57.78
vanilla                -                2026.04  47.5