Multi-step Reasoning over Code Dependencies on ConvFinQA hard
[Chart: Accuracy (Multi-step Reasoning) by date. Best result shown: SGKR, 74.67 (Apr 12, 2026).]
Updated 5d ago
Evaluation Results
Method                 Avg. Nodes  Date     Accuracy (Multi-step Reasoning)
SGKR                   1.52        2026.04  74.67
GRAPHSAGE-RAG          2           2026.04  71.67
BGE-LARGE-EN-V1.5-RAG  2           2026.04  71.33
CODEBERT-RAG           2           2026.04  68.33
few-shot               -           2026.04  63
vanilla                -           2026.04  52