Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Conversational Numerical Reasoning Question Answering on ConvFinQA v1 (dev)
Loading...
0.7846
Execution Accuracy
APOLLO
0.42008
0.514715
0.60935
0.703985
Dec 14, 2022
Execution Accuracy
Program Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Execution Accuracy
Program Accuracy
APOLLO
Protocol=Fine-tuning,...
2022.12
0.7846
0.7591
GPT-4
Protocol=Prompting, Mo...
2022.12
0.7648
-
APOLLO
Protocol=Fine-tuning
2022.12
0.7647
0.7414
FinQANet
Protocol=Fine-tuning,...
2022.12
0.6832
0.6787
GPT-3.5-turbo
Protocol=Prompting, Mo...
2022.12
0.5986
-
GPT-2
Protocol=Fine-tuning,...
2022.12
0.5912
0.5752
T-5
Protocol=Fine-tuning,...
2022.12
0.5838
0.5671
BloombergGPT
Protocol=Prompting
2022.12
0.4341
-
Feedback
Search any
task
Search any
task