
Conversational Numerical Reasoning Question Answering on ConvFinQA v1 (dev)

0.7846 Execution Accuracy

APOLLO

Updated 5mo ago

Evaluation Results

Method	Links	Execution Accuracy	Program Accuracy
APOLLO 2022.12		0.7846	0.7591
GPT-4 2022.12		0.7648	-
APOLLO 2022.12		0.7647	0.7414
FinQANet 2022.12		0.6832	0.6787
GPT-3.5-turbo 2022.12		0.5986	-
GPT-2 2022.12		0.5912	0.5752
T-5 2022.12		0.5838	0.5671
BloombergGPT 2022.12		0.4341	-