Conversational Numerical Reasoning Question Answering on ConvFinQA v1 (test)

Best result: 89.44 Execution Accuracy (Human Expert Performance)

Updated 4mo ago

Evaluation Results

Method	Date	Execution Accuracy	Program Accuracy
Human Expert Performance	2022.12	89.44	86.34
APOLLO	2022.12	78.76	77.19
APOLLO	2022.12	76	74.56
FinQANet	2022.12	68.9	68.24
T-5	2022.12	58.66	57.05
GPT-2	2022.12	58.19	57
General Crowd Performance	2022.12	46.9	45.52