Share your thoughts, 1 month free Claude Pro on usSee more

Long-form Question Answering on Biography

49.7VeriScore F1

EWE

Updated 4mo ago

Evaluation Results

Method	Links
EWE 2024.12		49.7	50.2
RA 2024.12		43.8	49.4
DRAGIN 2024.12		42.8	33.5
EWE 2024.12		42.2	21.5
NEST 2024.12		41.8	21.8
NEST 2024.12		41.5	22.1
RA 2024.12		41.4	21.3
COVE w/ Retrieval 2024.12		38.2	29.4
COVE 2024.12		37.7	31.3
Llama-3.1 2024.12		37.1	-
DRAGIN 2024.12		34.7	11.4
COVE w/ Retrieval 2024.12		29.1	10.2
Llama-3.1 2024.12		28.9	24.2
COVE 2024.12		25.1	13.3