Long-form Question Answering on AlpacaFact

Best result: EWE, 66.9 VeriScore F1

Updated 4mo ago

Evaluation Results

Method	Date	VeriScore F1	Second metric (unlabeled in source)
EWE	2024.12	66.9	49.9
RA	2024.12	66	43.1
EWE	2024.12	65.5	28
DRAGIN	2024.12	65.3	31.5
Llama-3.1	2024.12	65.3	26.7
COVE w/ Retrieval	2024.12	64	28.8
RA	2024.12	63.9	18.5
Llama-3.1	2024.12	63.8	-
COVE	2024.12	61.5	33.3
DRAGIN	2024.12	61.3	11.1
NEST	2024.12	58.1	30.2
NEST	2024.12	57.8	9.1
COVE w/ Retrieval	2024.12	54.6	12.5
COVE	2024.12	51.3	15.1