Share your thoughts, 1 month free Claude Pro on usSee more

Long-form Question Answering on LongFact

75.9VeriScore F1

EWE

Updated 4mo ago

Evaluation Results

Method	Links
EWE 2024.12		75.9	50.1
RA 2024.12		72.7	41.2
DRAGIN 2024.12		71.5	38.2
COVE w/ Retrieval 2024.12		67.4	31.8
EWE 2024.12		67.3	40.5
RA 2024.12		65.9	28.1
Llama-3.1 2024.12		64.3	-
DRAGIN 2024.12		63.9	15.9
COVE 2024.12		63.8	39.3
NEST 2024.12		63.2	9.1
Llama-3.1 2024.12		63.1	40.6
NEST 2024.12		62.3	4.2
COVE w/ Retrieval 2024.12		53.5	12.2
COVE 2024.12		44.1	8.8