Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Knowledge-intensive NLP tasks on FEVER, NQ, WoW, and ZSRE

41.22Average Performance

UpperBound

35.312836.846438.3839.9136May 28, 2023
Updated 1mo ago

Evaluation Results

MethodLinks
2023.05
41.22
2023.05
39.68
2023.05
37.39
2023.05
35.54