Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge-intensive NLP tasks on FEVER, NQ, WoW, and ZSRE
Loading...
41.22
Average Performance
UpperBound
35.3128
36.8464
38.38
39.9136
May 28, 2023
Average Performance
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average Performance
UpperBound
FLOPs=453.1 G, Time=22...
2023.05
41.22
PlugD
FLOPs=139.3 G, Time=98 ms
2023.05
39.68
ED2LM
FLOPs=114.9 G, Time=60 ms
2023.05
37.39
EmbRecy
FLOPs=197.5 G, Time=14...
2023.05
35.54
Feedback
Search any
task
Search any
task