Fact Checking on FEVER v1.0 (dev)
Chart: Acc by date (selectable metrics: Acc, Token Count). Top result: LongLLMLingua, 55.1 Acc, Feb 19, 2024.
Updated 4d ago
Evaluation Results
Method             Details                     Date     Acc   Token Count
LongLLMLingua      refinement_type=Abstra...   2024.02  55.1  111
BIDER              refinement_type=Abstra...   2024.02  52.4  93
LLM-Embedder       refinement_type=Extrac...   2024.02  52.2  192
Bge-Reranker       refinement_type=Extrac...   2024.02  52.2  194
Selective-Context  refinement_type=Abstra...   2024.02  52.2  236
SBERT              refinement_type=Extrac...   2024.02  52.1  192
BM25               refinement_type=Extrac...   2024.02  52    193
BART-Summarizer    refinement_type=Abstra...   2024.02  51.8  215
Original Prompt    refinement_type=Withou...   2024.02  51.7  805
Zero-shot          refinement_type=Withou...   2024.02  51.7  0