Fact Verification on X-Fact Zero-Shot (ZS)
[Chart: Macro-F1 over time. Plotted point: SEEK, 30 Macro-F1, May 26, 2026.]
Updated 7d ago
Evaluation Results
Method              Model     Date      Macro-F1
SEEK                LLaMA     2026.05   30
SEEK                Mistral   2026.05   27
Sentence Chunking   Mistral   2026.05   25
SEEK                Gemma     2026.05   24
Sentence Chunking   Gemma     2026.05   23
Semantic Chunking   LLaMA     2026.05   23
Semantic Chunking   Gemma     2026.05   23
Search Snippets     LLaMA     2026.05   22
Sentence Chunking   LLaMA     2026.05   22
Search Snippets     Mistral   2026.05   21
Semantic Chunking   Mistral   2026.05   21
Search Snippets     Gemma     2026.05   20
CONCRETE            LLaMA     2026.05   18
CONCRETE            Mistral   2026.05   18
CONCRETE            Gemma     2026.05   15