FEVER

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Fact Verification | FEVER | Accuracy 53.9 | 67 |
| Fact Verification | FEVER (dev) | Label Accuracy 82.1 | 57 |
| Fact Verification | FEVER (test) | LA Score 79.47 | 32 |
| Fact Verification | FEVER 1.0 (dev) | Label Accuracy 89.07 | 23 |
| Fact Extraction and Verification | FEVER (test) | Label Accuracy (LA) 75.96 | 18 |
| Explanation Evaluation | FEVER (test) | Sufficiency 9.72 | 16 |
| Fact Verification | FEVER-Symmetric | Precision 88 | 16 |
| Fact-checking | FEVER | F1 Macro 94.3 | 14 |
| Fact Verification | FEVER 1.0 (test) | Label Accuracy 74.07 | 14 |
| Classification | FEVER Symmetric v2 1.0 | Accuracy 69.1 | 13 |
| Classification | FEVER v1 (ID) | Accuracy 87.5 | 13 |
| Fact Verification | FEVER-S | Accuracy 54 | 12 |
| Fact Verification | FEVER | Accuracy 61.4 | 12 |
| Fact-verification | FEVER | Accuracy 73.73 | 11 |
| Sentence-Level Confidence Prediction | FEVER | AUROC 0.7 | 10 |
| global fact consistency verification | FEVER | Precision 99.5 | 10 |
| Fact checking | FEVER v1.0 (dev) | Acc 55.1 | 10 |
| Claim Verification | FEVER (test) | Accuracy 72.5 | 10 |
| Fact Verification | FEVER | Accuracy 78 | 9 |
| Neural Caching | FEVER | Online Accuracy (AUC) 75.3 | 9 |
| Fact Verification | Symmetric FEVER 1.0 (test) | Accuracy 85.88 | 9 |
| Fact Extraction and Verification | FEVER (dev) | Label Accuracy (LA) 76.3 | 9 |
| Information Retrieval | FEVER (test) | NDCG@10 0.796 | 9 |
| Fact Verification | FEVER S R | Precision 95.2 | 8 |
| Fact Extraction and Verification | FEVER leaderboard March 2019 (test) | Evidence F1 77.7 | 8 |
Showing 25 of 46 rows