Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LIAR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Reasoning Quality Correlation AnalysisLIAR
Somers' D0.2769
45
Veracity PredictionLIAR RAW
Macro Precision47
24
Fact VerificationLIAR
F1 Score68.6
24
Multi-ClassificationLIAR Open
Accuracy46.81
23
Binary ClassificationLIAR Closed
Accuracy79.15
23
Multi-ClassificationLIAR Closed
Accuracy26.99
22
Binary ClassificationLIAR Open
Accuracy84.21
22
Fake News DetectionLIAR (test)
Accuracy65.2
21
Fact-checkingLIAR-RAW
Precision77.38
20
Fake News DetectionLIAR (val)
Accuracy27.7
13
Fact-checkingLIAR
Accuracy79
12
Claim VerificationLIAR (test)
Precision46.8
12
Veracity Explanation RankingLIAR RAW
Informativeness (MAR)2.09
12
Veracity PredictionLIAR-RAW (test)
Precision43.83
12
Fact-CheckingLIAR (test)
Accuracy68.2
11
Explanation GenerationLIAR-RAW (test)
ROU-125.5
11
Node classificationLIAR (test)
Fidelity100
8
Explainable Fake News DetectionLIAR RAW
Misleadingness1.85
7
Topic ModelingLIAR labeled holdout (test)
AUPC64.1
7
Explanation GenerationLIAR
Politeness97.3
4
Text Anomaly DetectionLiar2
AUPRC0.2937
3
Binary classificationLIAR (test)
Accuracy67.73
3
Showing 22 of 22 rows