DeepfakeJudge

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Reasoning Evaluation | DeepfakeJudge Reason 1.0 (test) | BLEU-1: 9 | 16 |
| Deepfake Detection | DeepfakeJudge-Detect (test) | Accuracy (Real): 96.6 | 15 |
| Pointwise Reasoning Evaluation | DeepfakeJudge Meta-Human | RMSE: 0.5 | 12 |
| Pointwise Reasoning Evaluation | DeepfakeJudge Meta | RMSE: 0.61 | 12 |
| Pairwise Comparison | DeepfakeJudge Meta-Human | Pairwise Accuracy: 99.4 | 12 |
| Pairwise Comparison | DeepfakeJudge Meta | Pairwise Accuracy: 96.2 | 12 |