Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Pairwise Comparison on DeepfakeJudge Meta
Loading...
96.2
Pairwise Accuracy
DeepfakeJudge-7B
73.944
79.722
85.5
91.278
Feb 23, 2026
Pairwise Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Pairwise Accuracy
DeepfakeJudge-7B
Type=Ours
2026.02
96.2
DeepfakeJudge-3B
Type=Ours
2026.02
94.4
Qwen-3-VL-235B-Instruct
Type=Open
2026.02
93.2
Qwen-3-VL-30B-Thinking
Type=Thinking
2026.02
92.5
Gemini-Flash-2.5
Type=Closed
2026.02
91.7
Qwen-3-VL-30B-Instruct
Type=Open
2026.02
91.3
Qwen-3-VL-235B-Thinking
Type=Thinking
2026.02
90.8
GPT-4o-Mini
Type=Closed
2026.02
90.3
Qwen-3-VL-8B-Thinking
Type=Thinking
2026.02
89.2
Qwen-3-VL-8B-Instruct
Type=Open
2026.02
86
Qwen-3-VL-4B-Instruct
Type=Open
2026.02
75.8
Qwen-3-VL-2B-Instruct
Type=Open
2026.02
74.8
Feedback
Search any
task
Search any
task