Pairwise Comparison on DeepfakeJudge Meta-Human
Best Pairwise Accuracy: 99.4 (Qwen-3-VL-235B-Instruct, Feb 23, 2026)
Evaluation Results
| Method | Type | Date | Pairwise Accuracy |
|---|---|---|---|
| Qwen-3-VL-235B-Instruct | Open | 2026.02 | 99.4 |
| DeepfakeJudge-7B | Ours | 2026.02 | 98.9 |
| Qwen-3-VL-30B-Thinking | Thinking | 2026.02 | 97.7 |
| DeepfakeJudge-3B | Ours | 2026.02 | 96.6 |
| Qwen-3-VL-30B-Instruct | Open | 2026.02 | 96.3 |
| Qwen-3-VL-235B-Thinking | Thinking | 2026.02 | 95.5 |
| Gemini-Flash-2.5 | Closed | 2026.02 | 94.2 |
| Qwen-3-VL-8B-Thinking | Thinking | 2026.02 | 93.2 |
| GPT-4o-Mini | Closed | 2026.02 | 89.8 |
| Qwen-3-VL-8B-Instruct | Open | 2026.02 | 88.6 |
| Qwen-3-VL-4B-Instruct | Open | 2026.02 | 72.7 |
| Qwen-3-VL-2B-Instruct | Open | 2026.02 | 65.1 |