Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Binary Classification on Turing Test (Human-Human)
Loading...
95.07
Accuracy
interpretable AI judge
69.2884
75.9817
82.675
89.3683
Feb 27, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
interpretable AI judge
2026.02
95.07
Qwen2.5-Omni
Fine-tuned=LoRA
2026.02
92.3
Qwen2.5-Omni
2026.02
78.17
Human Judge
2026.02
70.28
Feedback
Search any
task
Search any
task