Binary Classification on Turing Test (Human-Human)

95.07 Accuracy

interpretable AI judge

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy
interpretable AI judge 2026.02		95.07
Qwen2.5-Omni 2026.02		92.3
Qwen2.5-Omni 2026.02		78.17
Human Judge 2026.02		70.28