Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Human-likeness classification on S2S Turing (test)

96.48Accuracy (H-H)

GPT-4o-Audio-Preview

7.861630.868353.87576.8817Feb 27, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
96.4827.080.6941.16
81.6915.2812.536.28
78.1723.6123.6141.63
77.4614.5822.2237.91
70.2883.5763.8472.84
67.6143.0629.8646.74
57.7572.9257.6462.79
51.4150.6938.8946.98
2026.02
46.4844.4440.2843.72
11.2784.7272.9256.51