Binary classification (Human vs Machine speech) on MultiDialog (Human-Human) OOD (test)

Accuracy: 95.31

interpretable AI judge

[Chart: Accuracy by date; Feb 27, 2026]
Updated 1mo ago

Evaluation Results

Method | Links
95.31