Binary classification (Human vs Machine speech) on MultiDialog (Human-Human) OOD (test)

95.31Accuracy

interpretable AI judge

Updated 4mo ago

Evaluation Results

Method	Links
interpretable AI judge 2026.02		95.31