Defective Dialog Detection on Multi-domain n = 714 (test)
[Chart: Precision. Highlighted point: Mean TLD, 84 (Jun 6, 2023). Metric options: Precision, Recall, F1-Score.]
Updated 4d ago
Evaluation Results
Method                         Method (setting)           Links    Precision  Recall  F1-Score
Mean TLD                       aggregation_strategy=mean  2023.06  84         54      66
Last-turn TLD                  aggregation_strategy=l...  2023.06  83         68      75
Rising linear weights          aggregation_strategy=l...  2023.06  83         63      72
Union of mean & last-turn TLD  aggregation_strategy=u...  2023.06  82         73      77
DQM                            model_type=supervised...   2023.06  78         83      81
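
The F1-Score column can be sanity-checked against the Precision and Recall columns. The sketch below assumes F1 is the standard harmonic mean of precision and recall, which the page does not state explicitly. The helper names `f1_score` and `check_rows` are illustrative, and the one-point tolerance is an assumption chosen to absorb rounding in the displayed values.

```python
# Rows copied from the results above: (method, precision, recall, reported F1), in percent.
ROWS = [
    ("Mean TLD", 84, 54, 66),
    ("Last-turn TLD", 83, 68, 75),
    ("Rising linear weights", 83, 63, 72),
    ("Union of mean & last-turn TLD", 82, 73, 77),
    ("DQM", 78, 83, 81),
]


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def check_rows(rows, tolerance=1.0):
    """Recompute F1 per row and flag whether it is within `tolerance` points
    of the reported value. The tolerance is an assumption to absorb rounding."""
    results = []
    for method, precision, recall, reported in rows:
        recomputed = f1_score(precision, recall)
        within = abs(recomputed - reported) <= tolerance
        results.append((method, round(recomputed, 2), reported, within))
    return results


if __name__ == "__main__":
    for method, recomputed, reported, within in check_rows(ROWS):
        status = "ok" if within else "mismatch"
        print(f"{method}: recomputed={recomputed}, reported={reported} ({status})")
```

Under these assumptions every row agrees to within one point. DQM recomputes to about 80.4 against a reported 81, a gap that would be consistent with the page displaying rounded precision and recall.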