Defective Dialog Detection on OOD Shopping n = 105 (test)
[Chart: Precision by date (Jun 6, 2023); DQM: 48. Selectable metrics: Precision, Recall, F1 Score.]
Updated 4d ago
Evaluation Results

Method                         Settings                   Date     Precision  Recall  F1 Score
DQM                            model_type=supervised...   2023.06  48         80      60
Last-turn TLD                  aggregation_strategy=l...  2023.06  47         23      31
Rising linear weights          aggregation_strategy=l...  2023.06  41         67      51
Mean TLD                       aggregation_strategy=mean  2023.06  39         77      52
Union of mean & last-turn TLD  aggregation_strategy=u...  2023.06  38         77      51
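
The reported F1 scores can be cross-checked against the reported precision and recall. The sketch below assumes that F1 is the standard harmonic mean of precision and recall and that the page shows values rounded to whole numbers; neither is stated on the page itself. The function name `f1_score` and the `rows` dictionary are illustrative only.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the results above.
rows = {
    "DQM": (48, 80, 60),
    "Last-turn TLD": (47, 23, 31),
    "Rising linear weights": (41, 67, 51),
    "Mean TLD": (39, 77, 52),
    "Union of mean & last-turn TLD": (38, 77, 51),
}

for name, (p, r, reported) in rows.items():
    computed = f1_score(p, r)
    # Assumption: the page rounds F1 to the nearest integer.
    print(f"{name}: computed F1 = {computed:.1f}, reported = {reported}")
```

Under these assumptions, each computed value rounds to the reported F1. DQM leads on all three metrics, while Last-turn TLD has nearly the same precision (47 vs. 48) but much lower recall (23 vs. 80).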