Paraphrase Detection on QQP
Top result: PERFECT, 71.2 Average Accuracy (Apr 3, 2022)
Metrics: Average Accuracy, Worst-case Accuracy, Standard Deviation
Evaluation Results
Method                  Setting                    Date     Average Accuracy  Worst-case Accuracy  Standard Deviation
PERFECT                 Initialization=random      2022.04  71.2              64.2                 3.5
PERFECT                 Initialization=prototy...  2022.04  71.1              65.6                 3.5
PET                     Strategy=Best              2022.04  70.7              55.2                 5.8
Logan IV et al. (2021)  -                          2022.04  70.4              62.7                 3.4
bitfit+mte              Mode=Ablation              2022.04  69.4              63                   3.9
FINETUNE                -                          2022.04  65                59.8                 3.6
PET                     Strategy=Average           2022.04  63.4              44.7                 7.9
prompt+mte              Mode=Ablation              2022.04  54.3              46.2                 5.6