Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Error Detection on Digits
Loading...
88.1
F1 Score
TRM
45.044
56.222
67.4
78.578
Jun 29, 2021
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
TRM
Model architecture=Typ...
2021.06
88.1
Proxy Risk
Model architecture=Typ...
2021.06
84.4
TRM
Model architecture=DAN...
2021.06
84.1
Proxy Risk
Model architecture=DAN...
2021.06
79.6
TRI
Model architecture=DAN...
2021.06
70.1
TRI
Model architecture=Typ...
2021.06
69.8
Trust Score
Model architecture=Typ...
2021.06
49.6
MSP
Model architecture=DAN...
2021.06
48.5
Trust Score
Model architecture=DAN...
2021.06
48.4
MSP
Model architecture=Typ...
2021.06
46.7
Feedback
Search any
task
Search any
task