Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
System-level Correlation with Human Judgment on MS COCO M1 (system-level)
Loading...
0.986
Pearson r
BERTScore
-0.88392
-0.39846
0.087
0.57246
Jun 2, 2021
Pearson r
P-Value
Updated 4d ago
Evaluation Results
Method
Method
Links
Pearson r
P-Value
BERTScore
2021.06
0.986
0.014
SMURF
2021.06
0.984
0.016
SPICE
2021.06
0.956
0.044
SPURTS
2021.06
0.956
0.044
SPARCS
2021.06
0.874
0.126
METEOR
2021.06
0.479
0.521
BS-w/oidf
2021.06
0.374
0.626
CIDEr
2021.06
0.023
0.977
Bleu-1
2021.06
-0.279
0.721
Bleu-2
2021.06
-0.709
0.291
Rouge-L
2021.06
-0.812
0.188
Feedback
Search any
task
Search any
task