Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PMR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Procedural Multimedia ReasoningPMR (test)
Accuracy84.7
15
Pairwise classificationPMR-Synth (Total)
Accuracy73
14
Pairwise classificationPMR-Synth Med
Accuracy77
14
Pairwise classificationPMR-Reddit (Med)
Accuracy86
14
Pairwise classificationPMR-Reddit Easy
Accuracy98
14
Inbox SortingPMR-Real (test)
T-NDCG@1077
14
Pairwise classificationPMR-Real (Total)
Accuracy77
13
Pairwise classificationPMR-Real (Hard)
Accuracy0.72
13
Pairwise classificationPMR-Real Med
Accuracy82
13
Pairwise classificationPMR-Real (Easy)
Accuracy92
13
Procedural Multimedia ReasoningPMR (val)
Accuracy85.8
8
Showing 11 of 11 rows