Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MergeBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-task Language ModelingMergeBench
Instruction Score39.56
11
Vision-Language Multi-task PerformanceMergeBench (Vision-Language tasks: MMSI-Bench, EmbSpatial, MMMU_Med, PathVQA, OCRBench, CharXiv)
MMSI-Bench32.6
11
Showing 2 of 2 rows