
Editing Alignment Assessment on HPE-Bench 1.0 (test)

0.8591 SRCC

Layer-Selective MLLM

[Chart: Jan 15, 2026]
Updated 4d ago

Evaluation Results

Method  Links

Date     SRCC
2026.01  0.8591  0.7243  0.8714
2026.01  0.8479  0.6785  0.8481
2026.01  0.8437  0.6546  0.8436
2026.01  0.8361  0.6879  0.8322
2026.01  0.5328  0.3928  0.5827
2026.01  0.485   0.3549  0.4883
2026.01  0.4518  0.3717  0.5415
2026.01  0.4403  0.3072  0.5194
2026.01  0.3602  0.2346  0.3416
2026.01  0.3437  0.2406  0.0357
2026.01  0.3354  0.317   0.3552
2026.01  0.3132  0.2584  0.3979
2026.01  0.3079  0.2101  0.2904
2026.01  0.2959  0.2221  0.3584
2026.01  0.2659  0.1832  0.2716
2026.01  0.2587  0.1819  0.2822
2026.01  0.2493  0.143   0.2348
2026.01  0.2347  0.1364  0.2607
2026.01  0.2317  0.1899  0.2391
2026.01  0.2264  0.1528  0.006
2026.01  0.2156  0.1555  0.2609
2026.01  0.2099  0.1328  0.2355
2026.01  0.2066  0.1408  0.277
2026.01  0.1961  0.1293  0.1382
2026.01  0.1739  0.1618  0.2445
2026.01  0.1655  0.1093  0.2206
2026.01  0.1506  0.1213  0.0288
2026.01  0.1386  0.0941  0.1497
2026.01  0.118   0.071   0.1656
2026.01  0.0937  0.0769  0.0755
2026.01  0.071   0.0565  0.2816
2026.01  0.0515  0.031   0.1303
2026.01  0.0089  0.0071  0.0732
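
SRCC, the headline metric above, is the Spearman rank correlation coefficient: the Pearson correlation between the ranks of two score lists. The sketch below shows one common way to compute it in plain Python, assuming tied values receive the average of their rank positions. The function names and the sample scores are hypothetical, and the benchmark's own evaluation protocol may differ in details such as tie handling or score preprocessing.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def srcc(predicted, human):
    """Spearman rank correlation: Pearson correlation computed on ranks."""
    if len(predicted) != len(human) or len(predicted) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    return pearson(average_ranks(predicted), average_ranks(human))


# Hypothetical scores, for illustration only (not benchmark data).
human = [1.0, 2.5, 3.0, 4.2, 4.8]
predicted = [0.1, 0.4, 0.3, 0.8, 0.9]
print(round(srcc(predicted, human), 4))
```

A value near 1 means the predicted scores order the samples almost exactly as the reference scores do; a value near 0 means the two orderings are essentially unrelated.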