Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Attributing Multimodal Foundation Model Errors on ImageNet misclassified samples (val)

66.39Avg Highest Confidence (0-25%)

LIMA

1.6518.457535.26552.0725Apr 1, 2025
Updated 14d ago

Evaluation Results

MethodLinks
2025.04
66.3974.8777.3678.7242.37
2025.04
64.473.0776.7678.7841.51
2025.04
56.7770.374.7975.8347.89
2025.04
54.0163.9967.5369.5534.75
2025.04
53.5168.1674.4575.5746.81
2025.04
44.3658.6561.5763.4839.29
2025.04
38.4353.1158.2459.7834.83
2025.04
27.139.8545.5848.7918.27
2025.04
24.2740.4547.5751.6422.37
2025.04
22.0234.0437.6741.2313.97
2025.04
21.4827.2330.5134.6210.62
2025.04
20.529.3333.635.0618.12
2025.04
20.1230.7335.8239.8920.17
2025.04
17.1127.2632.636.3621.8
2025.04
14.0428.0237.5542.4517.21
2025.04
12.9925.9335.3242.6111.29
2025.04
12.6823.6731.3739.0212.36
2025.04
10.6419.0126.5834.6311.24
2025.04
10.0418.6125.6133.9710.86
2025.04
9.6719.0225.329.215
2025.04
7.8713.817.7921.128.61
2025.04
4.1411.2316.2823.488.16