Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cross-modal Hallucination Detection on Curse of Multi-Modalities (CMM) 1.0 (test)

96.5VL Precision (pa)

Qwen 3 Omni

74.1479.94585.7591.555Mar 3, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
96.577.5938897.59495.56094.56793659575.3
2026.03
92.591.542.530.577.772.95724.477.467.78655.572.257.1
2026.03
92.598.592.785.894.595.591.375.189.184.374.983.889.287.2
2026.03
91.598.592.383.294.594.592.374.288.483.873.983.188.886.2
2026.03
90.595.5927691.58682.584.88978.982.473.78882.5
2026.03
9096.592.577.591.58783.585.688.479.683.875.188.383.6
2026.03
899889.583.5949388.869.986.181.372.282.686.684.7
2026.03
88.58174.59098729163.59635877589.269.4
889889.984.49393.587.867.785.380.774.283.286.484.6
2026.03
88959176.588.58479.178.888.477.883.271.686.480.6
879490.574878479.580.587.476.48273.785.680.4
2026.03
879389.574.58684.579.579.587.87681.574.385.280.3
2026.03
8698.589.586.592.494.486.763.584.380.472.584.185.284.6
758677.59478986275.5809057.54371.781.1