Enhanced-alignment Measure for Binary Foreground Map Evaluation
About
The existing binary foreground map (FM) measures to address various types of errors in either pixel-wise or structural ways. These measures consider pixel-level match or image-level information independently, while cognitive vision studies have shown that human vision is highly sensitive to both global information and local details in scenes. In this paper, we take a detailed look at current binary FM evaluation measures and propose a novel and effective E-measure (Enhanced-alignment measure). Our measure combines local pixel values with the image-level mean value in one term, jointly capturing image-level statistics and local pixel matching information. We demonstrate the superiority of our measure over the available measures on 4 popular datasets via 5 meta-measures, including ranking models for applications, demoting generic, random Gaussian noise maps, ground-truth switch, as well as human judgments. We find large improvements in almost all the meta-measures. For instance, in terms of application ranking, we observe improvementrangingfrom9.08% to 19.65% compared with other popular measures.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Meta-Measure 4 (Structural Sensitivity - Dilate) | Trans10K | Error Rate18 | 7 | |
| Meta-Measure 4 (Structural Sensitivity - Erode) | COD10K | Error Rate1.79 | 7 | |
| Meta-Measure 4 (Structural Sensitivity - Erode) | NC4K | Error Rate1.16 | 7 | |
| Meta-Measure 4 (Structural Sensitivity - Erode) | Trans10K | Error Rate38 | 7 | |
| Meta-Measure 4 (Structural Sensitivity - Dilate) | COD10K | Error Rate1.16 | 7 | |
| Meta-Measure 4 (Structural Sensitivity - Dilate) | NC4K | Error Rate0.93 | 7 | |
| Meta-Measure 3 (Noise Sensitivity) | Trans10K | Error Rate5.62 | 7 | |
| Meta-Measure 1 (Semantic Alignment) | CamoHR | Error Rate18 | 7 | |
| Meta-Measure 2 (Ground-Truth Switch) | COD10K | Error Rate3.46 | 7 | |
| Meta-Measure 2 (Ground-Truth Switch) | NC4K | Error Rate237 | 7 |