Human-Aligned Evaluation of a Pixel-wise DNN Color Constancy Model
About
We previously investigated color constancy in photorealistic virtual reality (VR) and developed a Deep Neural Network (DNN) that predicts reflectance from rendered images. Here, we combine both approaches to compare and study a model and human performance with respect to established color constancy mechanisms: local surround, maximum flux and spatial mean. Rather than evaluating the model against physical ground truth, model performance was assessed using the same achromatic object selection task employed in the human experiments. The model, a ResNet based U-Net from our previous work, was pre-trained on rendered images to predict surface reflectance. We then applied transfer learning, fine-tuning only the network's decoder on images from the baseline VR condition. To parallel the human experiment, the model's output was used to perform the same achromatic object selection task across all conditions. Results show a strong correspondence between the model and human behavior. Both achieved high constancy under baseline conditions and showed similar, condition-dependent performance declines when the local surround or spatial mean color cues were removed.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Color Constancy | Outdoor scenes | ΔCCI (%)-78.39 | 32 | |
| Color Constancy | All scenes (combined) | ΔCCI%-82.76 | 32 | |
| Color Constancy | Indoor Scenes (IN) | ΔCCI (%)-87.137 | 32 | |
| Color Constancy Evaluation | Human CCI Data Indoor scenes ΔCCI | Accuracy79.7 | 14 | |
| Color Constancy Evaluation | Human CCI Data All conditions, ΔCCI (All) | Accuracy52.1 | 7 | |
| Color Constancy Evaluation | Human CCI Data Outdoor scenes ΔCCI (Out) | Accuracy66.4 | 7 | |
| Color Constancy Index modeling | CCI All scenes (combined) | Accuracy55.2 | 7 | |
| Color Constancy Index modeling | CCI Outdoor scenes | Accuracy67.6 | 7 |