CrossScore: Towards Multi-View Image Evaluation and Scoring
About
We introduce a novel cross-reference image quality assessment method that effectively fills the gap in the image assessment landscape, complementing the array of established evaluation schemes -- ranging from full-reference metrics like SSIM, no-reference metrics such as NIQE, to general-reference metrics including FID, and Multi-modal-reference metrics, e.g., CLIPScore. Utilising a neural network with the cross-attention mechanism and a unique data collection pipeline from NVS optimisation, our method enables accurate image quality assessment without requiring ground truth references. By comparing a query image against multiple views of the same scene, our method addresses the limitations of existing metrics in novel view synthesis (NVS) and similar tasks where direct reference images are unavailable. Experimental results show that our method is closely correlated to the full-reference metric SSIM, while not requiring ground truth references.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | Tanks&Temples (test) | PSNR15.86 | 289 | |
| Novel View Synthesis | Mip-NeRF 360 (test) | PSNR16.31 | 199 | |
| Image Quality Assessment Correlation | RealEstate10K | PLCC0.442 | 52 | |
| Image Quality Assessment Correlation | Mip-NeRF 360 | PLCC0.29 | 39 | |
| Image Quality Assessment Correlation | Tanks&Temples | PLCC0.444 | 26 | |
| Image Quality Assessment | Tanks&Temples | PLCC0.312 | 26 | |
| Image Quality Assessment | Mip-NeRF 360 | PLCC0.224 | 13 | |
| Image Quality Assessment | Tanks and Temples SEVA | DINOv2 PLCC0.204 | 6 | |
| Image Quality Assessment | Mip-NeRF 360 GEN3C | DINOv2 PLCC0.076 | 6 | |
| Image Quality Assessment | Mip-NeRF SEVA | DINOv2 PLCC-0.005 | 6 |