CrossScore: Towards Multi-View Image Evaluation and Scoring

About

We introduce a novel cross-reference image quality assessment method that effectively fills the gap in the image assessment landscape, complementing the array of established evaluation schemes -- ranging from full-reference metrics like SSIM, no-reference metrics such as NIQE, to general-reference metrics including FID, and Multi-modal-reference metrics, e.g., CLIPScore. Utilising a neural network with the cross-attention mechanism and a unique data collection pipeline from NVS optimisation, our method enables accurate image quality assessment without requiring ground truth references. By comparing a query image against multiple views of the same scene, our method addresses the limitations of existing metrics in novel view synthesis (NVS) and similar tasks where direct reference images are unavailable. Experimental results show that our method is closely correlated to the full-reference metric SSIM, while not requiring ground truth references.

Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu• 2024

Related benchmarks

Task	Dataset	Result
Novel View Synthesis	Tanks&Temples (test)	PSNR15.86	323
Novel View Synthesis	Mip-NeRF 360 (test)	PSNR16.31	228
Image Quality Assessment Correlation	RealEstate10K	PLCC0.442	52
Image Quality Assessment Correlation	Mip-NeRF 360	PLCC0.29	39
Image Quality Assessment Correlation	Tanks&Temples	PLCC0.444	26
Image Quality Assessment	Tanks&Temples	PLCC0.312	26
Image Quality Assessment	Mip-NeRF 360	PLCC0.224	13
Image Quality Assessment	Tanks and Temples SEVA	DINOv2 PLCC0.204	6
Image Quality Assessment	Mip-NeRF 360 GEN3C	DINOv2 PLCC0.076	6
Image Quality Assessment	Mip-NeRF SEVA	DINOv2 PLCC-0.005	6

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord