VISOR

Benchmarks

Task Name	Dataset Name	SOTA Result
Text-to-Image Generation	VISOR	OA (%)77.28	21
Egocentric Referring Video Object Segmentation	VISOR (val)	mIoU67	10
Segmentation	VISOR	mIoU61.8	9
3D Hand Mesh Reconstruction	HInt VISOR All Joints (test)	PCK@0.0547.2	8
Temporal contact detection	VISOR (val)	BC87.3	5
Referring Video Object Segmentation	VISOR hard	mIoU62.3	4
Referring Video Object Segmentation	VISOR novel	mIoU60	4
Object Segmentation	VISOR	J&F Score76.6	2
Segment Anything	VISOR	PAvPU51.9	2

Showing 9 of 9 rows