
VISOR

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Text-to-Image Generation | VISOR | OA (%) 77.28 | 21 |
| Egocentric Referring Video Object Segmentation | VISOR (val) | mIoU 67 | 10 |
| Segmentation | VISOR | mIoU 61.8 | 9 |
| 3D Hand Mesh Reconstruction | HInt VISOR All Joints (test) | PCK@0.05 47.2 | 8 |
| Temporal contact detection | VISOR (val) | BC 87.3 | 5 |
| Referring Video Object Segmentation | VISOR hard | mIoU 62.3 | 4 |
| Referring Video Object Segmentation | VISOR novel | mIoU 60 | 4 |