Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VISOR

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Image GenerationVISOR
OA (%)77.28
21
Egocentric Referring Video Object SegmentationVISOR (val)
mIoU67
10
SegmentationVISOR
mIoU61.8
9
3D Hand Mesh ReconstructionHInt VISOR All Joints (test)
PCK@0.0547.2
8
Temporal contact detectionVISOR (val)
BC87.3
5
Referring Video Object SegmentationVISOR hard
mIoU62.3
4
Referring Video Object SegmentationVISOR novel
mIoU60
4
Object SegmentationVISOR
J&F Score76.6
2
Segment AnythingVISOR
PAvPU51.9
2
Showing 9 of 9 rows