Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Segment Anything

About

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Doll\'ar, Ross Girshick• 2023

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU33.63
2888
Image ClassificationImageNet-1K
Top-1 Acc22.12
1239
Instance SegmentationCOCO 2017 (val)--
1201
Video Object SegmentationDAVIS 2017 (val)
J mean79
1193
Semantic segmentationADE20K
mIoU28.08
1024
Image DeblurringGoPro (test)
PSNR27.491
617
Video Instance SegmentationYouTube-VIS 2019 (val)
AP51.8
604
Instance SegmentationCOCO (val)
APmk46.5
475
Salient Object DetectionDUTS (test)
M (MAE)0.058
325
Object CountingFSC-147 (test)
MAE42.48
322
Showing 10 of 664 rows
...

Other info

Code

Follow for update