
Segment Anything

About

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy-respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive, often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick • 2023

Related benchmarks

Task                        | Dataset                | Result        | Rank
Semantic segmentation       | ADE20K (val)           | mIoU 33.63    | 2731
Instance Segmentation       | COCO 2017 (val)        | --            | 1144
Video Object Segmentation   | DAVIS 2017 (val)       | J mean 79     | 1130
Semantic segmentation       | ADE20K                 | mIoU 28.08    | 936
Image Deblurring            | GoPro (test)           | PSNR 27.491   | 585
Video Instance Segmentation | YouTube-VIS 2019 (val) | AP 51.8       | 567
Instance Segmentation       | COCO (val)             | APmk 46.5     | 472
Salient Object Detection    | DUTS (test)            | M (MAE) 0.058 | 302
Object Counting             | FSC-147 (test)         | MAE 42.48     | 297
Interactive Segmentation    | Berkeley               | NoC@90 1.91   | 230
Showing 10 of 465 rows
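
Several rows above report mIoU (mean intersection over union). As a rough illustration of what that metric measures, here is a minimal pure-Python sketch. The function names `class_iou` and `mean_iou` are hypothetical, and the sketch assumes flattened integer label maps and averages over the classes present in either map. Actual benchmark protocols (ignore labels, dataset-level accumulation, percentage scaling) may differ from this.

```python
from typing import Optional, Sequence


def class_iou(pred: Sequence[int], target: Sequence[int], cls: int) -> Optional[float]:
    """IoU of one class over two flattened label maps.

    Returns None when the class appears in neither map, so it can be
    skipped when averaging (an assumption; conventions vary).
    """
    intersection = sum(1 for p, t in zip(pred, target) if p == cls and t == cls)
    union = sum(1 for p, t in zip(pred, target) if p == cls or t == cls)
    return intersection / union if union else None


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Average per-class IoU over classes present in pred or target."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")
    ious = [class_iou(pred, target, c) for c in range(num_classes)]
    valid = [v for v in ious if v is not None]
    if not valid:
        raise ValueError("no class is present in either label map")
    return sum(valid) / len(valid)


# Toy example: six pixels, three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")  # per-class IoUs are 1/3, 2/3, 1/2
```

The function returns a fraction in [0, 1]; the table values appear to be on a 0 to 100 scale, which would correspond to multiplying by 100.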
