Segment Anything
About
We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | ADE20K (val) | mIoU33.63 | 2731 | |
| Instance Segmentation | COCO 2017 (val) | -- | 1144 | |
| Video Object Segmentation | DAVIS 2017 (val) | J mean79 | 1130 | |
| Semantic segmentation | ADE20K | mIoU28.08 | 936 | |
| Image Deblurring | GoPro (test) | PSNR27.491 | 585 | |
| Video Instance Segmentation | YouTube-VIS 2019 (val) | AP51.8 | 567 | |
| Instance Segmentation | COCO (val) | APmk46.5 | 472 | |
| Salient Object Detection | DUTS (test) | M (MAE)0.058 | 302 | |
| Object Counting | FSC-147 (test) | MAE42.48 | 297 | |
| Interactive Segmentation | Berkeley | NoC@901.91 | 230 |