Segment Anything
About
We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | ADE20K (val) | mIoU33.63 | 2888 | |
| Image Classification | ImageNet-1K | Top-1 Acc22.12 | 1239 | |
| Instance Segmentation | COCO 2017 (val) | -- | 1201 | |
| Video Object Segmentation | DAVIS 2017 (val) | J mean79 | 1193 | |
| Semantic segmentation | ADE20K | mIoU28.08 | 1024 | |
| Image Deblurring | GoPro (test) | PSNR27.491 | 617 | |
| Video Instance Segmentation | YouTube-VIS 2019 (val) | AP51.8 | 604 | |
| Instance Segmentation | COCO (val) | APmk46.5 | 475 | |
| Salient Object Detection | DUTS (test) | M (MAE)0.058 | 325 | |
| Object Counting | FSC-147 (test) | MAE42.48 | 322 |