Segment Anything

About

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and tasks. We evaluate its capabilities on numerous tasks and find that its zero-shot performance is impressive -- often competitive with or even superior to prior fully supervised results. We are releasing the Segment Anything Model (SAM) and corresponding dataset (SA-1B) of 1B masks and 11M images at https://segment-anything.com to foster research into foundation models for computer vision.

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Doll\'ar, Ross Girshick• 2023

Related benchmarks

Task	Dataset	Result
Semantic segmentation	ADE20K (val)	mIoU33.63	3069
Instance Segmentation	COCO 2017 (val)	--	1275
Image Classification	ImageNet-1K	Top-1 Acc22.12	1239
Video Object Segmentation	DAVIS 2017 (val)	J mean79	1226
Semantic segmentation	ADE20K	mIoU28.08	1028
Image Deblurring	GoPro (test)	PSNR27.491	672
Video Instance Segmentation	YouTube-VIS 2019 (val)	AP51.8	604
Semantic segmentation	Cityscapes	mIoU57.15	494
Instance Segmentation	COCO (val)	APmk46.5	485
Salient Object Detection	DUTS (test)	M (MAE)0.058	357

Showing 10 of 769 rows

...

Other info

Code

Follow for update

@wizwand_team Discord