Textual Training for the Hassle-Free Removal of Unwanted Visual Data: Case Studies on OOD and Hateful Image Detection

About

In our study, we explore methods for detecting unwanted content lurking in visual datasets. We provide a theoretical analysis demonstrating that a model capable of successfully partitioning visual data can be obtained using only textual data. Based on the analysis, we propose Hassle-Free Textual Training (HFTT), a streamlined method capable of acquiring detectors for unwanted visual content, using only synthetic textual data in conjunction with pre-trained vision-language models. HFTT features an innovative objective function that significantly reduces the necessity for human involvement in data annotation. Furthermore, HFTT employs a clever textual data synthesis method, effectively emulating the integration of unknown visual data distribution into the training process at no extra cost. The unique characteristics of HFTT extend its utility beyond traditional out-of-distribution detection, making it applicable to tasks that address more abstract concepts. We complement our analyses with experiments in out-of-distribution detection and hateful image detection. Our codes are available at https://github.com/Saehyung-Lee/HFTT

Saehyung Lee, Jisoo Mok, Sangha Park, Yongho Shin, Dahuin Jung, Sungroh Yoon• 2024

Related benchmarks

Task	Dataset	Result
Out-of-Distribution Detection	SUN OOD with ImageNet-1k In-distribution (test)	AUROC95.28	267
Out-of-Distribution Detection	ImageNet-1k ID iNaturalist OOD	FPR9527.44	143
Out-of-Distribution Detection	Places with ImageNet-1k OOD In-distribution (test)	FPR9543.54	99
Out-of-Distribution Detection	ImageNet-1K OOD Average	AUROC91.76	71
Out-of-Distribution Detection	ImageNet-1k (ID) with 4 OOD datasets (iNaturalist, SUN, Places, Textures)	FPR9533.33	69
Out-of-Distribution Detection	ImageNet-1k vs Texture	AUROC88.23	18
Hateful Image Detection	Hate dataset (in-distribution) vs SUN (out-of-distribution) (test)	FPR1.05	7
Hateful Image Detection	Hate dataset (in-distribution) vs Places (out-of-distribution) (test)	FPR4.38	7
Hateful Image Detection	Hate dataset (in-distribution) vs Texture (out-of-distribution) (test)	FPR1.73	7
Hateful Image Detection	Hate dataset (in-distribution) vs NINCO (out-of-distribution) (test)	FPR4.18	7

Showing 10 of 17 rows

Other info

Code

Follow for update

@wizwand_team Discord