Influence of Resampling on Accuracy of Imbalanced Classification

About

In many real-world binary classification tasks (e.g., detecting certain objects in images), the available dataset is imbalanced, i.e., it has far fewer representatives of one class (the minor class) than of the other. Accurate prediction of the minor class is generally crucial, but it is hard to achieve because little information about that class is available. One approach to this problem is to resample the dataset beforehand, i.e., to add new elements to the dataset or remove existing ones. Resampling can be done in various ways, which raises the problem of choosing the most appropriate one. In this paper we experimentally investigate the impact of resampling on classification accuracy, compare resampling methods, and highlight key points and difficulties of resampling.
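
To make the idea of adding or removing elements concrete, the sketch below shows two of the simplest resampling schemes: random oversampling (duplicating minor-class elements) and random undersampling (discarding major-class elements). It is only an illustration under stated assumptions: the function names `random_oversample` and `random_undersample` and the toy data are hypothetical, and these are not necessarily the methods compared in the paper.

```python
import random
from collections import Counter


def _split_classes(labels):
    """Return (major_label, minor_label) for a binary label list."""
    (major, _), (minor, _) = Counter(labels).most_common(2)
    return major, minor


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen minor-class items until both classes are equal in size."""
    rng = random.Random(seed)
    major, minor = _split_classes(labels)
    minor_idx = [i for i, y in enumerate(labels) if y == minor]
    n_missing = labels.count(major) - len(minor_idx)
    # Draw with replacement from the minor class to fill the gap.
    extra = [rng.choice(minor_idx) for _ in range(n_missing)]
    idx = list(range(len(labels))) + extra
    return [samples[i] for i in idx], [labels[i] for i in idx]


def random_undersample(samples, labels, seed=0):
    """Keep all minor-class items and an equally sized random subset of the major class."""
    rng = random.Random(seed)
    major, minor = _split_classes(labels)
    major_idx = [i for i, y in enumerate(labels) if y == major]
    n_minor = labels.count(minor)
    # Draw without replacement from the major class.
    keep = set(rng.sample(major_idx, n_minor))
    idx = [i for i, y in enumerate(labels) if y == minor or i in keep]
    return [samples[i] for i in idx], [labels[i] for i in idx]


# Toy imbalanced dataset (illustrative only): 8 major-class items, 2 minor-class items.
X = list(range(10))
y = [0] * 8 + [1] * 2

X_over, y_over = random_oversample(X, y)
X_under, y_under = random_undersample(X, y)
print(Counter(y_over), Counter(y_under))
```

Oversampling keeps all available information but repeats minor-class elements, while undersampling discards part of the major class; which trade-off is preferable depends on the dataset and classifier, which is the kind of question the paper studies experimentally.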

Evgeny Burnaev, Pavel Erofeev, Artem Papanov • 2017

Related benchmarks

Task	Dataset	Result
Scene Graph Classification	VG150 (test)	mR@50 11	66
Scene Graph Classification	Visual Genome	R@50 11	45
Predicate Classification	Visual Genome (VG) 150 object categories, 50 relationship categories (test)	mR@100 20	44
Scene Graph Detection	VG150	R@50 30.5	39
Predicate Classification	Visual Genome (VG) Zero-Shot	Recall@50 11.1	19
Scene Graph Classification	Visual Genome (VG) Zero-Shot	R@50 2.3	19
Sentence-to-Graph Retrieval	Visual Genome Gallery Size 1000 (test)	Recall@20 13.1	19
Scene Graph Detection	Visual Genome (VG) Zero-Shot	R@50 10	19
Sentence-to-Graph Retrieval	Visual Genome Gallery Size 5000 (test)	R@20 2.5	19
