A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length

About

Existing granular-ball classification methods are often driven by handcrafted quality measures, neighborhood rules, or heuristic splitting and stopping criteria, which may reduce the transparency of local construction decisions and hinder explicit modeling of boundary-sensitive regions. To address this issue, this paper proposes a Minimum Description Length based Granular-Ball Classifier (MDL-GBC), a boundary-aware non-parametric and interpretable granular-ball classifier. MDL-GBC formulates class-conditional granular-ball construction as a local model selection problem under the Minimum Description Length principle. For each class, samples from the target class provide positive class evidence, while samples from the remaining classes provide negative boundary evidence. For each current granular ball, three candidate explanations are compared under a unified description-length criterion: a single-ball model, a two-ball model, and a core-boundary model. The selected model determines whether the ball is retained, geometrically split, or refined into core and boundary-sensitive child balls, thereby making local construction decisions consistent with the MDL-based classification mechanism. During prediction, a class-level mixture coding rule aggregates stable granular balls of the same class and assigns the test sample by comparing class-wise coding costs. Experiments on 18 benchmark datasets show that MDL-GBC achieves competitive classification performance against classical classifiers and representative granular-ball-based methods, obtaining the best average Accuracy, Macro-F1, and average rank. These results indicate that MDL-GBC provides an effective and interpretable alternative to conventional heuristic granular-ball classification strategies.

Zeqiang Xian, Caihui Liu, Yong Zhang, Wenjing Qiu, Duoqian Miao, Witold Pedrycz• 2026

Related benchmarks

Task	Dataset	Result
Classification	Shuttle	Accuracy99.77	28
Classification	Thyroid	Accuracy94.83	27
Classification	Heart	--	24
Classification	HTRU2	Accuracy97.76	15
Classification	Wine	Accuracy96.67	13
Classification	Selected 18 datasets (test)	Accuracy (Avg Rank)2.6944	8
Classification	Lenses	Accuracy90	8
Classification	zoo	Accuracy95.09	8
Classification	Iris	Accuracy97.33	8
Classification	SEEDS	Accuracy93.81	8

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord