Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Boundary-Aware Non-parametric Granular-Ball Classifier Based on Minimum Description Length

About

Existing granular-ball classification methods are often driven by handcrafted quality measures, neighborhood rules, or heuristic splitting and stopping criteria, which may reduce the transparency of local construction decisions and hinder explicit modeling of boundary-sensitive regions. To address this issue, this paper proposes a Minimum Description Length based Granular-Ball Classifier (MDL-GBC), a boundary-aware non-parametric and interpretable granular-ball classifier. MDL-GBC formulates class-conditional granular-ball construction as a local model selection problem under the Minimum Description Length principle. For each class, samples from the target class provide positive class evidence, while samples from the remaining classes provide negative boundary evidence. For each current granular ball, three candidate explanations are compared under a unified description-length criterion: a single-ball model, a two-ball model, and a core-boundary model. The selected model determines whether the ball is retained, geometrically split, or refined into core and boundary-sensitive child balls, thereby making local construction decisions consistent with the MDL-based classification mechanism. During prediction, a class-level mixture coding rule aggregates stable granular balls of the same class and assigns the test sample by comparing class-wise coding costs. Experiments on 18 benchmark datasets show that MDL-GBC achieves competitive classification performance against classical classifiers and representative granular-ball-based methods, obtaining the best average Accuracy, Macro-F1, and average rank. These results indicate that MDL-GBC provides an effective and interpretable alternative to conventional heuristic granular-ball classification strategies.

Zeqiang Xian, Caihui Liu, Yong Zhang, Wenjing Qiu, Duoqian Miao, Witold Pedrycz• 2026

Related benchmarks

TaskDatasetResultRank
ClassificationHeart--
24
ClassificationThyroid
Accuracy94.83
23
ClassificationWine
Accuracy96.67
13
ClassificationSelected 18 datasets (test)
Accuracy (Avg Rank)2.6944
8
ClassificationLenses
Accuracy90
8
Classificationzoo
Accuracy95.09
8
ClassificationIris
Accuracy97.33
8
ClassificationSEEDS
Accuracy93.81
8
ClassificationLIBRAS
Accuracy81.67
8
ClassificationBalance
Accuracy90.24
8
Showing 10 of 19 rows

Other info

Follow for update