Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information

About

Streaming sources of data are becoming more common as the ability to collect data in real-time grows. A major concern in dealing with data streams is concept drift, a change in the distribution of data over time, for example, due to changes in environmental conditions. Representing concepts (stationary periods featuring similar behaviour) is a key idea in adapting to concept drift. By testing the similarity of a concept representation to a window of observations, we can detect concept drift to a new or previously seen recurring concept. Concept representations are constructed using meta-information features, values describing aspects of concept behaviour. We find that previously proposed concept representations rely on small numbers of meta-information features. These representations often cannot distinguish concepts, leaving systems vulnerable to concept drift. We propose FiCSUM, a general framework to represent both supervised and unsupervised behaviours of a concept in a fingerprint, a vector of many distinct meta-information features able to uniquely identify more concepts. Our dynamic weighting strategy learns which meta-information features describe concept drift in a given dataset, allowing a diverse set of meta-information features to be used at once. FiCSUM outperforms state-of-the-art methods over a range of 11 real world and synthetic datasets in both accuracy and modeling underlying concept drift.

Ben Halstead, Yun Sing Koh, Patricia Riddle, Mykola Pechenizkiy, Albert Bifet, Russel Pears• 2026

Related benchmarks

TaskDatasetResultRank
Drift Detection and Model SelectionRTREE-U
κ Statistic0.83
10
Drift Detection and Model SelectionCMC
Kappa (κ)0.3
10
Drift Detection and Model SelectionUCI Wine
Kappa0.26
10
Drift Detection and Model SelectionHPLANE-U
Kappa Statistic0.44
10
Drift Detection and Model SelectionSTAGGER
Kappa0.98
10
Concept Drift DetectionAQSex
Kappa0.95
6
Concept Drift DetectionRBF
Kappa0.81
6
Concept Drift DetectionArabic
Kappa0.9
6
Concept Drift DetectionQG
Kappa Statistic0.84
6
Drift Detection and Model SelectionAQSex
Kappa Statistic0.94
4
Showing 10 of 15 rows

Other info

Follow for update