Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic Invariant Multi-view Clustering with Fully Incomplete Information

About

Robust multi-view learning with incomplete information has received significant attention due to issues such as incomplete correspondences and incomplete instances that commonly affect real-world multi-view applications. Existing approaches heavily rely on paired samples to realign or impute defective ones, but such preconditions cannot always be satisfied in practice due to the complexity of data collection and transmission. To address this problem, we present a novel framework called SeMantic Invariance LEarning (SMILE) for multi-view clustering with incomplete information that does not require any paired samples. To be specific, we discover the existence of invariant semantic distribution across different views, which enables SMILE to alleviate the cross-view discrepancy to learn consensus semantics without requiring any paired samples. The resulting consensus semantics remain unaffected by cross-view distribution shifts, making them useful for realigning/imputing defective instances and forming clusters. We demonstrate the effectiveness of SMILE through extensive comparison experiments with 13 state-of-the-art baselines on five benchmarks. Our approach improves the clustering accuracy of NoisyMNIST from 19.3\%/23.2\% to 82.7\%/69.0\% when the correspondences/instances are fully incomplete. The code could be accessed from https://pengxi.me.

Pengxin Zeng, Mouxing Yang, Yiding Lu, Changqing Zhang, Peng Hu, Xi Peng• 2023

Related benchmarks

TaskDatasetResultRank
Multimodal ClassificationBRCA (train test)
Accuracy80.5
36
Multimodal ClassificationROSMAP (train test)
Accuracy79.2
36
Multimodal ClassificationCUB (train test)
Accuracy0.886
36
Multimodal ClassificationFOOD101 UPMC (train test)
Accuracy91.3
36
Multimodal ClassificationBRCA multimodal noise η=10%, ε=5 original (test)
Accuracy45
9
Multimodal ClassificationROSMAP multimodal noise η=10%, ε=5 original (test)
Acc52.8
9
Multimodal ClassificationCUB multimodal noise η=10%, ε=5 original (test)
Accuracy49.2
9
Multimodal ClassificationUPMC FOOD101 multimodal noise η=10%, ε=5 original (test)
Accuracy50.7
9
Showing 8 of 8 rows

Other info

Follow for update