Semantic Invariant Multi-view Clustering with Fully Incomplete Information

About

Robust multi-view learning with incomplete information has received significant attention because incomplete correspondences and incomplete instances commonly affect real-world multi-view applications. Existing approaches rely heavily on paired samples to realign or impute defective ones, but this precondition cannot always be satisfied in practice due to the complexity of data collection and transmission. To address this problem, we present a novel framework called SeMantic Invariance LEarning (SMILE) for multi-view clustering with incomplete information that does not require any paired samples. Specifically, we discover the existence of an invariant semantic distribution across different views, which enables SMILE to alleviate the cross-view discrepancy and learn consensus semantics without requiring any paired samples. The resulting consensus semantics remain unaffected by cross-view distribution shifts, making them useful for realigning/imputing defective instances and forming clusters. We demonstrate the effectiveness of SMILE through extensive comparison experiments with 13 state-of-the-art baselines on five benchmarks. Our approach improves the clustering accuracy on NoisyMNIST from 19.3%/23.2% to 82.7%/69.0% when the correspondences/instances are fully incomplete. The code is available at https://pengxi.me.

Pengxin Zeng, Mouxing Yang, Yiding Lu, Changqing Zhang, Peng Hu, Xi Peng • 2023

Related benchmarks

Task	Dataset	Result
Multi-view Clustering	aloi	Accuracy 32.87	57
Multimodal Classification	BRCA (train test)	Accuracy 80.5	36
Multimodal Classification	ROSMAP (train test)	Accuracy 79.2	36
Multimodal Classification	CUB (train test)	Accuracy 0.886	36
Multimodal Classification	FOOD101 UPMC (train test)	Accuracy 91.3	36
Multi-view Clustering	Reuters dim10	Accuracy 46.36	14
Multi-view Clustering	yale_mtv	Accuracy 41.15	14
Multi-view Clustering	BDGP	Accuracy 57.56	14
Multi-view Clustering	3Sources	Accuracy 34.62	14
Multimodal Classification	BRCA multimodal noise η=10%, ε=5 original (test)	Accuracy 45	9

Showing 10 of 19 rows
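
The accuracy figures for the multi-view clustering rows are cluster-level scores. Because cluster IDs are arbitrary, such scores are commonly computed by finding the one-to-one mapping between clusters and ground-truth classes that maximizes the number of matched samples. The page does not state the exact evaluation protocol, so the following is only a minimal sketch of that common definition. The function name `clustering_accuracy` is hypothetical, and the brute-force search over mappings stands in for the Hungarian algorithm that is typically used for larger numbers of clusters.

```python
from itertools import permutations


def clustering_accuracy(true_labels, cluster_ids):
    """Fraction of samples matched under the best one-to-one cluster-to-class map."""
    if len(true_labels) != len(cluster_ids):
        raise ValueError("inputs must have the same length")
    if len(true_labels) == 0:
        raise ValueError("inputs must be non-empty")

    # Distinct classes and clusters, in order of first appearance.
    classes = list(dict.fromkeys(true_labels))
    clusters = list(dict.fromkeys(cluster_ids))

    # counts[(k, c)] = number of samples in cluster k whose true class is c.
    counts = {}
    for c, k in zip(true_labels, cluster_ids):
        counts[(k, c)] = counts.get((k, c), 0) + 1

    # If there are more clusters than classes, pad with dummy slots so the
    # surplus clusters are mapped to no class at all.
    n_dummy = max(0, len(clusters) - len(classes))
    slots = [("class", c) for c in classes] + [("dummy", i) for i in range(n_dummy)]

    # Brute force over every one-to-one assignment (only viable for few clusters).
    best = 0
    for assignment in permutations(slots, len(clusters)):
        matched = sum(
            counts.get((k, slot[1]), 0)
            for k, slot in zip(clusters, assignment)
            if slot[0] == "class"
        )
        best = max(best, matched)
    return best / len(true_labels)


# Toy example: cluster IDs are permuted relative to the classes, and one
# sample of class 2 was placed in the wrong cluster, so 5 of 6 samples match.
true = [0, 0, 1, 1, 2, 2]
pred = [1, 1, 0, 0, 2, 0]
print(clustering_accuracy(true, pred))
```

The search is factorial in the number of clusters, so this sketch is meant for illustration on small label sets rather than for benchmark-scale evaluation.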
