Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions

About

Multimodal regression is a fundamental task, which integrates the information from different sources to improve the performance of follow-up applications. However, existing methods mainly focus on improving the performance and often ignore the confidence of prediction for diverse situations. In this study, we are devoted to trustworthy multimodal regression which is critical in cost-sensitive domains. To this end, we introduce a novel Mixture of Normal-Inverse Gamma distributions (MoNIG) algorithm, which efficiently estimates uncertainty in principle for adaptive integration of different modalities and produces a trustworthy regression result. Our model can be dynamically aware of uncertainty for each modality, and also robust for corrupted modalities. Furthermore, the proposed MoNIG ensures explicitly representation of (modality-specific/global) epistemic and aleatoric uncertainties, respectively. Experimental results on both synthetic and different real-world data demonstrate the effectiveness and trustworthiness of our method on various multimodal regression tasks (e.g., temperature prediction for superconductivity, relative location prediction for CT slices, and multimodal sentiment analysis).

Huan Ma, Zongbo Han, Changqing Zhang, Huazhu Fu, Joey Tianyi Zhou, Qinghua Hu• 2021

Related benchmarks

TaskDatasetResultRank
Multimodal Sentiment AnalysisCMU-MOSI (test)
F179.1
238
OOD DetectionCT Slices (test)
AUROC87
40
Sentiment AnalysisCMU-MOSEI (test)
Acc (2-class)80.2
40
OOD DetectionSuperconductivity (test)
AU-ROC (AU)0.887
24
Multimodal Sentiment AnalysisCMU-MOSI Word Aligned (test)
Accuracy (7-Class)34.1
21
Multimodal Sentiment AnalysisCMU-MOSEI Unaligned (test)
Accuracy (2-Class)81.7
18
Multimodal Sentiment AnalysisCMU-MOSEI Word Aligned (test)
Accuracy (7-Class)50.2
14
Multimodal regressionSuperconductivity (test)
RMSE11.7
13
Multimodal Sentiment AnalysisCMU-MOSI Unaligned (test)
Accuracy (7-Class)35.8
13
Relative Location PredictionCT Slices (test)
RMSE0.79
8
Showing 10 of 14 rows

Other info

Code

Follow for update