Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Early Detection of Misinformation for Infodemic Management: A Domain Adaptation Approach

About

An infodemic refers to an enormous amount of true information and misinformation disseminated during a disease outbreak. Detecting misinformation at the early stage of an infodemic is key to reduce its harm to public health. An early stage infodemic is characterized by a large volume of unlabeled information concerning a disease. As a result, conventional misinformation detection methods are not suitable for this misinformation detection task because they rely on labeled information in the infodemic domain to train their models. To address this limitation, state-of-the-art methods learn their models using labeled information in other domains to detect misinformation in the infodemic domain. The efficacy of these methods depends on their ability to mitigate both covariate shift (i.e., differences in feature distributions) and concept shift (i.e., differences in labeling patterns) between the infodemic domain and the domains from which they leverage labeled information. However, these methods focus on mitigating covariate shift but overlook concept shift, rendering them less effective for the task. In response, we theoretically show the necessity of tackling both covariate and concept shifts as well as how to operationalize each of them. Built on the theoretical analysis, we develop a novel misinformation detection method that addresses both covariate and concept shifts. Using real-world datasets, we conduct extensive empirical evaluations to demonstrate the superior performance of our method over state-of-the-art misinformation detection methods as well as prevalent domain adaptation methods that can be tailored to solve the misinformation detection task.

Minjia Mao, Xiaohang Zhao, Xiao Fang• 2024

Related benchmarks

TaskDatasetResultRank
Early Misinformation DetectionCOVID Domain 150 Labeled News Articles
Precision88.8
12
Misinformation DetectionCOVID Domain News (50 labeled articles)
Precision81.9
10
Misinformation DetectionCOVID Domain 1.0 (100 Labeled News Articles)
Precision86.5
10
Misinformation DetectionConstraint Dataset
Precision76.8
9
Misinformation DetectionChinese News Dataset
Precision0.822
9
Misinformation DetectionFinancial Misinformation infodemic domain
Precision80.4
9
Misinformation DetectionCOVID news domain (test)
Precision71.6
9
Comment Count PredictionCOVID dataset
RMSE18.548
7
Misinformation DetectionMisinformation Detection Dataset (test)
Precision71.6
4
Early Misinformation DetectionCOVID news articles 2020-04-01 to 2020-06-30 (Stage 2)
Precision72.1
2
Showing 10 of 10 rows

Other info

Follow for update