Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Lightweight Stylistic Consistency Profiling: Robust Detection of LLM-Generated Textual Content for Multimedia Moderation

About

The increasing prevalence of Large Language Models (LLMs) in content creation has made distinguishing human-written textual content from LLM-generated counterparts a critical task for multimedia moderation. Existing detectors often rely on statistical cues or model-specific heuristics, making them vulnerable to paraphrasing and adversarial manipulations, and consequently limiting their robustness and interpretability. In this work, we proposeLiSCP , a novel lightweight stylistic consistency profiling method for robust detection of LLM-generated textual content, focusing on feature stability under adversarial manipulation. Our approach constructs a consistency profile that combines discrete stylistic features with continuous semantic signals, leveraging stylistic stability across multimodal-guided paraphrased text variants. Experiments spanning real-world multimedia news and movie datasets and conventional text domains demonstrate that LiSCP achieves superior performance on in-domain detection and outperforms existing approaches by up to 11.79% in cross-domain settings. Additionally,it demonstrates notable robustness under adversarial scenarios, including adversarial attacks and hybrid human-AI settings.

Siyuan Li, Aodu Wulianghai, Xi Lin, Xibin Yuan, Qinghua Mao, Guangyan Li, Xiang Chen, Jun Wu, Jianhua Li• 2026

Related benchmarks

TaskDatasetResultRank
LLM-generated content detectionStudent Essay IvyPanda
AUROC0.9455
11
LLM-generated content detectionYelp Review
AUROC0.8718
11
LLM-generated content detectionVisualNews
AUROC0.9746
11
LLM-generated content detectionMM-IMDB
AUROC95.76
11
Machine-generated text detectionPaper Abstract (test)
F1 Score91.54
11
LLM-generated content detectionHumanEval
AUROC0.8108
11
LLM-generated content detectionReuter News 50_50
AUROC0.9356
11
Machine-generated text detectionHumanEval (test)
F1 Score83.33
3
Machine-generated text detectionStudent Essay (test)
F1 Score89.27
3
Machine-generated text detectionYelp Review (test)
F1 Score0.8083
3
Showing 10 of 11 rows

Other info

Follow for update