Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection Benchmark

About

Large Language Models (LLMs) alignment is constantly evolving. Machine-Generated Text (MGT) is becoming increasingly difficult to distinguish from Human-Written Text (HWT). This has exacerbated abuse issues such as fake news and online fraud. Fine-tuned detectors' generalization ability is highly dependent on dataset quality, and simply expanding the sources of MGT is insufficient. Further augment of generation process is required. According to HC-Var's theory, enhancing the alignment of generated text can not only facilitate attacks on existing detectors to test their robustness, but also help improve the generalization ability of detectors fine-tuned on it. Therefore, we propose \textbf{M}achine-\textbf{A}ugment-\textbf{G}enerated Text via \textbf{A}lignment (MAGA). MAGA's pipeline achieves comprehensive alignment from prompt construction to reasoning process, among which \textbf{R}einforced \textbf{L}earning from \textbf{D}etectors \textbf{F}eedback (RLDF), systematically proposed by us, serves as a key component. In our experiments, the RoBERTa detector fine-tuned on MAGA training set achieved an average improvement of 4.60\% in generalization detection AUC. MAGA Dataset caused an average decrease of 8.13\% in the AUC of the selected detectors, expecting to provide indicative significance for future research on the generalization detection ability of detectors.

Anyang Song, Ying Cheng, Yiqian Xu, Rui Feng• 2026

Related benchmarks

TaskDatasetResultRank
Machine-generated text detectionSemEval2024-M4 (test)
AUC98.87
13
Machine-generated text detectionCOLING2025 M4GT (val)
AUC88.4
13
Machine-generated text detectionMAGE COLING2025 (val)
AUC64.63
13
Machine-generated text detectionHC3 COLING2025 (val)
AUC98.14
13
Machine-generated text detectionS-M4-CN (test)
AUC99.41
6
Machine-generated text detectionC-M4GT-CN (test)
AUC99.42
6
Machine-generated text detectionC-HC3-CN (test)
AUC99.67
6
Showing 7 of 7 rows

Other info

Follow for update