MAGA-Bench: Machine-Augment-Generated Text via Alignment Detection Benchmark
About
Large Language Models (LLMs) alignment is constantly evolving. Machine-Generated Text (MGT) is becoming increasingly difficult to distinguish from Human-Written Text (HWT). This has exacerbated abuse issues such as fake news and online fraud. Fine-tuned detectors' generalization ability is highly dependent on dataset quality, and simply expanding the sources of MGT is insufficient. Further augment of generation process is required. According to HC-Var's theory, enhancing the alignment of generated text can not only facilitate attacks on existing detectors to test their robustness, but also help improve the generalization ability of detectors fine-tuned on it. Therefore, we propose \textbf{M}achine-\textbf{A}ugment-\textbf{G}enerated Text via \textbf{A}lignment (MAGA). MAGA's pipeline achieves comprehensive alignment from prompt construction to reasoning process, among which \textbf{R}einforced \textbf{L}earning from \textbf{D}etectors \textbf{F}eedback (RLDF), systematically proposed by us, serves as a key component. In our experiments, the RoBERTa detector fine-tuned on MAGA training set achieved an average improvement of 4.60\% in generalization detection AUC. MAGA Dataset caused an average decrease of 8.13\% in the AUC of the selected detectors, expecting to provide indicative significance for future research on the generalization detection ability of detectors.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine-generated text detection | SemEval2024-M4 (test) | AUC98.87 | 13 | |
| Machine-generated text detection | COLING2025 M4GT (val) | AUC88.4 | 13 | |
| Machine-generated text detection | MAGE COLING2025 (val) | AUC64.63 | 13 | |
| Machine-generated text detection | HC3 COLING2025 (val) | AUC98.14 | 13 | |
| Machine-generated text detection | S-M4-CN (test) | AUC99.41 | 6 | |
| Machine-generated text detection | C-M4GT-CN (test) | AUC99.42 | 6 | |
| Machine-generated text detection | C-HC3-CN (test) | AUC99.67 | 6 |