Reasoning-Aware AIGC Detection via Alignment and Reinforcement
About
The rapid advancement and widespread adoption of Large Language Models (LLMs) have elevated the need for reliable AI-generated content (AIGC) detection, which remains challenging as models evolve. We introduce AIGC-text-bank, a comprehensive multi-domain dataset with diverse LLM sources and authorship scenarios, and propose REVEAL, a detection framework that generates interpretable reasoning chains before classification. Our approach uses a two-stage training strategy: supervised fine-tuning to establish reasoning capabilities, followed by reinforcement learning to improve accuracy, improve logical consistency, and reduce hallucinations. Extensive experiments show that REVEAL achieves state-of-the-art performance across multiple benchmarks, offering a robust and transparent solution for AIGC detection. The project is open-source at https://aka.ms/reveal
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Binary AIGC Detection | DetectRL | Accuracy97.2 | 12 | |
| Binary AIGC Detection | Pan | Accuracy88.8 | 12 | |
| Binary AIGC Detection | AIGC-bench | Accuracy96.3 | 12 | |
| Binary AIGC Detection | LOKI | Accuracy95.6 | 12 | |
| Binary AIGC Detection | M4 | Accuracy77.86 | 12 | |
| AIGC Detection | M4 (test) | Accuracy97.33 | 3 | |
| AIGC Detection | DetectRL (test) | Accuracy75.2 | 3 | |
| AIGC Detection | Pan (test) | Accuracy49.07 | 3 | |
| Fine-grained AIGC Detection | AIGC-bench | Accuracy70.74 | 3 |