Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Machine-generated text detection prevents language model collapse

About

As Large Language Models (LLMs) become increasingly prevalent, their generated outputs are proliferating across the web, risking a future where machine-generated content dilutes human-authored text. Since online data is the primary resource for LLM pre-training, subsequent models could be trained on an unknown portion of synthetic samples. This could lead to model collapse, a degenerative process whereby LLMs reinforce their own errors, reduce output diversity, and ultimately yield declining performance. In this study, we investigate the impact of decoding strategy on model collapse, analysing the text characteristics at each model generation, the similarity to human references, and the resulting model performance. Using the decoding strategies that lead to the most significant degradation, we evaluate model collapse in a more realistic scenario where the origin of the data (human or synthetic) is unknown. We train a machine-generated text detector and propose an importance resampling approach to prevent model collapse by up-sampling likely human content in the training data. Our method is validated on four LLMs from two model families (GPT-2 and SmolLM2), across a range of model sizes 124M to 1.7B). We demonstrate that it not only prevents model collapse but also improves performance compared to training on purely human data, underscoring the benefit of synthetic samples and the importance of data curation.

George Drayson, Emine Yilmaz, Vasileios Lampos• 2025

Related benchmarks

TaskDatasetResultRank
Machine-generated text detectionMAGE
AUROC (Avg)97.9
24
LLM-generated text detectionDetectRL--
12
AI Text DetectionM4GT
AUROC76.9
10
AI Text DetectionGhostbuster
AUROC94.8
10
AI Text DetectionMELD (eval)
AUROC99.7
10
LLM-generated text detectionMELD GPT-5.4-Mini (eval)
TPR @ 1% FPR97.2
10
LLM-generated text detectionMELD-eval Gemini-3-Flash
TPR@1%FPR93
10
LLM-generated text detectionMELD-eval Claude-Haiku-4.5
TPR @ 1% FPR98.2
10
LLM-generated text detectionMELD Qwen-3.6-Plus (eval)
TPR @ 1% FPR93.7
10
LLM-generated text detectionMELD Overall (eval)
TPR @ 1% FPR95.5
10
Showing 10 of 13 rows

Other info

Follow for update