Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

About

Modern large language models (LLMs) such as GPT, Claude, and Gemini have transformed the way we learn, work, and communicate. Yet, their ability to produce highly human-like text raises serious concerns about misinformation and academic integrity, making it an urgent need for reliable algorithms to detect LLM-generated content. In this paper, we start by presenting a geometric approach to demystify rewrite-based detection algorithms, revealing their underlying rationale and demonstrating their generalization ability. Building on this insight, we introduce a novel rewrite-based detection algorithm that adaptively learns the distance between the original and rewritten text. Theoretically, we demonstrate that employing an adaptively learned distance function is more effective for detection than using a fixed distance. Empirically, we conduct extensive experiments with over 100 settings, and find that our approach demonstrates superior performance over baseline algorithms in the majority of scenarios. In particular, it achieves relative improvements from 57.8\% to 80.6\% over the strongest baseline across different target LLMs (e.g., GPT, Claude, and Gemini).

Hongyi Zhou, Jin Zhu, Erhan Xu, Kai Ye, Ying Yang, Chengchun Shi• 2026

Related benchmarks

TaskDatasetResultRank
AI-generated text detectionAcademicResearch
AUC99.5
36
AI-generated text detectionBusiness
AUC98.5
24
AI-generated text detectionCode
AUC0.979
24
AI-generated text detectionFinance
AUC0.998
24
AI-generated text detectionLegalDocument
AUC1
24
AI-generated text detectionLiteratureCreativeWriting
AUC100
24
AI-generated text detectionNewsArticle
AUC99.9
24
AI-generated text detectionOnlineContent
AUC97.3
24
AI-generated text detectionPersonalCommunication
AUC0.95
24
AI-generated text detectionProductReview
AUC99.5
24
Showing 10 of 65 rows

Other info

Follow for update