StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis

About

The emergence of large language models (LLMs) capable of generating realistic texts and images has sparked ethical concerns across various sectors. In response, researchers in academia and industry are actively exploring methods to distinguish AI-generated content from human-authored material. However, a crucial question remains: What are the unique characteristics of AI-generated text? Addressing this gap, this study proposes StyloAI, a data-driven model that uses 31 stylometric features to identify AI-generated texts by applying a Random Forest classifier on two multi-domain datasets. StyloAI achieves accuracy rates of 81% and 98% on the test set of the AuTextification dataset and the Education dataset, respectively. This approach surpasses the performance of existing state-of-the-art models and provides valuable insights into the differences between AI-generated and human-authored texts.

Chidimma Opara • 2024
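
To make the idea of stylometric features concrete, the following is a minimal sketch of how a handful of such features might be computed from raw text. The function name `stylometric_features` and the five features shown are assumptions chosen for illustration; they are not the 31 features used by StyloAI, which this page does not list.

```python
import re
import string


def stylometric_features(text: str) -> dict[str, float]:
    """Compute a few illustrative stylometric features for one text.

    Hypothetical feature set for illustration only; it is not the
    31-feature set described in the abstract.
    """
    # Lowercased alphabetic tokens (apostrophes kept for contractions).
    words = re.findall(r"[A-Za-z']+", text.lower())
    # Naive sentence split on terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]

    # Guard the denominators so empty input does not divide by zero.
    n_words = len(words) or 1
    n_sents = len(sentences) or 1
    n_chars = len(text) or 1

    return {
        "word_count": float(len(words)),
        "avg_word_length": sum(len(w) for w in words) / n_words,
        "avg_sentence_length": len(words) / n_sents,
        "type_token_ratio": len(set(words)) / n_words,
        "punctuation_ratio": sum(c in string.punctuation for c in text) / n_chars,
    }


sample = "The cat sat. The cat ran!"
features = stylometric_features(sample)
for name, value in features.items():
    print(f"{name}: {value:.3f}")
```

Feature vectors of this kind, computed per document and paired with human/AI labels, could then be passed to a Random Forest implementation such as scikit-learn's `RandomForestClassifier`. That pairing is an assumption about tooling, not something this page confirms.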

Related benchmarks

Task                                Dataset        Result            Rank
AIGT Classification (binary)        RedNote-Vibe   Precision 75.23   11
Model Identification (17-way)       RedNote-Vibe   Precision 21.68   11
Provider Identification (6-way)     RedNote-Vibe   Precision 37.33   11