HC3

Benchmarks

Task Name	Dataset Name	Metric	SOTA Result
AIGT detection	HC3 PWWS attack, AI to Human (in-domain)	Overall Accuracy	100	28
AI Text Detection	HC3 AI-Prob	Finance Accuracy	84.1	24
AI-generated text detection	HC3 (test)	F1 (Overall)	99.93	18
AIGT detection	HC3 Pruthi attack Overall (in-domain)	Overall Accuracy	100	14
AIGT detection	HC3 Pruthi attack Human to AI (in-domain)	OA	1	14
AIGT detection	HC3 Pruthi attack AI to Human (in-domain)	Overall Accuracy	100	14
AIGT detection	HC3 Deep-Word-Bug attack Overall (in-domain)	OA	100	14
AIGT detection	HC3 Deep-Word-Bug attack Human to AI (in-domain)	Overall Accuracy	100	14
AIGT detection	HC3 Deep-Word-Bug attack AI to Human (in-domain)	Overall Accuracy	100	14
AIGT detection	HC3 PWWS attack, Human to AI (in-domain)	OA	100	14
Machine-Generated Text Detection	HC3 COLING2025 (val)	AUC	99.99	13
LLM-generated text detection	HC3 Gemini-3.1 pro preview generated text	ROC AUC	0.813	10
LLM-generated text detection	HC3 Gemini-3 flash preview generated text	ROC AUC	85.3	10
LLM-generated text detection	HC3 Gemini-3.1 flash-lite preview generated text	ROC AUC	0.86	10
LLM-generated text detection	HC3 GPT-5.4 mini generated text	ROC AUC	0.967	10
LLM-generated text detection	HC3 GPT-5.4 generated text	ROC AUC	0.974	10
AI-text detection	HC3	AUROC	100	10
Machine-Generated Text Detection	HC3 (full)	Accuracy	99.92	9
LLM-Generated Text Detection	HC3 Plus (test)	AUROC	0.9845	5
LLM-Generated Text Detection	HC3 (test)	AUROC	0.9947	5
AI-generated text detection	HC3 cross-benchmark transfer	AUROC (Finance)	99.8	3
LLM-generated text detection	HC3 Chi-Psy	TPR @ FPR=1%	90	3
Machine-generated text detection	HC3 Cross-domain	Score (Medicine)	80.48	3
AI-Text Detection	HC3 PLUS (test_si)	Balanced Accuracy (BA)	87.21	2
AI-Text Detection	HC3 PLUS QA (test)	Balanced Accuracy	99.47	2

Showing 25 of 27 rows