Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Validation of a Small Language Model for DSM-5 Substance Category Classification in Child Welfare Records

About

Background: Recent studies have demonstrated that large language models (LLMs) can perform binary classification tasks on child welfare narratives, detecting the presence or absence of constructs such as substance-related problems, domestic violence, and firearms involvement. Whether smaller, locally deployable models can move beyond binary detection to classify specific substance types from these narratives remains untested. Objective: To validate a locally hosted LLM classifier for identifying specific substance types aligned with DSM-5 categories in child welfare investigation narratives. Methods: A locally hosted 20-billion-parameter LLM classified child maltreatment investigation narratives from a Midwestern U.S. state. Records previously identified as containing substance-related problems were passed to a second classification stage targeting seven DSM-5 substance categories. Expert human review of 900 stratified cases assessed classification precision, recall, and inter-method reliability (Cohen's kappa). Test-retest stability was evaluated using approximately 15,000 independently classified records. Results: Five substance categories achieved almost perfect inter-method agreement (kappa = 0.94-1.00): alcohol, cannabis, opioid, stimulant, and sedative/hypnotic/anxiolytic. Classification precision ranged from 92% to 100% for these categories. Two low-prevalence categories (hallucinogen, inhalant) performed poorly. Test-retest agreement ranged from 92.1% to 99.1% across the seven categories. Conclusions: A small, locally hosted LLM can reliably classify substance types from child welfare administrative text, extending prior work on binary classification to multi-label substance identification.

Brian E. Perron, Dragan Stoll, Bryan G. Victor, Zia Qia, Andreas Jud, Joseph P. Ryan• 2026

Related benchmarks

TaskDatasetResultRank
Information ExtractionSubstance Alcohol DSM-5 (val)
Extraction Precision99.3
1
Information ExtractionDSM-5 Substance Cannabis (val)
Precision0.989
1
Information ExtractionDSM-5 Substance Opioid (val)
Precision98.1
1
Information ExtractionSubstance Stimulant DSM-5 (val)
Precision94.6
1
Information ExtractionSubstance Sedative/Hypnotic/Anxiolytic DSM-5 (val)
Precision92.8
1
Information ExtractionDSM-5 Substance Hallucinogen (val)
Extraction Precision75.5
1
Information ExtractionSubstance Inhalant DSM-5 (val)
Extraction Precision58.6
1
Substance ClassificationSubstance Alcohol DSM-5 (val)
Precision100
1
Substance ClassificationDSM-5 Substance Cannabis (val)
Precision99
1
Substance ClassificationSubstance Opioid DSM-5 (val)
Precision (%)100
1
Showing 10 of 14 rows

Other info

Follow for update