Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WhoSaidIt: Human-LLM Collaborative Annotation for Text-Based Multilingual Speaker-Attribute Classification

About

Annotating speaker attributes from text is inherently ambiguous, particularly in multilingual settings where demographic and social cues are implicit and culturally variable. We propose a human-large language model (LLM) collaborative re-annotation framework for stabilizing multilingual speaker-attribute labels under practical resource constraints. Starting from a noisy corpus, we use LLMs to surface recurring annotation rationales through iterative interaction with experts, and apply disagreement-focused sampling for targeted re-annotation. Using this framework, we construct WhoSaidIt, a multilingual dataset covering nine speaker-attribute labels. We quantify divergence between original and revised annotations, benchmark recent LLMs, and analyze the effect of explicit rationales on model behavior. Our results reveal substantial cross-lingual differences in annotation decisions and demonstrate both the strengths and limitations of LLMs in speaker-attribute classification.

Lingyu Gao, Will Monroe, David Smith, Meghan Jemison, Jackie Lee• 2026

Related benchmarks

TaskDatasetResultRank
Attribute ClassificationWhoSaidIt public release
Accuracy (EN)100
10
Speaker-attribute classificationIntermediate Dataset original corpus labels (test)--
9
Speaker-attribute classificationWhoSaidIt re-annotated (test)--
8
Speaker-attribute classificationWhoSaidIt English public release
Accuracy (Male)100
1
Speaker-attribute classificationWhoSaidIt Spanish public release
Accuracy (Male)87
1
Speaker-attribute classificationWhoSaidIt Italian public release
Attribute Accuracy: Male87.9
1
Speaker-attribute classificationWhoSaidIt Korean public release subset
Male Accuracy89.9
1
Speaker-attribute classificationWhoSaidIt Chinese public release
Accuracy (Male)94.9
1
Speaker-attribute classificationWhoSaidIt Pooled Languages public release
Accuracy (Male)92
1
Showing 9 of 9 rows

Other info

Follow for update