Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis

About

Fine-grained opinion analysis of text provides a detailed understanding of expressed sentiments, including the addressed entity. Although this level of detail is valuable, annotating opinions in datasets for model training requires considerable human effort and substantial cost, especially across diverse domains and real-world applications. To address this shortage of domain-specific labelled datasets, we explore the feasibility of LLMs as automatic annotators for fine-grained opinion analysis. We use a declarative annotation pipeline, an approach that reduces the variability of manual prompt engineering when using LLMs to identify fine-grained opinion spans in text. We also present a dedicated methodology for an LLM to adjudicate multiple labels and produce final annotations. We trial the pipeline with models of different sizes for the Aspect Sentiment Triplet Extraction (ASTE) and Aspect-Category-Opinion-Sentiment (ACOS) analysis tasks. In this work, we attempt to develop fully autonomous LLM-based annotators, but our results reveal an uneven picture characterised by a critical performance bifurcation: LLMs are reliable at the span level yet struggle to faithfully reproduce the relational structures that connect those spans. This suggests that LLMs are better positioned as high-fidelity annotation assistants and data augmentation tools to expand fine-grained opinion-annotated datasets, rather than replacing human annotators entirely.

Gaurav Negi, MA Waskow, John McCrae, Omnia Zayed, Paul Buitelaar• 2026

Related benchmarks

Task	Dataset	Result
aspect sentiment triplet extraction	Lap14	F1 Score46.81	29
aspect sentiment triplet extraction	Res 15	F1 Score62.51	20
Aspect Category Opinion Sentiment Quad Prediction	Lap-ACOS (test)	F1 Score18.99	16
aspect sentiment triplet extraction	res 14	Precision68.81	12
aspect sentiment triplet extraction	res16	Precision64.28	12
Aspect-Category-Opinion-Sentiment (ACOS) quadruple extraction	Restaurant ACOS (test)	Precision41.12	12

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord