Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis

About

Fine-grained opinion analysis of text provides a detailed understanding of expressed sentiments, including the addressed entity. Although this level of detail is valuable, annotating opinions in datasets for model training requires considerable human effort and substantial cost, especially across diverse domains and real-world applications. To address this shortage of domain-specific labelled datasets, we explore the feasibility of LLMs as automatic annotators for fine-grained opinion analysis. We use a declarative annotation pipeline, an approach that reduces the variability of manual prompt engineering when using LLMs to identify fine-grained opinion spans in text. We also present a dedicated methodology for an LLM to adjudicate multiple labels and produce final annotations. We trial the pipeline with models of different sizes for the Aspect Sentiment Triplet Extraction (ASTE) and Aspect-Category-Opinion-Sentiment (ACOS) analysis tasks. In this work, we attempt to develop fully autonomous LLM-based annotators, but our results reveal an uneven picture characterised by a critical performance bifurcation: LLMs are reliable at the span level yet struggle to faithfully reproduce the relational structures that connect those spans. This suggests that LLMs are better positioned as high-fidelity annotation assistants and data augmentation tools to expand fine-grained opinion-annotated datasets, rather than replacing human annotators entirely.

Gaurav Negi, MA Waskow, John McCrae, Omnia Zayed, Paul Buitelaar• 2026

Related benchmarks

TaskDatasetResultRank
aspect sentiment triplet extractionLap14
F1 Score46.81
29
aspect sentiment triplet extractionRes 15
F1 Score62.51
20
Aspect Category Opinion Sentiment Quad PredictionLap-ACOS (test)
F1 Score18.99
16
aspect sentiment triplet extractionres 14
Precision68.81
12
aspect sentiment triplet extractionres16
Precision64.28
12
Aspect-Category-Opinion-Sentiment (ACOS) quadruple extractionRestaurant ACOS (test)
Precision41.12
12
Showing 6 of 6 rows

Other info

Follow for update