Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TalkTag: Fine-Grained Morphosyntactic Error Annotation for Transcribed Speech

About

Fine-grained morphosyntactic error annotation is important in clinical and developmental language research, yet it is labour-intensive, expert-dependent, and difficult to scale. We present TalkTag, an LLM-based lightweight tool fine-tuned to automate CHAT-style error annotation in spoken-language transcripts. Developed under conditions of extreme data scarcity using children's narrative data, the system shows the feasibility of linguistic analysis in low-resource settings. Our evaluation demonstrates that TalkTag produces encouragingly precise annotation while effectively identifying instances where linguistic ambiguity makes automated tagging genuinely complex. In summary, with TalkTag, we provide a scalable alternative to manual error annotation and practically viable support for morphosyntactic error annotation.

Shamira Venturini, Oliver Hennh\"ofer, Steffen Kinkel, Jannik Str\"otgen (2) __INSTITUTION_4__ Karlsruhe Institute of Technology, (2) Karlsruhe University of Applied Sciences)• 2026

Related benchmarks

TaskDatasetResultRank
Morphosyntactic Error AnnotationENNI tagged utterances raw (test)
Exact Match (EM)82.8
2
Morphosyntactic Error AnnotationENNI raw (val)
EM95.4
1
Morphosyntactic Error AnnotationENNI raw (test)
Exact Match (EM)93.6
1
Showing 3 of 3 rows

Other info

Follow for update