Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ToMMeR -- Efficient Entity Mention Detection from Large Language Models

About

Identifying which text spans refer to entities - mention detection - is both foundational for information extraction and a known performance bottleneck. We introduce ToMMeR, a lightweight model (<300K parameters) probing mention detection capabilities from early LLM layers. Across 13 NER benchmarks, ToMMeR achieves 93% recall zero-shot, with an estimated 90% precision under a human-calibrated LLM-judge protocol, showing that ToMMeR rarely produces spurious predictions despite high recall. Cross-model analysis reveals that diverse architectures (14M-15B parameters) converge on similar mention boundaries (DICE >75%), confirming that mention detection emerges naturally from language modeling. When extended with span classification heads, ToMMeR achieves competitive NER performance (80-87% F1 on standard benchmarks). Our work provides evidence that structured entity representations exist in early transformer layers and can be efficiently recovered with minimal parameters.

Victor Morand, Nadi Tomeh, Josiane Mothe, Benjamin Piwowarski• 2025

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 03--
135
Named Entity RecognitionmultiNERD
Entity F145.5
50
Entity Mention DetectionACE05 (351/80/80)
Precision31.9
14
Named Entity RecognitionCrossNER AI
F1 Score64.5
12
Named Entity RecognitionGENIA
Micro-F170.1
8
Named Entity RecognitionNCBI
Micro-F182.1
8
Named Entity RecognitionOntoNotes
Micro-F185.4
7
Mention DetectionConll 2003
Recall94.8
2
Mention DetectionCrossNER Politics
Recall97
2
Mention DetectionCrossNER literature
Recall94.4
2
Showing 10 of 24 rows

Other info

Follow for update