SERE: Structural Example Retrieval for Enhancing LLMs in Event Causality Identification

About

Event Causality Identification (ECI) requires models to determine whether a given pair of events in a context exhibits a causal relationship. While Large Language Models (LLMs) have demonstrated strong performance across various NLP tasks, their effectiveness in ECI remains limited due to biases in causal reasoning, often leading to overprediction of causal relationships (causal hallucination). To mitigate these issues and enhance LLM performance in ECI, we propose SERE, a structural example retrieval framework that leverages LLMs' few-shot learning capabilities. SERE introduces an innovative retrieval mechanism based on three structural concepts: (i) Conceptual Path Metric, which measures the conceptual relationship between events using edit distance in ConceptNet; (ii) Syntactic Metric, which quantifies structural similarity through tree edit distance on syntactic trees; and (iii) Causal Pattern Filtering, which filters examples based on predefined causal structures using LLMs. By integrating these structural retrieval strategies, SERE selects more relevant examples to guide LLMs in causal reasoning, mitigating bias and improving accuracy in ECI tasks. Extensive experiments on multiple ECI datasets validate the effectiveness of SERE. The source code is publicly available at https://github.com/DMIRLAB-Group/SERE.
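The retrieval idea in the abstract (rank candidate examples by a structural distance to the query, then pass the closest ones to the LLM as few-shot demonstrations) can be illustrated with a minimal sketch. This is not the SERE implementation: it swaps in a plain Levenshtein distance over sequences of labels as a simple stand-in for both the ConceptNet path metric and the tree edit distance, and it omits Causal Pattern Filtering entirely. The `Example` class, its `path` field, and `retrieve_examples` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Example:
    # Hypothetical structural signature of an annotated event pair,
    # e.g. dependency labels on the path between the two events.
    path: List[str]
    text: str
    causal: bool


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two label sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def retrieve_examples(query_path: Sequence[str],
                      pool: List[Example],
                      k: int = 2) -> List[Example]:
    """Return the k pool examples structurally closest to the query."""
    ranked = sorted(pool, key=lambda ex: edit_distance(query_path, ex.path))
    return ranked[:k]
```

A call such as `retrieve_examples(["nsubj", "ROOT", "obj"], pool, k=2)` returns the two pool entries whose label sequences need the fewest edits to match the query. A real system would replace the sequence distance with the metrics named in the abstract.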

Zhifeng Hao, Zhongjie Chen, Junhao Lu, Shengyin Yu, Guimin Hu, Keli Zhang, Ruichu Cai, Boyan Xu • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| Event causality identification | CTB (test) | Precision 84.3 | 5 |
| Event causality identification | ESC-intra (test) | Precision 75.5 | 5 |
| Event causality identification | ESC inter-sentence | Precision 78.8 | 3 |
| Event causality identification | ESC (full) | Precision 79.3 | 3 |
| Event causality identification | ESC intra-sentence | Precision 77.4 | 3 |
| Event causality identification | CTB | Precision 91.3 | 3 |
| Event causality identification | ESC inter (test) | Precision 66.7 | 2 |
