Segment Any Events with Language

About

Scene understanding with free-form language has been widely explored within diverse modalities such as images, point clouds, and LiDAR. However, related studies on event sensors are scarce or narrowly centered on semantic-level understanding. We introduce SEAL, the first Semantic-aware Segment Any Events framework that addresses Open-Vocabulary Event Instance Segmentation (OV-EIS). Given the visual prompt, our model presents a unified framework to support both event segmentation and open-vocabulary mask classification at multiple levels of granularity, including instance-level and part-level. To enable thorough evaluation on OV-EIS, we curate four benchmarks that cover label granularity from coarse to fine class configurations and semantic granularity from instance-level to part-level understanding. Extensive experiments show that our SEAL largely outperforms proposed baselines in terms of performance and inference speed with a parameter-efficient architecture. In the Appendix, we further present a simple variant of our SEAL achieving generic spatiotemporal OV-EIS that does not require any visual prompts from users in the inference. Check out our project page in https://0nandon.github.io/SEAL

Seungjun Lee, Gim Hee Lee• 2026

Related benchmarks

Task	Dataset	Result
Event Instance Segmentation	DSEC Detection	AP17.8	12
Event Part Segmentation	DSEC Part	AP18.3	10
Class-agnostic Object Detection	DSEC-Detection (test)	AUC43.32	2

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord