Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Segment Any Events with Language

About

Scene understanding with free-form language has been widely explored within diverse modalities such as images, point clouds, and LiDAR. However, related studies on event sensors are scarce or narrowly centered on semantic-level understanding. We introduce SEAL, the first Semantic-aware Segment Any Events framework that addresses Open-Vocabulary Event Instance Segmentation (OV-EIS). Given the visual prompt, our model presents a unified framework to support both event segmentation and open-vocabulary mask classification at multiple levels of granularity, including instance-level and part-level. To enable thorough evaluation on OV-EIS, we curate four benchmarks that cover label granularity from coarse to fine class configurations and semantic granularity from instance-level to part-level understanding. Extensive experiments show that our SEAL largely outperforms proposed baselines in terms of performance and inference speed with a parameter-efficient architecture. In the Appendix, we further present a simple variant of our SEAL achieving generic spatiotemporal OV-EIS that does not require any visual prompts from users in the inference. Check out our project page in https://0nandon.github.io/SEAL

Seungjun Lee, Gim Hee Lee• 2026

Related benchmarks

TaskDatasetResultRank
Event Instance SegmentationDSEC Detection
AP17.8
12
Event Part SegmentationDSEC Part
AP18.3
10
Class-agnostic Object DetectionDSEC-Detection (test)
AUC43.32
2
Showing 3 of 3 rows

Other info

Follow for update