DDNet: A Dual-Stream Graph Learning and Disentanglement Framework for Temporal Forgery Localization

About

The rapid evolution of AIGC technology enables misleading viewers by tampering mere small segments within a video, rendering video-level detection inaccurate and unpersuasive. Consequently, temporal forgery localization (TFL), which aims to precisely pinpoint tampered segments, becomes critical. However, existing methods are often constrained by \emph{local view}, failing to capture global anomalies. To address this, we propose a \underline{d}ual-stream graph learning and \underline{d}isentanglement framework for temporal forgery localization (DDNet). By coordinating a \emph{Temporal Distance Stream} for local artifacts and a \emph{Semantic Content Stream} for long-range connections, DDNet prevents global cues from being drowned out by local smoothness. Furthermore, we introduce Trace Disentanglement and Adaptation (TDA) to isolate generic forgery fingerprints, alongside Cross-Level Feature Embedding (CLFE) to construct a robust feature foundation via deep fusion of hierarchical features. Experiments on ForgeryNet and TVIL benchmarks demonstrate that our method outperforms state-of-the-art approaches by approximately 9\% in AP@0.95, with significant improvements in cross-domain robustness.

Boyang Zhao, Xin Liao, Jiaxin Chen, Xiaoshuai Wu, Yufeng Wu• 2026

Related benchmarks

Task	Dataset	Result
Temporal Forgery Localization	TVIL 5 (test)	AP@0.511.72	11
Temporal Forgery Localization	ForgeryNet Balanced (test)	AP@0.5151	6
Temporal Forgery Localization	ForgeryNet (Balanced)	AP@0.574.89	5
Temporal Forgery Localization	TVIL Inpainting	AP @ IoU=0.581.64	5
Temporal Forgery Localization	ForgeryNet (Standard)	AP @ IoU=0.500.8759	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord