Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

{\alpha}Depth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

About

Accurately modeling soft boundaries, e.g., hair and defocus blur, is a fundamental challenge in stereo conversion due to the ambiguous blending of foreground and background. Existing depth models primarily predict single-layer depth, leading to ambiguity in depth correspondence at soft boundaries. While matting techniques can capture opacity for layered modeling, they often struggle in complex scenes with multiple targets and usually require user intervention. This paper introduces {\alpha}Depth, a layered representation that decomposes soft boundaries for high-fidelity stereo conversion. Specifically, we first resolve mixed color and depth ambiguity by estimating layered color and depth values at soft boundaries. Considering complex multi-target scenes, we design a Circular Alpha Representation (CAR) that shifts the paradigm from global target extraction to local boundary decomposition. Unlike prior matting methods restricted to a single foreground/background, CAR enables efficient scene-level inference without manual guidance. Extensive evaluations demonstrate that {\alpha}Depth achieves state-of-the-art performance in stereo conversion, eliminating background bleeding and structural distortions at soft boundaries.

Xiang Zhang, Yang Zhang, Lukas Mehl, Karlis Martins Briedis, Markus Gross, Christopher Schroers• 2026

Related benchmarks

TaskDatasetResultRank
Stereo ConversionMono2Stereo
PSNR33.31
14
Stereo Image ConversionMarvel-10K
PSNR31.03
14
Stereo Image ConversionMono2Stereo (test)
S-PSNR25.6
6
Stereo Video ConversionMarvel-10K (test)
S-PSNR28.46
6
Alpha MattingAIM-500
SAD7.24
4
Alpha MattingP3M-10K
SAD4.09
4
Warping PerformanceMono2Stereo full image
PSNR29.38
3
Warping PerformanceMarvel-10K full image
PSNR27.38
3
Showing 8 of 8 rows

Other info

Follow for update