Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PoolNet: Deep Learning for 2D to 3D Video Process Validation

About

Lifting Structure-from-Motion (SfM) information from sequential and non-sequential image data is a time-consuming and computationally expensive task. In addition to this, the majority of publicly available data is unfit for processing due to inadequate camera pose variation, obscuring scene elements, and noisy data. To solve this problem, we introduce PoolNet, a versatile deep learning framework for frame-level and scene-level validation of in-the-wild data. We demonstrate that our model successfully differentiates SfM ready scenes from those unfit for processing while significantly undercutting the amount of time state of the art algorithms take to obtain structure-from-motion data.

Sanchit Kaul, Joseph Luna, Shray Arora• 2025

Related benchmarks

TaskDatasetResultRank
3D Reconstruction FiltrationCables
Time (s)13
5
3D Reconstruction FiltrationCeiling
Time (s)19.2
5
3D Reconstruction FiltrationDesk
Execution Time (s)25.9
5
3D Reconstruction FiltrationEinstein
Latency (s)6.1
5
3D Reconstruction FiltrationKidnap
Time (seconds)11.3
5
3D Reconstruction FiltrationLarge
Time (s)17.8
5
3D Reconstruction FiltrationMannequin
Time (s)8.1
5
3D Reconstruction FiltrationMotion
Time (s)30.6
5
3D Reconstruction FiltrationPlanar
Time (s)7.9
5
3D Reconstruction FiltrationPlant
Time (s)1
5
Showing 10 of 18 rows

Other info

Follow for update