Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EigeNet: Geometry-Informed Multi-Modal Learning for Few-shot Novel View RIR Prediction

About

Predicting spatially varying Room Impulse Response (RIR) from sparse observations is a critical but highly challenging inverse problem for immersive spatial audio rendering. In this work, we present EIGENET, a geometry-informed multi-modal framework for few-shot novel view RIR prediction. At its core is a Cross-view Alternate-attention Transformer that iteratively refines local intra-view acoustic structures and global cross-view spatial relationships. We empirically demonstrate that this architecture is capable of making full use of the multi-view multi-modal context while performing spatial-temporal reasoning for RIR prediction. Inspired by acoustic ray tracing, we design a geometry-informed modulation block to formulate the connection between geometric features and RIR power spectrum. In the mean time, an auxiliary loss is introduced to transform the single-target waveform prediction into a multi-task learning framework. Through ablation studies, we demonstrate that this design yields consistent performance gains regardless of the underlying backbone, thereby confirming its foundational utility and architecture-agnostic generalizability for RIR prediction task. Evaluated on both simulated and real-world benchmarks, EIGENET achieves both state-of-the-art performance in few-shot novel view RIR prediction and sim-to-real generalization. Codes and checkpoints are available on https://github.com/FEAfeatherTHER/EigeNet.

Chong Jing, Zitong Lan, Junan Zhang, Zhizheng Wu• 2026

Related benchmarks

TaskDatasetResultRank
Sim-to-Real Acoustic Parameter PredictionHAA Classroom 8:2 ratio (test)
EDT0.022
28
Sim-to-Real Acoustic Parameter PredictionHAA Hallway 8:2 ratio (test)
EDT (s)0.036
14
Sim-to-Real Acoustic Parameter PredictionHAA Dampened 8:2 ratio (test)
EDT0.025
14
Room Impulse Response PredictionAcousticRooms 16 kHz (test)
EDT (s)0.041
13
Showing 4 of 4 rows

Other info

Follow for update