Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers
About
We introduce Eff-GRot, an approach for efficient and generalizable rotation estimation from RGB images. Given a query image and a set of reference images with known orientations, our method directly predicts the object's rotation in a single forward pass, without requiring object- or category-specific training. At the core of our framework is a transformer that performs a comparison in the latent space, jointly processing rotation-aware representations from multiple references alongside a query. This design enables a favorable balance between accuracy and computational efficiency while remaining simple, scalable, and fully end-to-end. Experimental results show that Eff-GRot offers a promising direction toward more efficient rotation estimation, particularly in latency-sensitive applications.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Rotation Estimation | ShapeNet synthetic (novel object categories) | Accuracy @ 15°94.8 | 6 | |
| Rotation Estimation | LineMOD | Estimation Time (s)0.019 | 6 | |
| Rotation Estimation | LINEMOD novel objects (test) | Acc @ 15° (benchvise)82.6 | 6 | |
| Rotation Estimation | LineMOD | Peak Memory (MB)256 | 5 |