A Novel Convolution and Attention Mechanism-based Model for 6D Object Pose Estimation
arXiv:2501.01993v2 Announce Type: replace-cross Abstract: This paper proposes PoseLecTr, a graph-based encoder-decoder framework that integrates a novel Legendre convolution with attention mechanisms for six-degree-of-freedom (6-DOF) object pose estimation from monocular RGB images. Conventional learning-based approaches predominantly rely on grid-structured convolutions,…
