CMA: Cross-modal attention for 6D object pose estimation. (June 2021)