Dual self-attention with co-attention networks for visual question answering. (September 2021)