Multi-modal graph reasoning for structured video text extraction. (April 2023)