VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification. (July 2023)