Vision-based method for semantic information extraction in construction by integrating deep learning object detection and image captioning. (August 2022)