Rotated cascade R-CNN: A shape robust detector with coordinate regression. (December 2019)
- Record Type:
- Journal Article
- Title:
- Rotated cascade R-CNN: A shape robust detector with coordinate regression. (December 2019)
- Main Title:
- Rotated cascade R-CNN: A shape robust detector with coordinate regression
- Authors:
- Zhu, Yixing
Ma, Chixiang
Du, Jun - Abstract:
- Highlights: We present a novel LocSLPR method that can handle quadrangular/curved objects and well address the ambiguity problem of vertex order compared with direct regression. LocSLPR requires fewer parameters and achieves better results than segmentation-based methods. We present an RCR-CNN, which can gradually regress the object in a two-stage manner and significantly improves the performance of our system. Our proposed method won first place in the ICPR 2018 Contest for Robust Reading for Multi-Type Web Images with a score of 0:796 and was our best single model in the ICPR 2018 Contest on Object Detection in Aerial Images (ODAI) with a 69:2% mean average precision (mAP), where we won first place. In addition, we also achieved the best results on the curved text detection dataset CTW1500, demonstrating the effectiveness and flexibility of our method. Abstract: General object detection task mainly takes axis-aligned bounding-boxes as the detection outputs. To address more challenging scenarios, such as curved text detection and multi-oriented object detection in aerial images, we propose a novel two-stage approach for shape robust object detection. In the first stage, a locally sliding line-based point regression (LocSLPR) approach is presented to estimate the outline of the object, which is denoted as the intersections of the sliding lines and the bounding-box of the object. To make full use of information, we only regress partial coordinates and calculate the remainingHighlights: We present a novel LocSLPR method that can handle quadrangular/curved objects and well address the ambiguity problem of vertex order compared with direct regression. LocSLPR requires fewer parameters and achieves better results than segmentation-based methods. We present an RCR-CNN, which can gradually regress the object in a two-stage manner and significantly improves the performance of our system. Our proposed method won first place in the ICPR 2018 Contest for Robust Reading for Multi-Type Web Images with a score of 0:796 and was our best single model in the ICPR 2018 Contest on Object Detection in Aerial Images (ODAI) with a 69:2% mean average precision (mAP), where we won first place. In addition, we also achieved the best results on the curved text detection dataset CTW1500, demonstrating the effectiveness and flexibility of our method. Abstract: General object detection task mainly takes axis-aligned bounding-boxes as the detection outputs. To address more challenging scenarios, such as curved text detection and multi-oriented object detection in aerial images, we propose a novel two-stage approach for shape robust object detection. In the first stage, a locally sliding line-based point regression (LocSLPR) approach is presented to estimate the outline of the object, which is denoted as the intersections of the sliding lines and the bounding-box of the object. To make full use of information, we only regress partial coordinates and calculate the remaining coordinates by the sliding rule. We find that regression can achieve higher precision with fewer parameters than the segmentation method. In the second stage, a rotated cascade region-based convolutional neural network (RCR-CNN) is used to gradually regress the target object, which can further improve the performance of our system. Experiments demonstrate that our method achieves state-of-the-art performance in several quadrangular object detection tasks. For example, our method yielded a score of 0.796 in the ICPR 2018 Contest on Robust Reading for Multi-Type Web Images, where we won first place for text detection tasks. The method also achieved 69.2% mAP on Task 1 of the ICPR 2018 Contest on Object Detection in Aerial Images, which was our best single model, where we also won first place. In addition, the method outperforms the previously published best record on the curved text dataset (CTW1500). … (more)
- Is Part Of:
- Pattern recognition. Volume 96(2019:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 96(2019:Dec.)
- Issue Display:
- Volume 96 (2019)
- Year:
- 2019
- Volume:
- 96
- Issue Sort Value:
- 2019-0096-0000-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-12
- Subjects:
- Object detection -- Text detection -- Aerial images -- Curved text -- Rotated cascade R-CNN
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2019.106964 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11627.xml