A light‐weight stereo matching network based on multi‐scale features fusion and robust disparity refinement. Issue 6 (15th February 2023)
- Record Type:
- Journal Article
- Title:
- A light‐weight stereo matching network based on multi‐scale features fusion and robust disparity refinement. Issue 6 (15th February 2023)
- Main Title:
- A light‐weight stereo matching network based on multi‐scale features fusion and robust disparity refinement
- Authors:
- Yang, Xiaowei
Zhao, Yong
Feng, Zhiguo
Sang, Haiwei
Zhang, Zhenbo
Zhang, Guiying
He, Lin - Abstract:
- Abstract: In recent years, convolutional‐neural‐network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state‐of‐the‐art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end‐to‐end light‐weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi‐scale feature fusion for feature extraction, a 3D U‐Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi‐resolution information is adaptively incorporated into the cost volume via cross‐scale connections. Further, a left‐right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state‐of‐the‐art stereo matching methods.Abstract: In recent years, convolutional‐neural‐network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state‐of‐the‐art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end‐to‐end light‐weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi‐scale feature fusion for feature extraction, a 3D U‐Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi‐resolution information is adaptively incorporated into the cost volume via cross‐scale connections. Further, a left‐right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state‐of‐the‐art stereo matching methods. Abstract : We propose a novel end‐to‐end light‐weight network for stereo matching. The proposed LWNet is trained and evaluated on the Scene Flow, KITTI 2015, and KITTI 2012 datasets, and the experimental results show that our method achieves state‐of‐the‐art performance. … (more)
- Is Part Of:
- IET image processing. Volume 17:Issue 6(2023)
- Journal:
- IET image processing
- Issue:
- Volume 17:Issue 6(2023)
- Issue Display:
- Volume 17, Issue 6 (2023)
- Year:
- 2023
- Volume:
- 17
- Issue:
- 6
- Issue Sort Value:
- 2023-0017-0006-0000
- Page Start:
- 1797
- Page End:
- 1811
- Publication Date:
- 2023-02-15
- Subjects:
- computer vision -- image processing -- stereo image processing
Image processing -- Periodicals
621.36705 - Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-ipr ↗
http://ieeexplore.ieee.org/servlet/opac?punumber=4149689 ↗
http://www.ietdl.org/IET-IPR ↗
https://ietresearch.onlinelibrary.wiley.com/journal/17519667 ↗
http://www.theiet.org/ ↗ - DOI:
- 10.1049/ipr2.12756 ↗
- Languages:
- English
- ISSNs:
- 1751-9659
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 27099.xml