Towards dense people detection with deep learning and depth images. (November 2021)
- Record Type:
- Journal Article
- Title:
- Towards dense people detection with deep learning and depth images. (November 2021)
- Main Title:
- Towards dense people detection with deep learning and depth images
- Authors:
- Fuentes-Jimenez, David
Losada-Gutierrez, Cristina
Casillas-Perez, David
Macias-Guarasa, Javier
Pizarro, Daniel
Martin-Lopez, Roberto
Luna, Carlos A. - Abstract:
- Abstract: This paper describes a novel DNN-based system, named PD3net, that detects multiple people from a single depth image, in real time. The proposed neural network processes a depth image and outputs a likelihood map in image coordinates, where each detection corresponds to a Gaussian-shaped local distribution, centered at each person's head. This likelihood map encodes both the number of detected people as well as their position in the image, from which the 3D position can be computed. The proposed DNN includes spatially separated convolutions to increase performance, and runs in real-time with low budget GPUs. We use synthetic data for initially training the network, followed by fine tuning with a small amount of real data. This allows adapting the network to different scenarios without needing large and manually labeled image datasets. Due to that, the people detection system presented in this paper has numerous potential applications in different fields, such as capacity control, automatic video-surveillance, people or groups behavior analysis, healthcare or monitoring and assistance of elderly people in ambient assisted living environments. In addition, the use of depth information does not allow recognizing the identity of people in the scene, thus enabling their detection while preserving their privacy. The proposed DNN has been experimentally evaluated and compared with other state-of-the-art approaches, including both classical and DNN-based solutions, under aAbstract: This paper describes a novel DNN-based system, named PD3net, that detects multiple people from a single depth image, in real time. The proposed neural network processes a depth image and outputs a likelihood map in image coordinates, where each detection corresponds to a Gaussian-shaped local distribution, centered at each person's head. This likelihood map encodes both the number of detected people as well as their position in the image, from which the 3D position can be computed. The proposed DNN includes spatially separated convolutions to increase performance, and runs in real-time with low budget GPUs. We use synthetic data for initially training the network, followed by fine tuning with a small amount of real data. This allows adapting the network to different scenarios without needing large and manually labeled image datasets. Due to that, the people detection system presented in this paper has numerous potential applications in different fields, such as capacity control, automatic video-surveillance, people or groups behavior analysis, healthcare or monitoring and assistance of elderly people in ambient assisted living environments. In addition, the use of depth information does not allow recognizing the identity of people in the scene, thus enabling their detection while preserving their privacy. The proposed DNN has been experimentally evaluated and compared with other state-of-the-art approaches, including both classical and DNN-based solutions, under a wide range of experimental conditions. The achieved results allows concluding that the proposed architecture and the training strategy are effective, and the network generalize to work with scenes different from those used during training. We also demonstrate that our proposal outperforms existing methods and can accurately detect people in scenes with significant occlusions. Highlights: Robust system to detect people only using depth information from a depth camera. System outperforms state-of-the-art methods in different datasets without fine-tuning. Proposal runs in real time using conventional GPUs. Computational demands are independent of the number of people in the scene. Generated database is available to the research community. … (more)
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 106(2021)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 106(2021)
- Issue Display:
- Volume 106, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 106
- Issue:
- 2021
- Issue Sort Value:
- 2021-0106-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-11
- Subjects:
- People detection -- Depth camera information -- Interest regions estimation -- Feature extraction -- Deep learning -- Convolutional Neural Networks
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engappai.2021.104484 ↗
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20373.xml