A novel one-stage framework for visual pulse rate estimation using deep neural networks. (April 2021)
- Record Type:
- Journal Article
- Title:
- A novel one-stage framework for visual pulse rate estimation using deep neural networks. (April 2021)
- Main Title:
- A novel one-stage framework for visual pulse rate estimation using deep neural networks
- Authors:
- Huang, Bin
Lin, Chun-Liang
Chen, Weihai
Juang, Chia-Feng
Wu, Xingming - Abstract:
- Highlights: PRnet can extract PR with a 2s video, and adapt for a sharp variation PR. PRnet is a complete neural networks pipeline with joint optimization framework. PRnet is a spatio-temporal frramwork that can extract the latent temporal information. PRnet achieves competitive performance than state-of-the-art methods. Abstract: Estimation of the visual pulse rate (also called heart rate) refers to extraction of the pulse rate from a facial video. With the studies on extracting photoplethysmography (PPG) signals from a facial video, the non-contacted measurement method has aroused great interest among researchers over the past few years. In this study, a novel one-stage spatio-temporal framework, namely PRnet, is proposed to estimate the pulse rate from a stationary facial video. First, visual pulse rate estimation is defined as a regression task based on deep neural networks, in which a video is mapped to a pulse rate value. Then, 3D convolutional neural networks (Conv3D) and Long short-term memory (LSTM) modules are used to extract spatial and latent temporal information that is hidden in a video. Subsequently, one fully connected layer is applied in the last layer of PRnet to estimate the pulse rate directly. Based on the exquisite framework design, our proposed method realizes competitive performance, especially in terms of processing latency, since it does not rely on power spectral density (PSD) and traditional Fast Fourier Transform (FFT) algorithms. Using ourHighlights: PRnet can extract PR with a 2s video, and adapt for a sharp variation PR. PRnet is a complete neural networks pipeline with joint optimization framework. PRnet is a spatio-temporal frramwork that can extract the latent temporal information. PRnet achieves competitive performance than state-of-the-art methods. Abstract: Estimation of the visual pulse rate (also called heart rate) refers to extraction of the pulse rate from a facial video. With the studies on extracting photoplethysmography (PPG) signals from a facial video, the non-contacted measurement method has aroused great interest among researchers over the past few years. In this study, a novel one-stage spatio-temporal framework, namely PRnet, is proposed to estimate the pulse rate from a stationary facial video. First, visual pulse rate estimation is defined as a regression task based on deep neural networks, in which a video is mapped to a pulse rate value. Then, 3D convolutional neural networks (Conv3D) and Long short-term memory (LSTM) modules are used to extract spatial and latent temporal information that is hidden in a video. Subsequently, one fully connected layer is applied in the last layer of PRnet to estimate the pulse rate directly. Based on the exquisite framework design, our proposed method realizes competitive performance, especially in terms of processing latency, since it does not rely on power spectral density (PSD) and traditional Fast Fourier Transform (FFT) algorithms. Using our method, only 60 frames of video (2 s) are required for the robust prediction of the pulse rate, whereas 6–30 s of video are typically required for other methods. Finally, a novel visual pulse rate estimation database, which includes pulse rate range at various times of day, is collected to evaluate the proposed framework. The results of extensive experiments demonstrate that PRnet performs competitively while compared with state-of-the-art methods. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 66(2021)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 66(2021)
- Issue Display:
- Volume 66, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 66
- Issue:
- 2021
- Issue Sort Value:
- 2021-0066-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-04
- Subjects:
- Visual pulse rate estimation -- Remote PPG -- Pulse rate monitoring -- Deep learning -- 3D convolutional neural networks -- LSTM (long short-term memory)
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2020.102387 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23779.xml