Environmental sound classification with dilated convolutions. (May 2019)
- Record Type:
- Journal Article
- Title:
- Environmental sound classification with dilated convolutions. (May 2019)
- Main Title:
- Environmental sound classification with dilated convolutions
- Authors:
- Chen, Yan
Guo, Qian
Liang, Xinyan
Wang, Jiang
Qian, Yuhua - Abstract:
- Highlights: Dilated CNN is introduced to address ESC problem. Dilated CNN achieves better results than that of CNN with max-pooling. Exploring effect of dilation rate for performance. Exploring effect of the layers of networks for performance. Abstract: In sound information retrieval (SIR) area, environmental sound classification (ESC) emerges as a new issue, which aims at classifying environments by analysing the complex features extracted from the various sound data. As one of the most efficient feature extraction methods, convolution neural networks (CNN) has made its success in speech and music signal processing, and in particular, CNN with pooling has worked effectively in classifying environmental and urban sound sources. However, pooling causes information loss. In this paper, dilated CNN, being introduced to ESC problem, achieves better results than that of CNN with max-pooling and other state-of-the-art approaches. At the same time, we explore the effect of different dilation rate and the number of layers of dilated convolution to the experimental results, and find that expanding the number of covered frames or enlarging the dilation rate will make the accuracy reduce. That may be the sound signal has short-term stability, the size of the overlay frame seriously affects the feature extraction of the sound signal, and there is an inherent "gridding" in the dilation model conjunction defect.
- Is Part Of:
- Applied acoustics. Volume 148(2019)
- Journal:
- Applied acoustics
- Issue:
- Volume 148(2019)
- Issue Display:
- Volume 148, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 148
- Issue:
- 2019
- Issue Sort Value:
- 2019-0148-2019-0000
- Page Start:
- 123
- Page End:
- 132
- Publication Date:
- 2019-05
- Subjects:
- Sound information retrieval -- Environmental sound classification -- Dilated convolutions
Acoustical engineering -- Periodicals
Periodicals
620.2 - Journal URLs:
- http://www.sciencedirect.com/science/journal/0003682X ↗
http://www.elsevier.com/journals ↗
http://www.elsevier.com/homepage/elecserv.htt ↗ - DOI:
- 10.1016/j.apacoust.2018.12.019 ↗
- Languages:
- English
- ISSNs:
- 0003-682X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1571.400000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9547.xml