A novel spiral pattern and 2D M4 pooling based environmental sound classification method. (15th December 2020)
- Record Type:
- Journal Article
- Title:
- A novel spiral pattern and 2D M4 pooling based environmental sound classification method. (15th December 2020)
- Main Title:
- A novel spiral pattern and 2D M4 pooling based environmental sound classification method
- Authors:
- Tuncer, Turker
Subasi, Abdulhamit
Ertam, Fatih
Dogan, Sengul - Abstract:
- Highlights: A novel spiral pattern and a multi statistical pooling method (2D M4) are presented. We proposed a multileveled environmental sound classification. The proposed method achieved higher classification rates than human auditory system for ESC10 and ESC50 datasets. The proposed method outperforms than other selected state-of-art method. Abstract: One of the crucial problems of the signal processing, digital forensics and machine learning is the environmental sound classification (ESC). Several ESC methods have been presented to obtain highly accurate model. In this work, a novel multileveled ESC method is presented. The presented ESC method uses two novel algorithms namely Spiral Pattern and two dimensional maximum, minimum, median and mean (2D-M4) pooling. By using these methods (Spiral Pattern and 2D-M4 pooling), 9 level feature generation approach is presented. Since the proposed Spiral Pattern has nine arrows, it extracts 9 and 18 bits using signum and ternary functions respectively. As a result, 1536 features are extracted in each level and totally 15, 360 features are generated using from 0th to 9th levels. In order to select the discriminative features, neighbourhood component analysis (NCA) is used and 700 most distinctive features are selected. In the classification phase, deep neural network is trained and tested with the ESC-10 and ESC-50 datasets. 98.75% and 85.75% average classification accuracies were achieved with 10-folds cross validation for ESC-10Highlights: A novel spiral pattern and a multi statistical pooling method (2D M4) are presented. We proposed a multileveled environmental sound classification. The proposed method achieved higher classification rates than human auditory system for ESC10 and ESC50 datasets. The proposed method outperforms than other selected state-of-art method. Abstract: One of the crucial problems of the signal processing, digital forensics and machine learning is the environmental sound classification (ESC). Several ESC methods have been presented to obtain highly accurate model. In this work, a novel multileveled ESC method is presented. The presented ESC method uses two novel algorithms namely Spiral Pattern and two dimensional maximum, minimum, median and mean (2D-M4) pooling. By using these methods (Spiral Pattern and 2D-M4 pooling), 9 level feature generation approach is presented. Since the proposed Spiral Pattern has nine arrows, it extracts 9 and 18 bits using signum and ternary functions respectively. As a result, 1536 features are extracted in each level and totally 15, 360 features are generated using from 0th to 9th levels. In order to select the discriminative features, neighbourhood component analysis (NCA) is used and 700 most distinctive features are selected. In the classification phase, deep neural network is trained and tested with the ESC-10 and ESC-50 datasets. 98.75% and 85.75% average classification accuracies were achieved with 10-folds cross validation for ESC-10 and ESC-50 datasets respectively. The experimental results reveal that the proposed Spiral Pattern and 2D-M4 pooling based ESC method is superior than the human auditory system (HAS) for environmental sound classification. … (more)
- Is Part Of:
- Applied acoustics. Volume 170(2020)
- Journal:
- Applied acoustics
- Issue:
- Volume 170(2020)
- Issue Display:
- Volume 170, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 170
- Issue:
- 2020
- Issue Sort Value:
- 2020-0170-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-12-15
- Subjects:
- Environmental sound classification -- Spiral pattern -- 2D M4 pooling -- Deep neural network -- Machine learning -- Digital forensics
Acoustical engineering -- Periodicals
Periodicals
620.2 - Journal URLs:
- http://www.sciencedirect.com/science/journal/0003682X ↗
http://www.elsevier.com/journals ↗
http://www.elsevier.com/homepage/elecserv.htt ↗ - DOI:
- 10.1016/j.apacoust.2020.107508 ↗
- Languages:
- English
- ISSNs:
- 0003-682X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1571.400000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13930.xml