Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23. (15th March 2022)
- Record Type:
- Journal Article
- Title:
- Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23. (15th March 2022)
- Main Title:
- Shoelace pattern-based speech emotion recognition of the lecturers in distance education: ShoePat23
- Authors:
- Tanko, Dahiru
Dogan, Sengul
Burak Demir, Fahrettin
Baygin, Mehmet
Engin Sahin, Sakir
Tuncer, Turker - Abstract:
- Highlights: Two novel speech emotion datasets were collected. The main aim of this paper is to detect emotions of the lecturer using their speeches. We proposed a new sholace based feature extractor is proposed. A hand-crafted learning model is named ShoePat23 is presented. Our ShoePat23 attained over 94% accuracy on both datasets. Abstract: Background and objective: We are living in the pandemic age, and many educational institutions have shifted to a distance education system to ensure learning continuity while at the same time curtailing the spread of the Covid-19 virus. Automated speech emotion classification models can be used to measure the lecturer's performance during the lecture. Material and method: In this work, we collected a new lecturer's speech dataset to detect three emotions: positive, neutral, and negative. The dataset is divided into segments with a length of five seconds per segment. Each segment has been utilized as an observation and contains 9541 observations. To automatically classify these emotions, a hand-modeled learning approach is presented. This approach has a comprehensive feature extraction method. In the feature extraction, a shoelace-based local feature generator is introduced, called Shoelace Pattern. The suggested feature extractor generates features at a low level. To further improve the feature generation capability of the Shoelace Pattern, tunable q wavelet transform (TQWT) is used to create sub-bands. Shoelace Pattern generatesHighlights: Two novel speech emotion datasets were collected. The main aim of this paper is to detect emotions of the lecturer using their speeches. We proposed a new sholace based feature extractor is proposed. A hand-crafted learning model is named ShoePat23 is presented. Our ShoePat23 attained over 94% accuracy on both datasets. Abstract: Background and objective: We are living in the pandemic age, and many educational institutions have shifted to a distance education system to ensure learning continuity while at the same time curtailing the spread of the Covid-19 virus. Automated speech emotion classification models can be used to measure the lecturer's performance during the lecture. Material and method: In this work, we collected a new lecturer's speech dataset to detect three emotions: positive, neutral, and negative. The dataset is divided into segments with a length of five seconds per segment. Each segment has been utilized as an observation and contains 9541 observations. To automatically classify these emotions, a hand-modeled learning approach is presented. This approach has a comprehensive feature extraction method. In the feature extraction, a shoelace-based local feature generator is introduced, called Shoelace Pattern. The suggested feature extractor generates features at a low level. To further improve the feature generation capability of the Shoelace Pattern, tunable q wavelet transform (TQWT) is used to create sub-bands. Shoelace Pattern generates features from raw speech and sub-bands, and the proposed feature extraction method selects the most suitable feature vectors. The top four feature vectors are selected and merged to obtain the final feature vector. By deploying neighborhood component analysis (NCA), we chose the most informative 512 features, and these features are classified using a support vector machine (SVM) classifier using 10-fold cross-validation. Results: The proposed learning model based on the shoelace pattern (ShoePat23) attained 94.97% and 96.41% classification accuracies on the collected speech databases consecutively. Conclusions: The findings demonstrate the success of the ShoePat23 on speech emotion recognition. Moreover, this model has been used in the distance education system to detect the performance of the lecturers. … (more)
- Is Part Of:
- Applied acoustics. Volume 190(2022)
- Journal:
- Applied acoustics
- Issue:
- Volume 190(2022)
- Issue Display:
- Volume 190, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 190
- Issue:
- 2022
- Issue Sort Value:
- 2022-0190-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-03-15
- Subjects:
- Speech emotion recognition -- Distance education -- Shoelace Pattern -- NCA -- SVM
Acoustical engineering -- Periodicals
Periodicals
620.2 - Journal URLs:
- http://www.sciencedirect.com/science/journal/0003682X ↗
http://www.elsevier.com/journals ↗
http://www.elsevier.com/homepage/elecserv.htt ↗ - DOI:
- 10.1016/j.apacoust.2022.108637 ↗
- Languages:
- English
- ISSNs:
- 0003-682X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1571.400000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20832.xml