Automated prediction of children's age from voice acoustics. (March 2023)
- Record Type:
- Journal Article
- Title:
- Automated prediction of children's age from voice acoustics. (March 2023)
- Main Title:
- Automated prediction of children's age from voice acoustics
- Authors:
- Novotny, Michal
Cmejla, Roman
Tykalova, Tereza - Abstract:
- Highlights: This study presents a clinically interpretable technology that enables a robust estimate of child speaker age. The Formant and fundamental frequencies measured from vowels of 255 child speakers between 4 and 15 years were used. The MLR model provided a precise age estimate with direct insight into morphological and motor control development. The voice analysis revealed larger RMSE values in girl speakers, likely due to faster vocal tract growth in boys. Abstract: The emergence of a variety of applications aimed at video gaming, parental control, education, specific language impairment, child development assessment, and speech therapy create demands for age-targeted approaches. Yet, there is a lack of methods providing robust and easily interpretable age estimation of speakers from early childhood to post-pubertal stage. This study aims to provide a fully-automated approach for children's age prediction based on voice acoustics. Sustained phonation of vowels /a/, /e/, /i/, /o/, and /u/ recorded from 255 speakers (132 girls and 123 boys) ranging between 4 and 15 years of age were analysed. The first three formant frequencies and fundamental frequency across each vowel were automatically evaluated and used as features for linear and nonlinear regressors to estimate the prediction model. We demonstrate rapid, accurate age estimation with reasonable accuracy of an average 1.3-year difference from actual children's chronological age. The lower age prediction error ofHighlights: This study presents a clinically interpretable technology that enables a robust estimate of child speaker age. The Formant and fundamental frequencies measured from vowels of 255 child speakers between 4 and 15 years were used. The MLR model provided a precise age estimate with direct insight into morphological and motor control development. The voice analysis revealed larger RMSE values in girl speakers, likely due to faster vocal tract growth in boys. Abstract: The emergence of a variety of applications aimed at video gaming, parental control, education, specific language impairment, child development assessment, and speech therapy create demands for age-targeted approaches. Yet, there is a lack of methods providing robust and easily interpretable age estimation of speakers from early childhood to post-pubertal stage. This study aims to provide a fully-automated approach for children's age prediction based on voice acoustics. Sustained phonation of vowels /a/, /e/, /i/, /o/, and /u/ recorded from 255 speakers (132 girls and 123 boys) ranging between 4 and 15 years of age were analysed. The first three formant frequencies and fundamental frequency across each vowel were automatically evaluated and used as features for linear and nonlinear regressors to estimate the prediction model. We demonstrate rapid, accurate age estimation with reasonable accuracy of an average 1.3-year difference from actual children's chronological age. The lower age prediction error of 1.2 years was achieved for boys compared to 1.5 years for girls. The early childhood age from 4 to 5 years was less accurate for prediction. No effect of utterance duration on estimated results was observed. Our results present a robust technology with clinically interpretable outcomes insusceptible to overfitting that enables to predict children's age in a wide range of ages. Better prediction accuracy for boys than girls appears to reflect the faster vocal tract growth for men. The lower prediction accuracy in early childhood can be attributed to rapid nonlinear development and greater variability in the level of motor control maturation. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 81(2023)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 81(2023)
- Issue Display:
- Volume 81, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 81
- Issue:
- 2023
- Issue Sort Value:
- 2023-0081-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-03
- Subjects:
- Automatic age prediction -- Ageing -- Children -- Formant -- Fundamental frequency -- Voice
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.104490 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25985.xml