Acoustic and articulatory analysis and synthesis of shouted vowels. (March 2021)
- Record Type:
- Journal Article
- Title:
- Acoustic and articulatory analysis and synthesis of shouted vowels. (March 2021)
- Main Title:
- Acoustic and articulatory analysis and synthesis of shouted vowels
- Authors:
- Xue, Yawen
Marxen, Michael
Akagi, Masato
Birkholz, Peter - Abstract:
- Highlights: Acoustic and articulatory differences between spoken and shouted vowels were analyzed for two male and two female subjects by means of acoustic recordings and midsagittal magnetic resonance images of the vocal tract. Fundamental frequencies, intensities, and formant frequencies were all generally higher for shouted than for spoken vowels. The harmonics-to-noise ratios and H1-H2 measures were generally lower for shouted vowels than for spoken vowels. All subjects used an increased lip opening, an increased jaw opening, and a lower tongue position for shouted vowels. However, the changes of vertical larynx position, uvula elevation, and jaw protrusion between spoken and shouted vowels were inconsistent among subjects. Perception experiments were conducted to examine how changes of fundamental frequency, subglottal pressure, vocal tract shape, and phonation type contribute to the perception of stimuli created by articulatory synthesis as being shouted. Here, fundamental frequency had the greatest effect, followed by vocal tract shape and lung pressure, with no measurable effect of phonation type. Abstract: Acoustic and articulatory differences between spoken and shouted vowels were analyzed for two male and two female subjects by means of acoustic recordings and midsagittal magnetic resonance images of the vocal tract. In accordance with previous acoustic findings, the fundamental frequencies, intensities, and formant frequencies were all generally higher forHighlights: Acoustic and articulatory differences between spoken and shouted vowels were analyzed for two male and two female subjects by means of acoustic recordings and midsagittal magnetic resonance images of the vocal tract. Fundamental frequencies, intensities, and formant frequencies were all generally higher for shouted than for spoken vowels. The harmonics-to-noise ratios and H1-H2 measures were generally lower for shouted vowels than for spoken vowels. All subjects used an increased lip opening, an increased jaw opening, and a lower tongue position for shouted vowels. However, the changes of vertical larynx position, uvula elevation, and jaw protrusion between spoken and shouted vowels were inconsistent among subjects. Perception experiments were conducted to examine how changes of fundamental frequency, subglottal pressure, vocal tract shape, and phonation type contribute to the perception of stimuli created by articulatory synthesis as being shouted. Here, fundamental frequency had the greatest effect, followed by vocal tract shape and lung pressure, with no measurable effect of phonation type. Abstract: Acoustic and articulatory differences between spoken and shouted vowels were analyzed for two male and two female subjects by means of acoustic recordings and midsagittal magnetic resonance images of the vocal tract. In accordance with previous acoustic findings, the fundamental frequencies, intensities, and formant frequencies were all generally higher for shouted than for spoken vowels. The harmonics-to-noise ratios and H1-H2 measures were generally lower for shouted vowels than for spoken vowels. With regard to articulation, all subjects used an increased lip opening, an increased jaw opening, and a lower tongue position for shouted vowels. However, the changes of vertical larynx position, uvula elevation, and jaw protrusion between spoken and shouted vowels were inconsistent among subjects. Based on the analysis results, a perception experiment was conducted to examine how changes of fundamental frequency, subglottal pressure, vocal tract shape, and phonation type contribute to the perception of stimuli created by articulatory synthesis as being shouted. Here, fundamental frequency had the greatest effect, followed by vocal tract shape and lung pressure, with no measurable effect of phonation type. … (more)
- Is Part Of:
- Computer speech & language. Volume 66(2021)
- Journal:
- Computer speech & language
- Issue:
- Volume 66(2021)
- Issue Display:
- Volume 66, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 66
- Issue:
- 2021
- Issue Sort Value:
- 2021-0066-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-03
- Subjects:
- Shouted speech -- Articulatory analysis -- Articulatory synthesis -- Magnetic Resonance Imaging
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2020.101156 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15413.xml