Factors influencing automatic segmental alignment of sociophonetic corpora. Issue 3 (November 2016)
- Record Type:
- Journal Article
- Title:
- Factors influencing automatic segmental alignment of sociophonetic corpora. Issue 3 (November 2016)
- Main Title:
- Factors influencing automatic segmental alignment of sociophonetic corpora
- Authors:
- Fromont, Robert
Watson, Kevin - Abstract:
- Abstract : Automatically time-aligning utterances at the segmental level is increasingly common practice in phonetic and sociophonetic work because of the obvious benefits it brings in allowing the efficient scaling up of the amount of speech data that can be analysed. The field is arriving at a set of recommended practices for improving alignment accuracy, but methodological differences across studies (e.g., the use of different languages and different measures of accuracy) often mean that direct comparison of the factors which facilitate or hinder alignment can be difficult. In this paper, following a review of the state of the art in automatic segmental alignment, we test the effects of a number of factors on its accuracy. Namely, we test the effects of: ( 1 ) the presence or absence of pause markers in the training data, ( 2 ) the presence of overlapping speech or other noise, ( 3 ) using training data from single or multiple speakers, ( 4 ) using different sampling rates, ( 5 ) using pre-trained acoustic models versus models trained 'from scratch', and ( 6 ) using different amounts of training data. For each test, we examine three different varieties of English, from New Zealand, the USA and the UK. The paper concludes with some recommendations for automatic segmental alignment in general.
- Is Part Of:
- Corpora. Volume 11:Issue 3(2016)
- Journal:
- Corpora
- Issue:
- Volume 11:Issue 3(2016)
- Issue Display:
- Volume 11, Issue 3 (2016)
- Year:
- 2016
- Volume:
- 11
- Issue:
- 3
- Issue Sort Value:
- 2016-0011-0003-0000
- Page Start:
- 401
- Page End:
- 431
- Publication Date:
- 2016-11
- Subjects:
- Alignment -- American English -- Liverpool English -- New Zealand English -- sociophonetics
Corpora (Linguistics) -- Periodicals
410.188 - Journal URLs:
- http://www.euppublishing.com/journal/cor ↗
http://www.euppublishing.com/journals ↗ - DOI:
- 10.3366/cor.2016.0101 ↗
- Languages:
- English
- ISSNs:
- 1749-5032
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5044.xml