From categories to gradience: Auto-coding sociophonetic variation with random forests. Issue 1 (10th June 2020)
- Record Type:
- Journal Article
- Title:
- From categories to gradience: Auto-coding sociophonetic variation with random forests. Issue 1 (10th June 2020)
- Main Title:
- From categories to gradience: Auto-coding sociophonetic variation with random forests
- Authors:
- Villarreal, Dan
Clark, Lynn
Hay, Jennifer
Watson, Kevin - Abstract:
- The time-consuming nature of coding sociophonetic variables that are typically treated as categorical represents an impediment to addressing research questions around these variables that require large volumes of data. In this paper, we apply a machine learning method, random forest classification (Breiman, 2001 ), to automate coding (categorical prediction) of two English sociophonetic variables traditionally treated as categorical, non-prevocalic /r/ and word-medial intervocalic /t/, based on tokens' acoustic signatures. We found good performance for binary classifiers of non-prevocalic /r/ (Absent versus Present) and medial /t/ (Voiced versus Voiceless), but not for medial /t/ with a six-way coding distinction (largely due to some codes being sparsely represented in the training data). This method also yields rankings of acoustic measures in terms of importance in classification. Beyond any individual measures, this method generates probabilistic predictions of variation (classifier probabilities) that represent a composite of the acoustic cues fed into the model. In a listening experiment, we found that not only did classifier probabilities significantly capture gradience in trained listeners' perceptions of rhoticity, they better predicted listeners' perceptions than individual acoustic measures. This method thus represents a new approach to reconciling the categorical and continuous dimensions of sociophonetic variation.
- Is Part Of:
- Laboratory phonology. Volume 11:Issue 1(2020)
- Journal:
- Laboratory phonology
- Issue:
- Volume 11:Issue 1(2020)
- Issue Display:
- Volume 11, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 11
- Issue:
- 1
- Issue Sort Value:
- 2020-0011-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-06-10
- Subjects:
- Sociophonetic variation -- machine learning -- rhoticity -- New Zealand English
Grammar, Comparative and general -- Phonology -- Periodicals
Phonetics -- Periodicals
Psycholinguistics -- Periodicals
Linguistic change -- Periodicals
415.05 - Journal URLs:
- http://www.degruyter.com ↗
http://www.journal-labphon.org/ ↗ - DOI:
- 10.5334/labphon.216 ↗
- Languages:
- English
- ISSNs:
- 1868-6346
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5140.950000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14951.xml