Language models, surprisal and fantasy in Slavic intercomprehension. (January 2019)
- Record Type:
- Journal Article
- Title:
- Language models, surprisal and fantasy in Slavic intercomprehension. (January 2019)
- Main Title:
- Language models, surprisal and fantasy in Slavic intercomprehension
- Authors:
- Jágrová, Klára
Avgustinova, Tania
Stenger, Irina
Fischer, Andrea - Abstract:
- Abstract: In monolingual human language processing, the predictability of a word given its surrounding sentential context is crucial. With regard to receptive multilingualism, it is unclear to what extent predictability in context interplays with other linguistic factors in understanding a related but unknown language – a process called intercomprehension. We distinguish two dimensions influencing processing effort during intercomprehension: surprisal in sentential context and linguistic distance. Based on this hypothesis, we formulate expectations regarding the difficulty of designed experimental stimuli and compare them to the results from think-aloud protocols of experiments in which Czech native speakers decode Polish sentences by agreeing on an appropriate translation. On the one hand, orthographic and lexical distances are reliable predictors of linguistic similarity. On the other hand, we obtain the predictability of words in a sentence with the help of trigram language models. We find that linguistic distance (encoding similarity) and in-context surprisal (predictability in context) appear to be complementary, with neither factor outweighing the other, and that our distinguishing of these two measurable dimensions is helpful in understanding certain unexpected effects in human behaviour.
- Is Part Of:
- Computer speech & language. Volume 53(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 53(2019)
- Issue Display:
- Volume 53, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 53
- Issue:
- 2019
- Issue Sort Value:
- 2019-0053-2019-0000
- Page Start:
- 242
- Page End:
- 275
- Publication Date:
- 2019-01
- Subjects:
- Statistical language modelling -- Surprisal -- Receptive multilingualism -- Slavic languages -- Sentential context -- Think-aloud protocols -- Polish -- Czech -- Reading
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.04.005 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7651.xml