Clustering first-order co-occurrences as a way to explore semantic heterogeneity. (21st February 2015)
- Record Type:
- Journal Article
- Title:
- Clustering first-order co-occurrences as a way to explore semantic heterogeneity. (21st February 2015)
- Main Title:
- Clustering first-order co-occurrences as a way to explore semantic heterogeneity
- Authors:
- Bertels, Ann
Speelman, Dirk - Abstract:
- Abstract : This paper addresses the contribution of quantitative analysis and statistical techniques to qualitative semantic analysis, as it discusses the methodological issues for clustering and plotting the most significant first-order co-occurrences of a word as a way to explore its degree of semantic heterogeneity in a technical corpus. Since distributional (dis)similarity reflects semantic (dis)similarity, first-order co-occurrences are clustered with respect to the second and/or third-order co-occurrences they have in common. In this comparative and exploratory study, several experiments are carried out in order to evaluate the impact of various parameters for clustering and in order to find the most reliable configuration of parameters, including association measures, distance measures and lower and upper thresholds. Multidimensional scaling techniques and the visual exploration of semantic proximity between first-order co-occurrences of a node allow us to gain insight into the phenomena of semantic homogeneity and heterogeneity in a technical corpus. As a consequence, we can come to a better understanding of the semantic characteristics of specialized language. However, the methodology for understanding this area is still being implemented and worked out. With the experiments described in this paper, we are contributing to the ongoing methodological analysis of measures and parameters to be used in the field of distributional semantics.
- Is Part Of:
- Journal of research design and statistics in linguistics and communication science. Volume 1:Number 2(2014)
- Journal:
- Journal of research design and statistics in linguistics and communication science
- Issue:
- Volume 1:Number 2(2014)
- Issue Display:
- Volume 1, Issue 2 (2014)
- Year:
- 2014
- Volume:
- 1
- Issue:
- 2
- Issue Sort Value:
- 2014-0001-0002-0000
- Page Start:
- 123
- Page End:
- 146
- Publication Date:
- 2015-02-21
- Subjects:
- co-occurrence analysis -- association measures -- distance measures -- multidimensional scaling (mds) -- quantitative semantics
Linguistics -- Research -- Periodicals
Linguistics -- Statistical methods -- Periodicals
Mathematical linguistics -- Periodicals
410.72 - Journal URLs:
- http://www.equinoxpub.com/journals/index.php/JRDS/index ↗
- DOI:
- 10.1558/jrds.v1i2.22182 ↗
- Languages:
- English
- ISSNs:
- 2052-417X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 7802.xml