Phoneme sequence recognition via DTW-based classification. Issue 2 (August 2016)
- Record Type:
- Journal Article
- Title:
- Phoneme sequence recognition via DTW-based classification. Issue 2 (August 2016)
- Main Title:
- Phoneme sequence recognition via DTW-based classification
- Authors:
- Hamooni, Hossein
Mueen, Abdullah
Neel, Amy - Abstract:
- Abstract Phonemes are the smallest units of sound produced by a human being. Automatic classification of phonemes is a well-researched topic in linguistics due to its potential for robust speech recognition. With the recent advancement of phonetic segmentation algorithms, it is now possible to generate datasets of millions of phonemes automatically. Phoneme classification on such datasets is a challenging data mining task because of the large number of classes (over a hundred) and complexities of the existing methods. In this paper, we introduce the phoneme classification problem as a data mining task. We propose a dual-domain (time and frequency) hierarchical classification algorithm. Our method uses a dynamic time warping (DTW)-based classifier in the top layers and time–frequency features in the lower layer. We cross-validate our method on phonemes from three online dictionaries and achieved up to 35 % improvement in classification compared with existing techniques. We further modify our vowel classifier by adopting DTW distance over time–frequency coefficients and gain an additional 3 % improvement. We provide case studies on classifying accented phonemes and speaker-invariant phoneme classification. Finally, we show a demonstration of how phoneme classification can be used to recognize speech.
- Is Part Of:
- Knowledge and information systems. Volume 48:Issue 2(2016:Aug.)
- Journal:
- Knowledge and information systems
- Issue:
- Volume 48:Issue 2(2016:Aug.)
- Issue Display:
- Volume 48, Issue 2 (2016)
- Year:
- 2016
- Volume:
- 48
- Issue:
- 2
- Issue Sort Value:
- 2016-0048-0002-0000
- Page Start:
- 253
- Page End:
- 275
- Publication Date:
- 2016-08
- Subjects:
- Phoneme classification -- DTW-based classification -- Phonetic time series -- Big data -- Sequence recognition
Expert systems (Computer science) -- Periodicals
Information storage and retrieval systems -- Periodicals
006.33 - Journal URLs:
- http://link.springer-ny.com/link/service/journals/10115/index.htm ↗
http://www.springerlink.com/content/0219-1377 ↗
http://www.springer.com/gb/ ↗ - DOI:
- 10.1007/s10115-015-0885-9 ↗
- Languages:
- English
- ISSNs:
- 0219-1377
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5100.437300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9906.xml