Two‐stage spoken term detection system for under‐resourced languages. Issue 9 (6th October 2020)

Record Type:: Journal Article
Title:: Two‐stage spoken term detection system for under‐resourced languages. Issue 9 (6th October 2020)
Main Title:: Two‐stage spoken term detection system for under‐resourced languages
Authors:: G, Deekshitha
Mary, Leena
Abstract:: Abstract : Spoken Term Detection (STD) is the process of locating the occurrences of spoken queries in a given speech database. Generally, two methods are adopted for STD: an ASR based sequence matching and ASR‐free, feature‐based template matching. If a well‐performing ASR is available, the former STD method is accurate. However, to build an ASR with consistent performance, several hours of labelled corpora is required. Template matching methods work well for small or chopped utterances. However, in practice, the volume of the search database can be huge, containing sentences of varying lengths. Hence time complexity of template matching techniques will be high, which makes them impractical for realistic search applications. In this work, a two‐stage STD system is proposed, which combines the ASR‐based phoneme sequence matching in the first stage and feature sequence template matching of selected locations in the second stage. The time complexity of the second stage is reduced by performing DTW‐based template matching only at probable query locations identified by the first stage. 'Split and match' approach helps to reduce the false‐positives in case of longer query words. Effectiveness of the proposed method is demonstrated using English and Malayalam datasets.
Is Part Of:: IET signal processing. Volume 14:Issue 9(2020)
Journal:: IET signal processing
Issue:: Volume 14:Issue 9(2020)
Issue Display:: Volume 14, Issue 9 (2020)
Year:: 2020
Volume:: 14
Issue:: 9
Issue Sort Value:: 2020-0014-0009-0000
Page Start:: 602
Page End:: 613
Publication Date:: 2020-10-06
Subjects:: speech recognition -- probability -- feature extraction -- speech processing -- natural language processing -- image matching -- query processing
stage spoken term detection system -- spoken queries -- given speech database -- automatic speech recognition based sequence matching -- ASR‐free -- STD method -- labelled corpora -- template matching methods work -- search database -- time complexity -- template matching techniques -- two‐stage STD -- phoneme label sequence matching -- feature sequence template matching -- available annotated corpora -- sequence matching technique -- erroneous label sequences -- feature level template matching -- probable query locations -- template matching approach -- longer query words -- labelled story database -- STD task -- STD system -- query length -- time 3.5 hour
Signal processing -- Periodicals
621.3822
Journal URLs:: http://digital-library.theiet.org/content/journals/iet-spr ↗
http://ieeexplore.ieee.org/servlet/opac?punumber=4159607 ↗
http://www.ietdl.org/IET-SPR ↗
https://ietresearch.onlinelibrary.wiley.com/journal/17519683 ↗
http://www.theiet.org/ ↗
DOI:: 10.1049/iet-spr.2019.0131 ↗
Languages:: English
ISSNs:: 1751-9675
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 4363.253535
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 16478.xml