Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval. Issue 3 (27th September 2022)
- Record Type:
- Journal Article
- Title:
- Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval. Issue 3 (27th September 2022)
- Main Title:
- Method to integrate speaker identification, speech recognition, and information retrieval algorithms for speaker-based information retrieval
- Authors:
- Muneeb, Muhammad
- Abstract:
- This article proposes speakers' voice-based information (audio and video) retrieval systems, which combines speaker identification, speech recognition, and information retrieval algorithms. Information retrieval systems encompass system structure and a way to query the system for information retrieval. This article illustrates both, including how it is deployed on top of existing systems. The input to the system is a speaker voice sample and a text query. Based on the speaker's voice, the size of the corpus is reduced, and based on the text query, documents are retrieved and ranked. For the speaker identification, we used the LPC coefficient, for voice recognition, we used a Python speech recognition library, and for ranking, we used cosine similarity and TF-IDF. Other algorithms can replace any intermediate modules depending on the system, like crime investigation, news analysis, and lecture retrieval. We demonstrated the proposed method on simulated data generated from online websites.
- Is Part Of:
- International journal of knowledge engineering and data mining. Volume 7:Issue 3/4(2022)
- Journal:
- International journal of knowledge engineering and data mining
- Issue:
- Volume 7:Issue 3/4(2022)
- Issue Display:
- Volume 7, Issue 3/4 (2022)
- Year:
- 2022
- Volume:
- 7
- Issue:
- 3/4
- Issue Sort Value:
- 2022-0007-NaN-0000
- Page Start:
- 234
- Page End:
- 251
- Publication Date:
- 2022-09-27
- Subjects:
- audio retrieval -- information retrieval -- speaker identification -- TF-IDF -- voice recognition
Knowledge representation (Information theory) -- Periodicals
Data mining -- Periodicals
006.305 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalCODE=ijkedm ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1755-2087
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23465.xml