A Comprehensive Video Dataset for Multi-Modal Recognition Systems. (8th November 2019)
- Record Type:
- Journal Article
- Title:
- A Comprehensive Video Dataset for Multi-Modal Recognition Systems. (8th November 2019)
- Main Title:
- A Comprehensive Video Dataset for Multi-Modal Recognition Systems
- Authors:
- Handa, Anand
Agarwal, Rashi
Kohli, Narendra - Abstract:
- This paper presents a comprehensive, highly defined and fully labelled video dataset. This dataset consists of videos related to 67 different subjects. The videos contain similar text and the text contains digits from 1 to 20 recited by 67 different subjects using the same experimental setup. This dataset can be used as a unique resource for researchers and analysts for training deep neural networks to build highly efficient and accurate recognition models in various domains of computer vision such as face recognition model, expression recognition model, speech recognition model, text recognition, etc. In this paper, we also train models related to face recognition and speech recognition on our dataset and also compare the results with the publically available datasets to show the effectiveness of our dataset. The experimental results show that our comprehensive dataset is more accurate than other dataset on which the models are tested.
- Is Part Of:
- Data science journal. Volume 18(2019)
- Journal:
- Data science journal
- Issue:
- Volume 18(2019)
- Issue Display:
- Volume 18, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 18
- Issue:
- 2019
- Issue Sort Value:
- 2019-0018-2019-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-11-08
- Subjects:
- Machine leaning -- Deep learning -- video datasets -- Convolutional Neural Network
Science -- Data processing -- Periodicals
Database management -- Periodicals
502.85 - Journal URLs:
- http://datascience.codata.org/ ↗
http://www.codata.org/dsj/index.html ↗ - DOI:
- 10.5334/dsj-2019-055 ↗
- Languages:
- English
- ISSNs:
- 1683-1470
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 14750.xml