Rough-fuzzy based scene categorization for text detection and recognition in video. (August 2018)
- Record Type:
- Journal Article
- Title:
- Rough-fuzzy based scene categorization for text detection and recognition in video. (August 2018)
- Main Title:
- Rough-fuzzy based scene categorization for text detection and recognition in video
- Authors:
- Roy, Sangheeta
Shivakumara, Palaiahnakote
Jain, Namita
Khare, Vijeta
Dutta, Anjan
Pal, Umapada
Lu, Tong - Abstract:
- Highlights: A new combination of rough set and fuzzy concept to study shapes. New stroke based features semantic features are extracted for classification. A new method for categorizing videos into different classes. Abstract: Scene image or video understanding is a challenging task especially when number of video types increases drastically with high variations in background and foreground. This paper proposes a new method for categorizing scene videos into different classes, namely, Animation, Outlet, Sports, e-Learning, Medical, Weather, Defense, Economics, Animal Planet and Technology, for the performance improvement of text detection and recognition, which is an effective approach for scene image or video understanding. For this purpose, at first, we present a new combination of rough and fuzzy concept to study irregular shapes of edge components in input scene videos, which helps to classify edge components into several groups. Next, the proposed method explores gradient direction information of each pixel in each edge component group to extract stroke based features by dividing each group into several intra and inter planes. We further extract correlation and covariance features to encode semantic features located inside planes or between planes. Features of intra and inter planes of groups are then concatenated to get a feature matrix. Finally, the feature matrix is verified with temporal frames and fed to a neural network for categorization. Experimental resultsHighlights: A new combination of rough set and fuzzy concept to study shapes. New stroke based features semantic features are extracted for classification. A new method for categorizing videos into different classes. Abstract: Scene image or video understanding is a challenging task especially when number of video types increases drastically with high variations in background and foreground. This paper proposes a new method for categorizing scene videos into different classes, namely, Animation, Outlet, Sports, e-Learning, Medical, Weather, Defense, Economics, Animal Planet and Technology, for the performance improvement of text detection and recognition, which is an effective approach for scene image or video understanding. For this purpose, at first, we present a new combination of rough and fuzzy concept to study irregular shapes of edge components in input scene videos, which helps to classify edge components into several groups. Next, the proposed method explores gradient direction information of each pixel in each edge component group to extract stroke based features by dividing each group into several intra and inter planes. We further extract correlation and covariance features to encode semantic features located inside planes or between planes. Features of intra and inter planes of groups are then concatenated to get a feature matrix. Finally, the feature matrix is verified with temporal frames and fed to a neural network for categorization. Experimental results show that the proposed method outperforms the existing state-of-the-art methods, at the same time, the performances of text detection and recognition methods are also improved significantly due to categorization. … (more)
- Is Part Of:
- Pattern recognition. Volume 80(2018:Aug.)
- Journal:
- Pattern recognition
- Issue:
- Volume 80(2018:Aug.)
- Issue Display:
- Volume 80 (2018)
- Year:
- 2018
- Volume:
- 80
- Issue Sort Value:
- 2018-0080-0000-0000
- Page Start:
- 64
- Page End:
- 82
- Publication Date:
- 2018-08
- Subjects:
- Rough set -- Fuzzy set -- Video categorization -- Scene image classification -- Video text detection -- Video text recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2018.02.014 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6403.xml