An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals. Issue 4 (2nd October 2018)
- Record Type:
- Journal Article
- Title:
- An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals. Issue 4 (2nd October 2018)
- Main Title:
- An analysis of rotation matrix and colour constancy data augmentation in classifying images of animals
- Authors:
- Okafor, Emmanuel
Schomaker, Lambert
Wiering, Marco A. - Abstract:
- ABSTRACT: In this paper, we examine a novel data augmentation (DA) method that transforms an image into a new image containing multiple rotated copies of the original image. The DA method creates a grid of cells, in which each cell contains a different randomly rotated image and introduces a natural background in the newly created image. We investigate the use of deep learning to assess the classification performance on the rotation matrix or original dataset with colour constancy versions of the datasets. For the colour constancy methods, we use two well-known retinex techniques: the multi-scale retinex and the multi-scale retinex with colour restoration for enhancing both original (ORIG) and rotation matrix (ROT) images. We perform experiments on three datasets containing images of animals, from which the first dataset is collected by us and contains aerial images of cows or non-cow backgrounds. To classify the Aerial UAV images, we use a convolutional neural network (CNN) architecture and compare two loss functions (hinge loss and cross-entropy loss). Additionally, we compare the CNN to classical feature-based techniques combined with a k -nearest neighbour classifier or a support vector machine. The best approach is then used to examine the colour constancy DA variants, ORIG and ROT-DA alone for three datasets (Aerial UAV, Bird-600 and Croatia fish). The results show that the rotation matrix data augmentation is very helpful for the Aerial UAV dataset. Furthermore, theABSTRACT: In this paper, we examine a novel data augmentation (DA) method that transforms an image into a new image containing multiple rotated copies of the original image. The DA method creates a grid of cells, in which each cell contains a different randomly rotated image and introduces a natural background in the newly created image. We investigate the use of deep learning to assess the classification performance on the rotation matrix or original dataset with colour constancy versions of the datasets. For the colour constancy methods, we use two well-known retinex techniques: the multi-scale retinex and the multi-scale retinex with colour restoration for enhancing both original (ORIG) and rotation matrix (ROT) images. We perform experiments on three datasets containing images of animals, from which the first dataset is collected by us and contains aerial images of cows or non-cow backgrounds. To classify the Aerial UAV images, we use a convolutional neural network (CNN) architecture and compare two loss functions (hinge loss and cross-entropy loss). Additionally, we compare the CNN to classical feature-based techniques combined with a k -nearest neighbour classifier or a support vector machine. The best approach is then used to examine the colour constancy DA variants, ORIG and ROT-DA alone for three datasets (Aerial UAV, Bird-600 and Croatia fish). The results show that the rotation matrix data augmentation is very helpful for the Aerial UAV dataset. Furthermore, the colour constancy data augmentation is helpful for the Bird-600 dataset. Finally, the results show that the fine-tuned CNNs significantly outperform the CNNs trained from scratch on the Croatia fish and the Bird-600 datasets, and obtain very high accuracies on the Aerial UAV and Bird-600 datasets. … (more)
- Is Part Of:
- Journal of information and telecommunication. Volume 2:Issue 4(2018)
- Journal:
- Journal of information and telecommunication
- Issue:
- Volume 2:Issue 4(2018)
- Issue Display:
- Volume 2, Issue 4 (2018)
- Year:
- 2018
- Volume:
- 2
- Issue:
- 4
- Issue Sort Value:
- 2018-0002-0004-0000
- Page Start:
- 465
- Page End:
- 491
- Publication Date:
- 2018-10-02
- Subjects:
- Image recognition -- data augmentation -- colour constancy -- convolutional neural networks -- feature descriptors
Telecommunication -- Periodicals
Information technology -- Periodicals
621.382 - Journal URLs:
- https://www.tandfonline.com/toc/tjit20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/24751839.2018.1479932 ↗
- Languages:
- English
- ISSNs:
- 2475-1839
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8471.xml