Robust active learning for the diagnosis of parasites. Issue 11 (November 2015)
- Record Type:
- Journal Article
- Title:
- Robust active learning for the diagnosis of parasites. Issue 11 (November 2015)
- Main Title:
- Robust active learning for the diagnosis of parasites
- Authors:
- Saito, Priscila T.M.
Suzuki, Celso T.N.
Gomes, Jancarlo F.
de Rezende, Pedro J.
Falcão, Alexandre X. - Abstract:
- Abstract: We have developed an automated system for the diagnosis of intestinal parasites from optical microscopy images. The objects (species of parasites and impurities) segmented from these images form a large dataset. We are interested in the active learning problem of selecting a reasonably small number of objects to be labeled under an expert׳s supervision for use in training a pattern classifier. However, impurities are very numerous, constitute several clusters in the feature space, and can be quite similar to some species of parasites, leading to a significant challenge for active learning methods. We propose a technique that pre-organizes the data and then properly balances the selection of samples from all classes and uncertain samples for training. Early data organization avoids reprocessing of the large dataset at each learning iteration, enabling the halting of sample selection after a desired number of samples per iteration, yielding interactive response time. We validate our method by comparing it with state-of-the-art approaches, using a previously labeled dataset of almost 6000 objects. Moreover, we report results from experiments on a very realistic scenario, consisting of a dataset with over 140, 000 unlabeled objects, under unbalanced classes, the absence of some classes, and the presence of a very large set of impurities. Abstract : Highlights: A robust active learning method, called RDS, based on a priori data organization. RDS properly balances sampleAbstract: We have developed an automated system for the diagnosis of intestinal parasites from optical microscopy images. The objects (species of parasites and impurities) segmented from these images form a large dataset. We are interested in the active learning problem of selecting a reasonably small number of objects to be labeled under an expert׳s supervision for use in training a pattern classifier. However, impurities are very numerous, constitute several clusters in the feature space, and can be quite similar to some species of parasites, leading to a significant challenge for active learning methods. We propose a technique that pre-organizes the data and then properly balances the selection of samples from all classes and uncertain samples for training. Early data organization avoids reprocessing of the large dataset at each learning iteration, enabling the halting of sample selection after a desired number of samples per iteration, yielding interactive response time. We validate our method by comparing it with state-of-the-art approaches, using a previously labeled dataset of almost 6000 objects. Moreover, we report results from experiments on a very realistic scenario, consisting of a dataset with over 140, 000 unlabeled objects, under unbalanced classes, the absence of some classes, and the presence of a very large set of impurities. Abstract : Highlights: A robust active learning method, called RDS, based on a priori data organization. RDS properly balances sample diversity and uncertainty for useful sample selection. It provides high classification accuracy for the automated diagnosis of parasites. Comparisons with different clustering, classification and other literature methods. RDS was evaluated by an experienced expert in parasitology using a realistic scenario. … (more)
- Is Part Of:
- Pattern recognition. Volume 48:Issue 11(2015:Nov.)
- Journal:
- Pattern recognition
- Issue:
- Volume 48:Issue 11(2015:Nov.)
- Issue Display:
- Volume 48, Issue 11 (2015)
- Year:
- 2015
- Volume:
- 48
- Issue:
- 11
- Issue Sort Value:
- 2015-0048-0011-0000
- Page Start:
- 3572
- Page End:
- 3583
- Publication Date:
- 2015-11
- Subjects:
- Active learning -- Pattern recognition -- Automated diagnosis of intestinal parasites -- Microscopy image analysis -- Optimum-path forest classifiers
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2015.05.020 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20959.xml