A semi-supervised learning approach for model selection based on class-hypothesis testing. (30th December 2017)
- Record Type:
- Journal Article
- Title:
- A semi-supervised learning approach for model selection based on class-hypothesis testing. (30th December 2017)
- Main Title:
- A semi-supervised learning approach for model selection based on class-hypothesis testing
- Authors:
- Gorriz, Juan M.
Ramirez, Javier
Suckling, John
Martinez-Murcia, F.J.
Illán, I.A.
Segovia, F.
Ortiz, A.
Salas-González, D.
Castillo-Barnés, D.
Puntonet, C.G. - Abstract:
- Highlights: A model selection method is proposed by hypothesis testing and feature extraction. Partial least squares is applied to obtain the extended datasets. The model selection is performed by means of a Likelihood ratio test. Experiments were carried out on several databases yielding a clear improvement. Abstract: This paper deals with the topic of learning from unlabeled or noisy-labeled data in the context of a classification problem. In the classification problem the outcome yields one of a discrete set of values thus, assumptions on them could be established to obtain the most likely prediction model at the training stage . In this paper, a novel case-based model selection method is proposed, which combines hypothesis testing from a discrete set of expected outcomes and feature extraction within a cross-validated classification stage. This wrapper-type procedure acts on fully-observable variables under hypothesis-testing and improves the classification accuracy on the test set, or keeps its performance at least at the level of the statistical classifier. The model selection strategy in the cross validation loop allows building an ensemble classifier that could improve the performance of any expert and intelligence system, particularly on small sample-size datasets. Experiments were carried out on several databases yielding a clear improvement on the baseline, i.e., SPECT dataset A c c = 86.35 ± 1.51, with S e n = 91.10 ± 2.77, and S p e = 81.11 ± 1.61 . In addition,Highlights: A model selection method is proposed by hypothesis testing and feature extraction. Partial least squares is applied to obtain the extended datasets. The model selection is performed by means of a Likelihood ratio test. Experiments were carried out on several databases yielding a clear improvement. Abstract: This paper deals with the topic of learning from unlabeled or noisy-labeled data in the context of a classification problem. In the classification problem the outcome yields one of a discrete set of values thus, assumptions on them could be established to obtain the most likely prediction model at the training stage . In this paper, a novel case-based model selection method is proposed, which combines hypothesis testing from a discrete set of expected outcomes and feature extraction within a cross-validated classification stage. This wrapper-type procedure acts on fully-observable variables under hypothesis-testing and improves the classification accuracy on the test set, or keeps its performance at least at the level of the statistical classifier. The model selection strategy in the cross validation loop allows building an ensemble classifier that could improve the performance of any expert and intelligence system, particularly on small sample-size datasets. Experiments were carried out on several databases yielding a clear improvement on the baseline, i.e., SPECT dataset A c c = 86.35 ± 1.51, with S e n = 91.10 ± 2.77, and S p e = 81.11 ± 1.61 . In addition, the CV error estimate for the classifier under our approach was found to be an almost unbiased estimate (as the baseline approach) of the true error that the classifier would incur on independent data. … (more)
- Is Part Of:
- Expert systems with applications. Volume 90(2017)
- Journal:
- Expert systems with applications
- Issue:
- Volume 90(2017)
- Issue Display:
- Volume 90, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 90
- Issue:
- 2017
- Issue Sort Value:
- 2017-0090-2017-0000
- Page Start:
- 40
- Page End:
- 49
- Publication Date:
- 2017-12-30
- Subjects:
- Statistical learning and decision theory -- Semi-supervised learning -- Support vector machines (SVM) -- Hypothesis testing -- Partial least squares
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2017.08.006 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4633.xml