SMT: Sparse multivariate tree. (30th December 2013)
- Record Type:
- Journal Article
- Title:
- SMT: Sparse multivariate tree. (30th December 2013)
- Main Title:
- SMT: Sparse multivariate tree
- Authors:
- Deng, Houtao
Baydogan, Mustafa Gokce
Runger, George - Abstract:
- <abstract abstract-type="main" xml:lang="en"> <title>Abstract</title> <p>A multivariate decision tree attempts to improve upon the single variable split in a traditional tree. With the increase in datasets with many features and a small number of labeled instances in a variety of domains (bioinformatics, text mining, etc.), a traditional tree‐based approach with a greedy variable selection at a node may omit important information. Therefore, the recursive partitioning idea of a simple decision tree combined with the intrinsic feature selection of <italic>L</italic><sub>1</sub> regularized logistic regression (LR) at each node is a natural choice for a multivariate tree model that is simple, but broadly applicable. This natural solution leads to the sparse multivariate tree (SMT) considered here. SMT can naturally handle non‐time‐series data and is extended to handle time‐series classification problems with the power of extracting interpretable temporal patterns (e.g., means, slopes, and deviations). Binary <italic>L</italic><sub>1</sub> regularized LR models are used here for binary classification problems. However, SMT may be extended to solve multiclass problems with multinomial LR models. The accuracy and computational efficiency of SMT is compared to a large number of competitors on time series and non‐time‐series data. © 2013 Wiley Periodicals, Inc. Statistical Analysis and Data Mining, 2013</p> </abstract>
- Is Part Of:
- Statistical analysis and data mining. Volume 7:Number 1(2014)
- Journal:
- Statistical analysis and data mining
- Issue:
- Volume 7:Number 1(2014)
- Issue Display:
- Volume 7, Issue 1 (2014)
- Year:
- 2014
- Volume:
- 7
- Issue:
- 1
- Issue Sort Value:
- 2014-0007-0001-0000
- Page Start:
- 53
- Page End:
- 69
- Publication Date:
- 2013-12-30
- Subjects:
- Data mining -- Statistical methods -- Periodicals
006.312 - Journal URLs:
- http://www3.interscience.wiley.com/journal/112701062/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/sam.11208 ↗
- Languages:
- English
- ISSNs:
- 1932-1864
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8447.424100
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3305.xml