A hybrid tuple selection pipeline for smartphone based Human Activity Recognition. (1st May 2023)
- Record Type:
- Journal Article
- Title:
- A hybrid tuple selection pipeline for smartphone based Human Activity Recognition. (1st May 2023)
- Main Title:
- A hybrid tuple selection pipeline for smartphone based Human Activity Recognition
- Authors:
- Panja, Ayan Kumar
Rayala, Adityar
Agarwala, Abhay
Neogy, Sarmistha
Chowdhury, Chandreyee - Abstract:
- Abstract: With the increasing use of sensors in almost every device and application worldwide, the volume of data increases exponentially. Instance selection is an important data preprocessing step that can be applied to many machine learning models. It is even crucial for executing machine learning tasks in constrained environments as demanded by many Internet of Things (IoT) enabled applications today such as the Human Activity Recognition (HAR) domain of applications. In this paper, we have proposed a hybrid selection and training pipeline methodology that combines the nearest neighbor concept with evolutionary computing to address the instance selection problem of smartphone sensing-based HAR. A clustering algorithm has been proposed first followed by a Genetic Algorithm based instance selection approach. It is so far the first of its kind in the aforementioned domain of HAR based on smartphone sensing data. The proposed instance selection has been tested on two of the popular benchmark HAR datasets — UCI-HAR and WISDM. The experimented outputs report that the proposed method has effectively reduced the dataset size by around 40% for the benchmark datasets while retaining the recognition accuracy above 94%. This is a clear depiction of the removal of the outliers from the instance set. The proposed approach has also been compared with other state-of-the-art under-sampling approaches and the results show that our approach has performed better. Highlights: Tuple selectionAbstract: With the increasing use of sensors in almost every device and application worldwide, the volume of data increases exponentially. Instance selection is an important data preprocessing step that can be applied to many machine learning models. It is even crucial for executing machine learning tasks in constrained environments as demanded by many Internet of Things (IoT) enabled applications today such as the Human Activity Recognition (HAR) domain of applications. In this paper, we have proposed a hybrid selection and training pipeline methodology that combines the nearest neighbor concept with evolutionary computing to address the instance selection problem of smartphone sensing-based HAR. A clustering algorithm has been proposed first followed by a Genetic Algorithm based instance selection approach. It is so far the first of its kind in the aforementioned domain of HAR based on smartphone sensing data. The proposed instance selection has been tested on two of the popular benchmark HAR datasets — UCI-HAR and WISDM. The experimented outputs report that the proposed method has effectively reduced the dataset size by around 40% for the benchmark datasets while retaining the recognition accuracy above 94%. This is a clear depiction of the removal of the outliers from the instance set. The proposed approach has also been compared with other state-of-the-art under-sampling approaches and the results show that our approach has performed better. Highlights: Tuple selection pipeline for Human Activity Recognition in constrained environments. Combination of unsupervised approach with Genetic Algorithm for Instance selection. 40% reduction of instances while retaining accuracy above 94%. Proposed pipeline performs better than state-of-the art undersampling approaches. Hardness property of instances is found to decrease. … (more)
- Is Part Of:
- Expert systems with applications. Volume 217(2023)
- Journal:
- Expert systems with applications
- Issue:
- Volume 217(2023)
- Issue Display:
- Volume 217, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 217
- Issue:
- 2023
- Issue Sort Value:
- 2023-0217-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-05-01
- Subjects:
- Instance selection -- Data reduction -- Evolutionary Computing -- Genetic Algorithm -- Distance based approach -- K-D Tree -- Undersampling
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2023.119536 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25731.xml