Symbolic sequence representation with Markovian state optimization. (November 2022)
- Record Type:
- Journal Article
- Title:
- Symbolic sequence representation with Markovian state optimization. (November 2022)
- Main Title:
- Symbolic sequence representation with Markovian state optimization
- Authors:
- Chen, Lifei
Wu, Haiyan
Kang, Wenxuan
Wang, Shengrui - Abstract:
- Highlights: The first effort on the HMM state optimization problem, i.e., optimizing the number of states and the discriminative quality of the states itself. New representation model for symbolic sequences using their transition probability distributions over the optimized HMM states called topics. Formalization of the hierarchical model selection problem for topic learning with a novel topic-scatter criterion. Learning the underlying topics by a newly defined HMM state clustering algorithm. Experimental evaluation on human activity recognition and protein recognition with comparisons to the neural network-based auto-encoder. Abstract: Sequence representation, which is aimed at embedding sequentially symbolic data in a real space, is a foundational task in sequence pattern recognition. It is a difficult problem due to the challenges entailed in learning the intrinsic structural features within sequences in small sample size cases, in an unsupervised way. In this paper, we propose to represent each symbolic sequence by its transition probability distribution over discriminating topics, formalized by a set of optimized Hidden Markov Model (HMM) states shared by all sequences. An efficient method, called Markovian state clustering with hierarchical model selection, is proposed to optimize the Markovian states and to adaptively determine the number of topics. The proposed method is experimentally evaluated on human activity recognition and protein recognition, and resultsHighlights: The first effort on the HMM state optimization problem, i.e., optimizing the number of states and the discriminative quality of the states itself. New representation model for symbolic sequences using their transition probability distributions over the optimized HMM states called topics. Formalization of the hierarchical model selection problem for topic learning with a novel topic-scatter criterion. Learning the underlying topics by a newly defined HMM state clustering algorithm. Experimental evaluation on human activity recognition and protein recognition with comparisons to the neural network-based auto-encoder. Abstract: Sequence representation, which is aimed at embedding sequentially symbolic data in a real space, is a foundational task in sequence pattern recognition. It is a difficult problem due to the challenges entailed in learning the intrinsic structural features within sequences in small sample size cases, in an unsupervised way. In this paper, we propose to represent each symbolic sequence by its transition probability distribution over discriminating topics, formalized by a set of optimized Hidden Markov Model (HMM) states shared by all sequences. An efficient method, called Markovian state clustering with hierarchical model selection, is proposed to optimize the Markovian states and to adaptively determine the number of topics. The proposed method is experimentally evaluated on human activity recognition and protein recognition, and results obtained demonstrate its effectiveness and efficiency. … (more)
- Is Part Of:
- Pattern recognition. Volume 131(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 131(2022)
- Issue Display:
- Volume 131, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 131
- Issue:
- 2022
- Issue Sort Value:
- 2022-0131-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-11
- Subjects:
- Sequence representation -- Hidden Markov model -- State clustering -- Hierarchical model selection -- Activity recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.108849 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22669.xml