A novel community answer matching approach based on phrase fusion heterogeneous information network. Issue 1 (January 2021)
- Record Type:
- Journal Article
- Title:
- A novel community answer matching approach based on phrase fusion heterogeneous information network. Issue 1 (January 2021)
- Main Title:
- A novel community answer matching approach based on phrase fusion heterogeneous information network
- Authors:
- Wu, Yongliang
Zhao, Shuliang
Guo, Ruiqiang - Abstract:
- Highlights: To the best of our knowledge, it is the first work to propose the phrase information network and employ it to construct a fusion heterogeneous information network (HIN) to represent complex entity relationships in community question answering (CQA). We define the distance of entities with the same or different types in HIN and propose a novel Type-constrained Top-k similarity entity finding algorithm (TTSEF) for answer selection, which innovatively combines entity attributes and semantic features to achieve answer selection in CQA. Abundant experimental demonstrate that proposed algorithm precedes the state-of-the-art similar entity matching methods in CQA. A meta-path analysis of the optimal matching answers proves that phrase can serve as a bridge to connect different types of entities in CQA effectively. Abstract: Community Question Answering (CQA) allows users to ask or answer questions in a social way, so it is becoming the primary means for people acquiring knowledge. However, the asker must wait until a satisfactory answer appears, which reduces user activity. In this paper, we propose an innovative answering method that matches the most relevant answers for the new issue automatically. Firstly, we utilize phrases to represent the semantic of the posts (answers/questions) and construct a Phrase Fusion Heterogeneous Information Network, called PFHIN, to represent complex entity relationships in CQA. So, the answer selection is regarded as the related entityHighlights: To the best of our knowledge, it is the first work to propose the phrase information network and employ it to construct a fusion heterogeneous information network (HIN) to represent complex entity relationships in community question answering (CQA). We define the distance of entities with the same or different types in HIN and propose a novel Type-constrained Top-k similarity entity finding algorithm (TTSEF) for answer selection, which innovatively combines entity attributes and semantic features to achieve answer selection in CQA. Abundant experimental demonstrate that proposed algorithm precedes the state-of-the-art similar entity matching methods in CQA. A meta-path analysis of the optimal matching answers proves that phrase can serve as a bridge to connect different types of entities in CQA effectively. Abstract: Community Question Answering (CQA) allows users to ask or answer questions in a social way, so it is becoming the primary means for people acquiring knowledge. However, the asker must wait until a satisfactory answer appears, which reduces user activity. In this paper, we propose an innovative answering method that matches the most relevant answers for the new issue automatically. Firstly, we utilize phrases to represent the semantic of the posts (answers/questions) and construct a Phrase Fusion Heterogeneous Information Network, called PFHIN, to represent complex entity relationships in CQA. So, the answer selection is regarded as the related entity retrieval task. Then, we define the distance between entities in PFHIN, which is independent of the meta path. Finally, the Type-constrained Top-k Similarity Entity Finding Algorithm (TTSEF) is proposed for finding the nearest entities according to the known start entity and end-entity type, which can match the most relevant answers automatically.To the best of our knowledge, it is the first work to define the phrase information network for answer selection and provide a novel idea for the heterogeneous information network fusion. Experimental results on three large-scale datasets (Stack Overflow, Super User, and Mathematics) from Stack Exchange demonstrate that our proposed approaches significantly outperform the state-of-the-art answer retrieval methods. Moreover, we conduct an in-depth analysis of the meta path to the optimal answer and reveal the critical role of phrases in community answer matching. … (more)
- Is Part Of:
- Information processing & management. Volume 58:Issue 1(2021)
- Journal:
- Information processing & management
- Issue:
- Volume 58:Issue 1(2021)
- Issue Display:
- Volume 58, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 58
- Issue:
- 1
- Issue Sort Value:
- 2021-0058-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-01
- Subjects:
- Community question answering -- Heterogeneous information network fusion -- Phrase embedding -- Related entity matching
Information storage and retrieval systems -- Periodicals
Information science -- Periodicals
Systèmes d'information -- Périodiques
Sciences de l'information -- Périodiques
Information science
Information storage and retrieval systems
Periodicals
658.4038 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064573 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ipm.2020.102408 ↗
- Languages:
- English
- ISSNs:
- 0306-4573
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4493.893000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14930.xml