Query expansion based on statistical learning from code changes. (27th February 2018)
- Record Type:
- Journal Article
- Title:
- Query expansion based on statistical learning from code changes. (27th February 2018)
- Main Title:
- Query expansion based on statistical learning from code changes
- Authors:
- Huang, Qing
Yang, Yangrui
Zhan, Xue
Wan, Hongyan
Wu, Guoqing - Abstract:
- Summary: Thesaurus‐based, code‐related, and software‐specific query expansion techniques are the main contributions in free‐form query search. However, these techniques still could not put the most relevant query result in the first position because they lack the ability to infer the expansion words that represent the user needs based on a given query. In this paper, we discover that code changes can imply what users want and propose a novel query expansion technique with code changes (QECC). It exploits (changes, contexts) pairs from changed methods. On the basis of statistical learning from pairs, it can infer code changes for a given query. In this way, it expands a query with code changes and recommends the query results that meet actual needs perfectly. In addition, we implement InstaRec to perform QECC and evaluate it with 195 039 change commits from GitHub and our code tracker. The results show that QECC can improve the precision of 3 code search algorithms (ie, IR, Portfolio, and VF) by up to 52% to 62% and outperform the state‐of‐the‐art query expansion techniques (ie, query expansion based on crowd knowledge and CodeHow) by 13% to 16% when the top 1 result is inspected.
- Is Part Of:
- Software, practice & experience. Volume 48:Number 7(2018)
- Journal:
- Software, practice & experience
- Issue:
- Volume 48:Number 7(2018)
- Issue Display:
- Volume 48, Issue 7 (2018)
- Year:
- 2018
- Volume:
- 48
- Issue:
- 7
- Issue Sort Value:
- 2018-0048-0007-0000
- Page Start:
- 1333
- Page End:
- 1351
- Publication Date:
- 2018-02-27
- Subjects:
- code changes -- code search -- information retrieval -- software reuse -- statistical learning -- query expansion
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/spe.2574 ↗
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 6875.xml