A feature location approach supported by time-aware weighting of terms associated with developer expertise profiles. Issue 2 (November 2016)
- Record Type:
- Journal Article
- Title:
- A feature location approach supported by time-aware weighting of terms associated with developer expertise profiles. Issue 2 (November 2016)
- Main Title:
- A feature location approach supported by time-aware weighting of terms associated with developer expertise profiles
- Authors:
- Zamani, Sima
Lee, Sai
Shokripour, Ramin
Anvik, John - Abstract:
- Abstract Feature location is a frequent software maintenance activity that aims to identify initial source code location pertinent to a software feature. Most of feature location approaches are based, at least in part, on text analysis methods which originate from the natural language context. However, the natural language context and the text data in software repositories have different properties that reveal the need for adaption of the methods to apply in the context of software repositories. One of the differences is the existence of a set of metadata, such as developer information and time stamp, which is associated with the data in the repositories. However, this difference has not been fully considered in previous feature location research studies. This study proposes a feature location approach that analyzes developer expertise profiles, which contain source code entities modified by the associated software developers, to identify the most similar location pertinent to a desired feature. This approach uses a time-aware term-weighting technique to determine the similarity. An experimental evaluation on four open-source projects shows an improvement in the accuracy, performance, and effectiveness up to 55, 39, and 29 %, respectively, compared to the high-performing information retrieval methods used in feature location. Moreover, the proposed time-aware technique increases the accuracy, performance, and effectiveness of the typical term-weighting technique, tf-idf, asAbstract Feature location is a frequent software maintenance activity that aims to identify initial source code location pertinent to a software feature. Most of feature location approaches are based, at least in part, on text analysis methods which originate from the natural language context. However, the natural language context and the text data in software repositories have different properties that reveal the need for adaption of the methods to apply in the context of software repositories. One of the differences is the existence of a set of metadata, such as developer information and time stamp, which is associated with the data in the repositories. However, this difference has not been fully considered in previous feature location research studies. This study proposes a feature location approach that analyzes developer expertise profiles, which contain source code entities modified by the associated software developers, to identify the most similar location pertinent to a desired feature. This approach uses a time-aware term-weighting technique to determine the similarity. An experimental evaluation on four open-source projects shows an improvement in the accuracy, performance, and effectiveness up to 55, 39, and 29 %, respectively, compared to the high-performing information retrieval methods used in feature location. Moreover, the proposed time-aware technique increases the accuracy, performance, and effectiveness of the typical term-weighting technique, tf-idf, as much as 15, 11, and 13 %, respectively. Finally, the proposed approach outperforms our previous approach, noun-based feature location, as much as 17 %. These experimental results demonstrate that time-aware analysis of developers' expertise significantly improves the feature location process. … (more)
- Is Part Of:
- Knowledge and information systems. Volume 49:Issue 2(2016:Nov.)
- Journal:
- Knowledge and information systems
- Issue:
- Volume 49:Issue 2(2016:Nov.)
- Issue Display:
- Volume 49, Issue 2 (2016)
- Year:
- 2016
- Volume:
- 49
- Issue:
- 2
- Issue Sort Value:
- 2016-0049-0002-0000
- Page Start:
- 629
- Page End:
- 659
- Publication Date:
- 2016-11
- Subjects:
- Mining software repositories -- Text analysis -- Term weighting -- Time aware -- Developer expertise
Expert systems (Computer science) -- Periodicals
Information storage and retrieval systems -- Periodicals
006.33 - Journal URLs:
- http://link.springer-ny.com/link/service/journals/10115/index.htm ↗
http://www.springerlink.com/content/0219-1377 ↗
http://www.springer.com/gb/ ↗ - DOI:
- 10.1007/s10115-015-0909-5 ↗
- Languages:
- English
- ISSNs:
- 0219-1377
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5100.437300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9955.xml