Large‐scale inter‐system clone detection using suffix trees and hashing. Issue 8 (10th February 2013)
- Record Type:
- Journal Article
- Title:
- Large‐scale inter‐system clone detection using suffix trees and hashing. Issue 8 (10th February 2013)
- Main Title:
- Large‐scale inter‐system clone detection using suffix trees and hashing
- Authors:
- Koschke, Rainer
Mens, T.
Cleve, A.
- Abstract:
- Detecting similar code between two systems has various applications, such as comparing two software variants or versions or finding potential license violations. Techniques that detect suspiciously similar code must scale to very large code corpora in terms of the resources needed, and they need high precision because a human has to inspect the results. This paper demonstrates how suffix trees can be used to obtain a scalable comparison. The evaluation is carried out on very large code corpora. Our evaluation shows that our approach is faster than index‐based techniques when the analysis is run only once. If the analysis is to be conducted multiple times, creating an index pays off. We report how much code can be filtered out from the analysis using an index‐based filter. In addition, this paper proposes a method to improve precision through user feedback. A user validates a sample of the clone candidates found. An automated data mining technique learns a decision tree on the basis of the user decisions and different code metrics. We investigate the relevance of several metrics and whether criteria learned from one application domain can be generalized to other domains. Copyright © 2013 John Wiley & Sons, Ltd.
- Is Part Of:
- Journal of software. Volume 26:Issue 8(2014:Aug.)
- Journal:
- Journal of software
- Issue:
- Volume 26:Issue 8(2014:Aug.)
- Issue Display:
- Volume 26, Issue 8 (2014)
- Year:
- 2014
- Volume:
- 26
- Issue:
- 8
- Issue Sort Value:
- 2014-0026-0008-0000
- Page Start:
- 747
- Page End:
- 769
- Publication Date:
- 2013-02-10
- Subjects:
- Software engineering -- Periodicals
Computer software -- Development -- Periodicals
Software maintenance -- Periodicals
005.1
- Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2047-7481
http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/smr.1592
- Languages:
- English
- ISSNs:
- 2047-7473
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 3449.xml