A parallel and efficient approach to large scale clone detection. Issue 6 (25th March 2015)
- Record Type:
- Journal Article
- Title:
- A parallel and efficient approach to large scale clone detection. Issue 6 (25th March 2015)
- Main Title:
- A parallel and efficient approach to large scale clone detection
- Authors:
- Sajnani, Hitesh
Saini, Vaibhav
Lopes, Cristina
Koschke, Rainer
Rilling, Juergen - Abstract:
- <abstract abstract-type="main"> <title>Abstract</title> <p>We propose a new token‐based approach for large ‐scale code clone detection, which is based on a filtering heuristic that reduces the number of token comparisons when the two code blocks are compared. We also present a MapReduce based parallel algorithm that uses the filtering heuristic and scales to thousands of projects. The filtering heuristic is generic and can also be used in conjunction with other token‐based approaches. In that context, we demonstrate how it can increase the retrieval speed and decrease the memory usage of the index‐based approaches. In our experiments on 36 open source Java projects, we found that: (i) filtering reduces token comparisons by a factor of 10, and thus increasing the speed of clone detection by a factor of 1.5; (ii) the speed‐up and scale‐up of the parallel approach using filtering is near‐linear on a cluster of 2–32 nodes for 150–2800 projects; and (iii) filtering decreases the memory usage of index‐based approach by half and the search time by a factor of 5. Copyright © 2015 John Wiley & Sons, Ltd.</p> </abstract>
- Is Part Of:
- Journal of software. Volume 27:Issue 6(2015:Jun.)
- Journal:
- Journal of software
- Issue:
- Volume 27:Issue 6(2015:Jun.)
- Issue Display:
- Volume 27, Issue 6 (2015)
- Year:
- 2015
- Volume:
- 27
- Issue:
- 6
- Issue Sort Value:
- 2015-0027-0006-0000
- Page Start:
- 402
- Page End:
- 429
- Publication Date:
- 2015-03-25
- Subjects:
- Software engineering -- Periodicals
Computer software -- Development -- Periodicals
Software maintenance -- Periodicals
005.1 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2047-7481 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/smr.1707 ↗
- Languages:
- English
- ISSNs:
- 2047-7473
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3757.xml