A collaborative method for code clone detection using a deep learning model. (December 2022)
- Record Type:
- Journal Article
- Title:
- A collaborative method for code clone detection using a deep learning model. (December 2022)
- Main Title:
- A collaborative method for code clone detection using a deep learning model
- Authors:
- Karthik, S.
Rajdeepa, B. - Abstract:
- Highlights: Code Cloning (CC) is the process of copying and reconfiguring a code fragment from another part of software. As a result, CC detection (CCD) seems to to be an active research concept on the software side. Detecting Large-Variance (LV-CC) became a considerable struggle as the number of Lines of Code (LOC) in source code expanded. This model then uses back-propagation in the following element to modify the weight matrix based on the training set. The simulation results reveal that the proposed method efficiently resolves distances-based problems than the classical methods for the CCD. Abstract: Code cloning (CC) is the process of copying and reconfiguring a code fragment and using it in another part of a software project. This clones increases the running overhead of the software. As a result, Code Clone Detection (CCD) has become an active research area in software development research. The detection of Large-Variance Code Clones (LV-CCs) is very difficult when the lines of codes (LOCs) in the source code are very large. The distance metrics have been used in LV-CC detection by calculating the distance between training feature sets of source codes and testing feature sets of source codes. However, threshold selection for detecting clones is a challenging issue in distance-based LV-CC detection. To solve this, a Collaborative CCD using Deep Learning (CCCD-DL) is developed in this paper by utilising lexical, syntactic, semantic and structural features forHighlights: Code Cloning (CC) is the process of copying and reconfiguring a code fragment from another part of software. As a result, CC detection (CCD) seems to to be an active research concept on the software side. Detecting Large-Variance (LV-CC) became a considerable struggle as the number of Lines of Code (LOC) in source code expanded. This model then uses back-propagation in the following element to modify the weight matrix based on the training set. The simulation results reveal that the proposed method efficiently resolves distances-based problems than the classical methods for the CCD. Abstract: Code cloning (CC) is the process of copying and reconfiguring a code fragment and using it in another part of a software project. This clones increases the running overhead of the software. As a result, Code Clone Detection (CCD) has become an active research area in software development research. The detection of Large-Variance Code Clones (LV-CCs) is very difficult when the lines of codes (LOCs) in the source code are very large. The distance metrics have been used in LV-CC detection by calculating the distance between training feature sets of source codes and testing feature sets of source codes. However, threshold selection for detecting clones is a challenging issue in distance-based LV-CC detection. To solve this, a Collaborative CCD using Deep Learning (CCCD-DL) is developed in this paper by utilising lexical, syntactic, semantic and structural features for identifying all types of clones together. A lexical feature is extracted from Clone Pairs (CPs) identified by LV-Mapper. Syntactic and semantic features are identified by the Abstract Syntax Tree (AST) and Control Flow Graph (CFG). The structural features are extracted by code size metrics (CZMs) and object-oriented metrics (OOMs). All features are coordinated and fed into the input layer of DNN. The hidden layer then transforms the inputs into the neural vertices in the multi-classification stage using linear transformation preceded by suppressing non-linearity. This process can generate a complicated and non-hypothetical prototype with a weight matrix for fitting the training sequence. Thus, the feed-forward step has been successfully completed. This model then uses back-propagation in the following element to modify the weight matrix based on the training set. Finally, a softmax layer converts the clone detection task into a classification process. The results of the experiments show that the proposed method solves distance-based problems more quickly and effectively than the traditional methods for the CCD. … (more)
- Is Part Of:
- Advances in engineering software. Volume 174(2022)
- Journal:
- Advances in engineering software
- Issue:
- Volume 174(2022)
- Issue Display:
- Volume 174, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 174
- Issue:
- 2022
- Issue Sort Value:
- 2022-0174-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12
- Subjects:
- Code cloning -- LV-Mapper -- Deep learning -- Feed-forward step -- Softmax layer
Computer-aided engineering -- Periodicals
Engineering -- Computer programs -- Periodicals
Engineering -- Software -- Periodicals
Periodicals
620.0028553 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09659978 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.advengsoft.2022.103327 ↗
- Languages:
- English
- ISSNs:
- 0965-9978
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0705.450000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24217.xml