CroLSSim: Cross‐language software similarity detector using hybrid approach of LSA‐based AST‐MDrep features and CNN‐LSTM model. Issue 9 (9th January 2022)

Record Type:: Journal Article
Title:: CroLSSim: Cross‐language software similarity detector using hybrid approach of LSA‐based AST‐MDrep features and CNN‐LSTM model. Issue 9 (9th January 2022)
Main Title:: CroLSSim: Cross‐language software similarity detector using hybrid approach of LSA‐based AST‐MDrep features and CNN‐LSTM model
Authors:: Ullah, Farhan
Naeem, Muhammad Rashid
Naeem, Hamad
Cheng, Xiaochun
Alazab, Mamoun
Abstract:: Abstract: Software similarity in different programming codes is a rapidly evolving field because of its numerous applications in software development, software cloning, software plagiarism, and software forensics. Currently, software researchers and developers search cross‐language open‐source repositories for similar applications for a variety of reasons, such as reusing programming code, analyzing different implementations, and looking for a better application. However, it is a challenging task because each programming language has a unique syntax and semantic structure. In this paper, a novel tool called Cross‐Language Software Similarity (CroLSSim) is designed to detect similar software applications written in different programming codes. First, the Abstract Syntax Tree (AST) features are collected from different programming codes. These are high‐quality features that can show the abstract view of each program. Then, Methods Description (MDrep) in combination with AST is used to examine the relationship among different method calls. Second, the Term Frequency Inverse Document Frequency approach is used to retrieve the local and global weights from AST‐MDrep features. Third, the Latent Semantic Analysis‐based features extraction and selection method is proposed to extract the semantic anchors in reduced dimensional space. Fourth, the Convolution Neural Network (CNN)‐based features extraction method is proposed to mine the deep features. Finally, a hybrid deep learning … (more)
Is Part Of:: International journal of intelligent systems. Volume 37:Issue 9(2022)
Journal:: International journal of intelligent systems
Issue:: Volume 37:Issue 9(2022)
Issue Display:: Volume 37, Issue 9 (2022)
Year:: 2022
Volume:: 37
Issue:: 9
Issue Sort Value:: 2022-0037-0009-0000
Page Start:: 5768
Page End:: 5795
Publication Date:: 2022-01-09
Subjects:: abstract syntax tree -- data mining -- deep learning -- latent semantic analysis -- software similarity
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
006.3
Journal URLs:: http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-111X ↗
https://www.hindawi.com/journals/ijis ↗
http://onlinelibrary.wiley.com/ ↗
DOI:: 10.1002/int.22813 ↗
Languages:: English
ISSNs:: 0884-8173
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 4542.310500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 22798.xml