A New Software Birthmark based on Weight Sequences of Dynamic Control Flow Graph for Plagiarism Detection. (1st June 2018)
- Record Type:
- Journal Article
- Title:
- A New Software Birthmark based on Weight Sequences of Dynamic Control Flow Graph for Plagiarism Detection. (1st June 2018)
- Main Title:
- A New Software Birthmark based on Weight Sequences of Dynamic Control Flow Graph for Plagiarism Detection
- Authors:
- Yuan, Baoguo
Wang, Junfeng
Fang, Zhiyang
Qi, Li - Abstract:
- Abstract: With the large-scale development of open source software, software plagiarism has become a serious threat to software industry and intellectual property. As the latest technique of plagiarism detection, dynamic software birthmark has attracted much attention in recent years. Most of the existing dynamic birthmarks focus on how to resist obfuscation techniques such as compiler optimizations and strong obfuscations implemented in tools. However, they pay little attention to packers, especially encryption packer which is commonly used in software protection as well as plagiarism. When used to encrypt software, the decryption code is added to the binary. It is hard to distinguish the original parts of software from the decryption parts using traditional dynamic birthmarks. In this paper, we propose a new dynamic software birthmark called weight sequences birthmark (WSB) which is based on weight sequences of dynamic control flow graph (DCFG). The weight sequences are used as characteristics, which make full use of the different patterns of dynamic basic block replications between the original code and the decryption code. Compared with the-state-of-art dynamic key instruction sequence birthmark (DKISB), the new birthmark can resist encryption packer effectively. Furthermore, WSB shows better credibility than DKISB when distinguishing independent programs. The comprehensive experiments illustrate that the value of extended F -measure can reach 96.8%, indicating that itAbstract: With the large-scale development of open source software, software plagiarism has become a serious threat to software industry and intellectual property. As the latest technique of plagiarism detection, dynamic software birthmark has attracted much attention in recent years. Most of the existing dynamic birthmarks focus on how to resist obfuscation techniques such as compiler optimizations and strong obfuscations implemented in tools. However, they pay little attention to packers, especially encryption packer which is commonly used in software protection as well as plagiarism. When used to encrypt software, the decryption code is added to the binary. It is hard to distinguish the original parts of software from the decryption parts using traditional dynamic birthmarks. In this paper, we propose a new dynamic software birthmark called weight sequences birthmark (WSB) which is based on weight sequences of dynamic control flow graph (DCFG). The weight sequences are used as characteristics, which make full use of the different patterns of dynamic basic block replications between the original code and the decryption code. Compared with the-state-of-art dynamic key instruction sequence birthmark (DKISB), the new birthmark can resist encryption packer effectively. Furthermore, WSB shows better credibility than DKISB when distinguishing independent programs. The comprehensive experiments illustrate that the value of extended F -measure can reach 96.8%, indicating that it is a high-quality birthmark which satisfies both the credibility and the resiliency. … (more)
- Is Part Of:
- Computer journal. Volume 61:Number 8(2018)
- Journal:
- Computer journal
- Issue:
- Volume 61:Number 8(2018)
- Issue Display:
- Volume 61, Issue 8 (2018)
- Year:
- 2018
- Volume:
- 61
- Issue:
- 8
- Issue Sort Value:
- 2018-0061-0008-0000
- Page Start:
- 1202
- Page End:
- 1215
- Publication Date:
- 2018-06-01
- Subjects:
- plagiarism detection -- dynamic software birthmark -- weighted DCFG -- weight sequences
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxy055 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12141.xml