Revisiting the Challenges and Opportunities in Software Plagiarism Detection

Loading...
Thumbnail Image

Access rights

openAccess

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Date

2020-02

Major/Subject

Mcode

Degree programme

Language

en

Pages

5
537-541

Series

SANER 2020 - Proceedings of the 2020 IEEE 27th International Conference on Software Analysis, Evolution, and Reengineering

Abstract

Software plagiarism seriously impedes the healthy development of open source software. To fight against code obfuscation and inherent non-determinism of thread scheduling applied against software plagiarism detection, we proposed a new dynamic birthmark called DYnamic Key Instruction Sequence (DYKIS) and a framework called Thread-oblivious dynamic Birthmark (TOB) for the purpose of reviving the existing birthmarks and a thread-aware dynamic birthmark called Thread-related System call Birthmark (TreSB). Though many approaches have been proposed for software plagiarism detection, they are still limited to satisfy the following highly desired requirements: the applicability to handle binary, the capability to detect partial plagiarism, the resiliency to code obfuscation, the interpretability on detection results, and the scalability to process large-scale software. In this position paper, we discuss and outline the research opportunities and challenges in the field of software plagiarism detection in order to stimulate brilliant innovations and direct our future research efforts.

Description

Keywords

binary code similarity, software birthmark, software plagiarism detection, source code similarity

Other note

Citation

Xu, X, Fan, M, Jia, A, Wang, Y, Yan, Z, Zheng, Q & Liu, T 2020, Revisiting the Challenges and Opportunities in Software Plagiarism Detection . in K Kontogiannis, F Khomh, A Chatzigeorgiou, M-E Fokaefs & M Zhou (eds), SANER 2020 - Proceedings of the 2020 IEEE 27th International Conference on Software Analysis, Evolution, and Reengineering ., 9054847, IEEE, pp. 537-541, IEEE International Conference on Software Analysis, Evolution, and Reengineering, London, Canada, 18/02/2020 . https://doi.org/10.1109/SANER48275.2020.9054847