SPPlagiarise: A tool for generating simulated semantics-preserving plagiarism of java source code

No Thumbnail Available
File version
Author(s)
Cheers, H
Lin, Y
Smith, SP
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2019
Size
File type(s)
Location

Beijing, China

License
Abstract

Source code plagiarism is a common occurrence in undergraduate computer science education. Studies have indicated at least 50% of students plagiarize during their undergraduate career. To identity cases of source code plagiarism, many source code plagiarism detection tools have been proposed. However, conclusively determining the effectiveness these tools at identifying cases of source code plagiarism is difficult. Evaluations are typically performed using unreleased data sets. Without a comprehensive publicly available data set for source code plagiarism detection evaluation, it is difficult to perform an unbiased and reproducible evaluations of tools. To address this problem, this paper presents a tool, SPPlagiarise, which is designed to produce simulated source code plagiarism of Java source code. SPPlagiarise applies a random number of semantics-preserving source code obfuscations at random locations to a Java code base to simulate source code plagiarism. In this paper the design of the tool and an evaluation of a generated plagiarism data set is presented.

Journal Title
Conference Title

Proceedings of the IEEE International Conference on Software Engineering and Service Sciences, ICSESS

Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation

Cheers, H; Lin, Y; Smith, SP, SPPlagiarise: A tool for generating simulated semantics-preserving plagiarism of java source code, Proceedings of the IEEE International Conference on Software Engineering and Service Sciences, ICSESS, 2019, pp. 617-622