PDE4Java: Plagiarism Detection Engine for Java source code: a clustering approach. (29th September 2008)