Corpus and evaluation measures for automatic plagiarism detection