CRIS Current Research Information System

When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the plagiarised text fragments to their potential source. One of the biggest difficulties in this task is to locateplagiarised fragments that have been modified (by rewording, insertion or deletion, for example) from the source text. The definition of proper text chunks as comparison units of the suspicious and original texts is crucial for the success of this kind of applications. Our experiments with the METER corpus show that the best results are obtained when considering low level word n-grams comparisons (n = 2, 3). © Springer-Verlag Berlin Heidelberg 2009.

Barron-Cedeno A., Rosso P. (2009). On automatic plagiarism detection based on n-grams comparison [10.1007/978-3-642-00958-7_69].

On automatic plagiarism detection based on n-grams comparison

Barron-Cedeno A.;Rosso P.

2009

Abstract

When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the plagiarised text fragments to their potential source. One of the biggest difficulties in this task is to locateplagiarised fragments that have been modified (by rewording, insertion or deletion, for example) from the source text. The definition of proper text chunks as comparison units of the suspicious and original texts is crucial for the success of this kind of applications. Our experiments with the METER corpus show that the best results are obtained when considering low level word n-grams comparisons (n = 2, 3). © Springer-Verlag Berlin Heidelberg 2009.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2009
			
	Titolo del volume
	
				Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
			
	Pagina iniziale
	
				696
			
	Pagina finale
	
				700
			
	Collana/Serie
	
				LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-642-00958-7_69
			
	Citazione
	
				Barron-Cedeno A.,  Rosso P. (2009). On automatic plagiarism detection based on n-grams comparison [10.1007/978-3-642-00958-7_69].
			
	Tutti gli autori
	
						Barron-Cedeno A.; Rosso P.
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/709281

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

89

53

ND

social impact