Overview of the author identification task at PAN 2014

Stamatatos E.; Daelemans W.; Verhoeven B.; Potthast M.; Stein B.; Juola P.; Sanchez-Perez M. A.; Barron-Cedeno A.

2014

Abstract

The author identification task at PAN-2014 focuses on author verification. As in PAN-2013, we are given a set of documents by the same author along with exactly one document of questioned authorship, and the task is to determine whether the known and the questioned documents are by the same author. Compared with PAN-2013, a significantly larger corpus was built, comprising hundreds of documents in four natural languages (Dutch, English, Greek, and Spanish) and four genres (essays, reviews, novels, and opinion articles). In addition, more suitable performance measures are used, focusing on the accuracy and confidence of the predictions as well as on the ability of the submitted methods to leave some problems unanswered when uncertainty is high. To this end, we adopt the c@1 measure, originally proposed for the question answering task. We received 13 software submissions, which were evaluated in the TIRA framework. Detailed evaluation results are presented, with one language-independent approach serving as a challenging baseline. Moreover, we continue the successful practice of the PAN labs of examining meta-models based on the combination of all submitted systems. Finally, we provide statistical significance tests to identify the important differences between the submitted approaches.
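
The c@1 measure mentioned in the abstract can be illustrated with a short sketch. This record does not spell out the formula, so the code below assumes the usual definition from the question answering literature, c@1 = (n_c + n_u * n_c / n) / n, where n is the number of problems, n_c the number of correct answers, and n_u the number of problems left unanswered. The function name `c_at_1` and the use of `None` to mark an unanswered problem are illustrative choices, not part of the task's official evaluation software.

```python
from typing import Optional, Sequence


def c_at_1(predictions: Sequence[Optional[bool]], truths: Sequence[bool]) -> float:
    """Compute c@1 under the assumed definition (n_c + n_u * n_c / n) / n.

    predictions: True/False for "same author"/"different author",
                 or None when the method leaves the problem unanswered
                 (a hypothetical encoding chosen for this sketch).
    truths:      the ground-truth answer for each problem.
    """
    if len(predictions) != len(truths):
        raise ValueError("predictions and truths must have the same length")
    n = len(truths)
    if n == 0:
        raise ValueError("at least one problem is required")

    # Count answered problems whose answer matches the ground truth.
    n_correct = sum(
        1 for p, t in zip(predictions, truths) if p is not None and p == t
    )
    # Count problems the method declined to answer.
    n_unanswered = sum(1 for p in predictions if p is None)

    # Unanswered problems are credited at the accuracy rate over all problems.
    return (n_correct + n_unanswered * n_correct / n) / n
```

With every problem answered, this reduces to plain accuracy. Compared with a wrong answer, leaving a problem unanswered earns partial credit in proportion to how accurate the method is overall, which is how the measure rewards abstaining under great uncertainty.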


Year	2014
Volume title	CEUR Workshop Proceedings
First page	877
Last page	897
Series	CEUR Workshop Proceedings
Citation	Stamatatos E., Daelemans W., Verhoeven B., Potthast M., Stein B., Juola P., et al. (2014). Overview of the author identification task at PAN 2014. CEUR-WS.
All authors	Stamatatos E.; Daelemans W.; Verhoeven B.; Potthast M.; Stein B.; Juola P.; Sanchez-Perez M. A.; Barron-Cedeno A.
Publication type	4.01 Contribution in conference proceedings

Files in this item:

File	Access	Type	License	Size	Format
CLEF2014wn-Pan-StamatosEt2014.pdf	Open access	Publisher's version (PDF)	Open access license, Creative Commons Attribution (CC BY)	1.46 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/709289
