Topical tags vs non-topical tags: Towards a bipartite classification?

Basile, Valerio; Peroni, Silvio; Tamburini, Fabio; Vitali, Fabio

doi:10.1177/0165551515585283

In this paper we investigate whether it is possible to create a computational approach that allows us to distinguish topical tags (i.e. talking about the topic of a resource) and non-topical tags (i.e. describing aspects of a resource that are not related to its topic) in folksonomies, in a way that correlates with humans. Towards this goal, we collected 21 million tags (1.2 million unique terms) from Delicious and developed an unsupervised statistical algorithm that classifies such tags by applying a word space model adapted to the folksonomy space. Our algorithm analyses the co-occurrence network of tags to a target tag and exploits graph-based metrics for their classification. We validated its outcomes against a reference classification made by humans on a limited number of terms in three separate tests. The analysis of the outcomes of our algorithm shows, in some cases, a consistent disagreement among humans and between humans and our algorithm about what constitutes a topical tag, and suggests the rise of a new category of overly generic tags (i.e. umbrella tags).

Basile, V., Peroni, S., Tamburini, F., Vitali, F. (2015). Topical tags vs non-topical tags: Towards a bipartite classification?. JOURNAL OF INFORMATION SCIENCE, 41(4), 486-505 [10.1177/0165551515585283].

Topical tags vs non-topical tags: Towards a bipartite classification?

BASILE, VALERIO;PERONI, SILVIO;TAMBURINI, FABIO;VITALI, FABIO

2015

Abstract

In this paper we investigate whether it is possible to create a computational approach that allows us to distinguish topical tags (i.e. talking about the topic of a resource) and non-topical tags (i.e. describing aspects of a resource that are not related to its topic) in folksonomies, in a way that correlates with humans. Towards this goal, we collected 21 million tags (1.2 million unique terms) from Delicious and developed an unsupervised statistical algorithm that classifies such tags by applying a word space model adapted to the folksonomy space. Our algorithm analyses the co-occurrence network of tags to a target tag and exploits graph-based metrics for their classification. We validated its outcomes against a reference classification made by humans on a limited number of terms in three separate tests. The analysis of the outcomes of our algorithm shows, in some cases, a consistent disagreement among humans and between humans and our algorithm about what constitutes a topical tag, and suggests the rise of a new category of overly generic tags (i.e. umbrella tags).

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2015
			
	Rivista
	
				JOURNAL OF INFORMATION SCIENCE
			
	Codice DOI
	
				https://dx.doi.org/10.1177/0165551515585283
			
	Citazione
	
				Basile, V., Peroni, S., Tamburini, F., Vitali, F. (2015). Topical tags vs non-topical tags: Towards a bipartite classification?. JOURNAL OF INFORMATION SCIENCE, 41(4), 486-505 [10.1177/0165551515585283].
			
	Tutti gli autori
	
						Basile, Valerio; Peroni, Silvio; Tamburini, Fabio; Vitali, Fabio
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/543615

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

3

3

CRIS Current Research Information System