The heterogeneity and the lack of structure of World Wide Web make automated discovery, organization, and management of Web-based information a non-trivial task. Traditional search and indexing tools provide some comfort to users, but they generally provide neither structured information nor categorize, filter, or interpret documents in an automated way. In recent years, these factors have prompted the need for developing data mining techniques applied to the web, giving rise to the term “Web Mining”. This paper introduces the problem of web data extraction and gives a brief analysis of the various techniques to address it. Then, News Miner, a tool for Web Content Mining applied to the news retrieval is presented.

Managing Web-Based Information / Scotto M; Silitti A; Succi G; Vernazza T. - STAMPA. - (2004), pp. 575-578. (Intervento presentato al convegno International Conference on Enterprise Information Systems (ICEIS 2004) tenutosi a Porto, Portugal nel April).

Managing Web-Based Information

Succi G;
2004

Abstract

The heterogeneity and the lack of structure of World Wide Web make automated discovery, organization, and management of Web-based information a non-trivial task. Traditional search and indexing tools provide some comfort to users, but they generally provide neither structured information nor categorize, filter, or interpret documents in an automated way. In recent years, these factors have prompted the need for developing data mining techniques applied to the web, giving rise to the term “Web Mining”. This paper introduces the problem of web data extraction and gives a brief analysis of the various techniques to address it. Then, News Miner, a tool for Web Content Mining applied to the news retrieval is presented.
2004
International Conference on Enterprise Information Systems
575
578
Managing Web-Based Information / Scotto M; Silitti A; Succi G; Vernazza T. - STAMPA. - (2004), pp. 575-578. (Intervento presentato al convegno International Conference on Enterprise Information Systems (ICEIS 2004) tenutosi a Porto, Portugal nel April).
Scotto M; Silitti A; Succi G; Vernazza T
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/897521
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? ND
social impact