In this article we discuss four different ways of using the Web as a corpus, focusing particularly on those taking the lion share of this volume of working papers: the Web as a “corpus shop”, and the “mega-corpus/mini-Web” as a new object. The latter in particular will be described in some detail, and special attention will be paid to the design of this resource and the challenges posed by its development.

A WaCky introduction

BERNARDINI, SILVIA;BARONI, MARCO;
2006

Abstract

In this article we discuss four different ways of using the Web as a corpus, focusing particularly on those taking the lion share of this volume of working papers: the Web as a “corpus shop”, and the “mega-corpus/mini-Web” as a new object. The latter in particular will be described in some detail, and special attention will be paid to the design of this resource and the challenges posed by its development.
WaCky! Working Papers on the Web as Corpus
9
40
S. Bernardini; M. Baroni; S. Evert
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/130482
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact