In this article we discuss four different ways of using the Web as a corpus, focusing particularly on those that take up the lion's share of this volume of working papers: the Web as a “corpus shop” and the “mega-corpus/mini-Web” as a new object. The latter in particular will be described in some detail, with special attention paid to the design of this resource and the challenges posed by its development.
S. Bernardini, M. Baroni, S. Evert (2006). A WaCky introduction. Bologna: GEDIT.
A WaCky introduction
Bernardini, Silvia; Baroni, Marco
2006