A representative corpus of written Italian - CORIS - constructed at the Centre for Theoretical and Applied Linguistics of Bologna University (CILTA) is available on-line. Considering the importance of the comparability of reference corpora in interlinguistic studies, a further corpus - CODIS - was designed. Aimed at specialist needs, CODIS presents a dynamic and adaptive structure providing for the selection of the subcorpora pertinent to a specific research project and allowing the researcher to define the size of each subcorpus. CODIS is designed to be dynamically adapted by the scholar to different comparative needs by a careful combination of small corpus chunks of various types and sizes. The chunk sizes were carefully selected in order to allow for various combinations creating subcorpora of different sizes, ranging from 0 to the maximum size of each CORIS subcorpus. This fine granularity provides a wide range of corpora composition options, satisfying almost all comparative needs.

A dynamic model for reference corpora structure definition / Tamburini F.. - STAMPA. - (2002), pp. 1847-1850. (Intervento presentato al convegno 3rd International Conference on Language Resources and Evaluation, LREC 2002 tenutosi a esp nel 2002).

A dynamic model for reference corpora structure definition

Tamburini F.
2002

Abstract

A representative corpus of written Italian - CORIS - constructed at the Centre for Theoretical and Applied Linguistics of Bologna University (CILTA) is available on-line. Considering the importance of the comparability of reference corpora in interlinguistic studies, a further corpus - CODIS - was designed. Aimed at specialist needs, CODIS presents a dynamic and adaptive structure providing for the selection of the subcorpora pertinent to a specific research project and allowing the researcher to define the size of each subcorpus. CODIS is designed to be dynamically adapted by the scholar to different comparative needs by a careful combination of small corpus chunks of various types and sizes. The chunk sizes were carefully selected in order to allow for various combinations creating subcorpora of different sizes, ranging from 0 to the maximum size of each CORIS subcorpus. This fine granularity provides a wide range of corpora composition options, satisfying almost all comparative needs.
2002
Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002
1847
1850
A dynamic model for reference corpora structure definition / Tamburini F.. - STAMPA. - (2002), pp. 1847-1850. (Intervento presentato al convegno 3rd International Conference on Language Resources and Evaluation, LREC 2002 tenutosi a esp nel 2002).
Tamburini F.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/897422
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact