Programs that extract information from corpora and display it in different ways are fundamental tools in contemporary lexicography. This article surveys the fundamental features that a state-of-the-art corpus query program should offer to lexicographers at different stages of their work, including options and issues related to tool set-up, extracting frequency information, finding collocations and inspecting keywords in context. We focus on what we think are the most important functionalities and desiderata for current and future tools, while also providing short descriptions of four popular and representative programs (TextSTAT, AntConc, the Sketch Engine, and the Corpus WorkBench). We conclude with some general considerations about user-friendliness, usability and the increasing role that NLP-based automation plays in corpus query tools.

Corpus Query Tools for lexicography

BERNARDINI, SILVIA
2013

Abstract

Programs that extract information from corpora and display it in different ways are fundamental tools in contemporary lexicography. This article surveys the fundamental features that a state-of-the-art corpus query program should offer to lexicographers at different stages of their work, including options and issues related to tool set-up, extracting frequency information, finding collocations and inspecting keywords in context. We focus on what we think are the most important functionalities and desiderata for current and future tools, while also providing short descriptions of four popular and representative programs (TextSTAT, AntConc, the Sketch Engine, and the Corpus WorkBench). We conclude with some general considerations about user-friendliness, usability and the increasing role that NLP-based automation plays in corpus query tools.
2013
Dictionaries. An international encyclopedia of lexicography. Supplementary volume: recent developments with focus on electronic and computational lexicography
1395
1405
Marco Baroni; Silvia Bernardini
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/243677
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact