This chapter describes translation-relevant types of corpora and the main ways in which they can be used to (learn to) translate and to study translation. Virtually all professional translators nowadays, and also most non-professional translators and students of translation, are familiar with translation memories (TMs). These resources, that lie at the core of computer-assisted translation (CAT) tools, consist of databases of aligned source text (ST) and target text (TT) segment pairs – where a segment is usually the size of a sentence. The same recycling principle and the same textual resources also underlie current approaches to machine translation (MT) systems. Corpora can thus be said to be the engine that has propelled the two major transformations we have witnessed since the 1990s in the translation world: CAT and, more recently, MT. However, this role has remained somewhat hidden, since the main emphasis has been on the efficient retrieval of translation matches by more or less sophisticated algorithms. While responsibility for reviewing and approving suggestions by CAT tools and for post-editing machine-translated output is bound to remain with the translator, in CAT and MT it is the software that does most of the corpus-related work, and translators may be only vaguely aware of the inner workings of the technology they use daily. In the type of corpus work described in this chapter, corpora and corpus users instead take centre stage; efficient retrieval is not a priority, and responsibility for querying corpora and for interpreting results remains with the user.

silvia bernardini (2022). How to use corpora for translation. Abingdon : Routledge [10.4324/9780367076399-34].

How to use corpora for translation

silvia bernardini
2022

Abstract

This chapter describes translation-relevant types of corpora and the main ways in which they can be used to (learn to) translate and to study translation. Virtually all professional translators nowadays, and also most non-professional translators and students of translation, are familiar with translation memories (TMs). These resources, that lie at the core of computer-assisted translation (CAT) tools, consist of databases of aligned source text (ST) and target text (TT) segment pairs – where a segment is usually the size of a sentence. The same recycling principle and the same textual resources also underlie current approaches to machine translation (MT) systems. Corpora can thus be said to be the engine that has propelled the two major transformations we have witnessed since the 1990s in the translation world: CAT and, more recently, MT. However, this role has remained somewhat hidden, since the main emphasis has been on the efficient retrieval of translation matches by more or less sophisticated algorithms. While responsibility for reviewing and approving suggestions by CAT tools and for post-editing machine-translated output is bound to remain with the translator, in CAT and MT it is the software that does most of the corpus-related work, and translators may be only vaguely aware of the inner workings of the technology they use daily. In the type of corpus work described in this chapter, corpora and corpus users instead take centre stage; efficient retrieval is not a priority, and responsibility for querying corpora and for interpreting results remains with the user.
2022
The Routledge Handbook of Corpus Linguistics (2nd edition)
485
498
silvia bernardini (2022). How to use corpora for translation. Abingdon : Routledge [10.4324/9780367076399-34].
silvia bernardini
File in questo prodotto:
File Dimensione Formato  
Bernardini_The Routledge Handbook of Corpus Linguistics.pdf

accesso riservato

Descrizione: pdf editoriale
Tipo: Versione (PDF) editoriale
Licenza: Licenza per accesso riservato
Dimensione 176.85 kB
Formato Adobe PDF
176.85 kB Adobe PDF   Visualizza/Apri   Contatta l'autore
post-print_bernardini_corpora-translation_revised_clean.pdf

Open Access dal 02/07/2023

Tipo: Postprint
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione 341.79 kB
Formato Adobe PDF
341.79 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/858530
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact