Silvia Bernardini (2022). How to use corpora for translation. Abingdon: Routledge [10.4324/9780367076399-34].
How to use corpora for translation
Silvia Bernardini
2022
Abstract
This chapter describes translation-relevant types of corpora and the main ways in which they can be used to (learn to) translate and to study translation. Virtually all professional translators nowadays, and also most non-professional translators and students of translation, are familiar with translation memories (TMs). These resources, which lie at the core of computer-assisted translation (CAT) tools, consist of databases of aligned source text (ST) and target text (TT) segment pairs – where a segment is usually the size of a sentence. The same recycling principle and the same textual resources also underlie current approaches to machine translation (MT) systems. Corpora can thus be said to be the engine that has propelled the two major transformations we have witnessed since the 1990s in the translation world: CAT and, more recently, MT. However, this role has remained somewhat hidden, since the main emphasis has been on the efficient retrieval of translation matches by more or less sophisticated algorithms. While responsibility for reviewing and approving suggestions by CAT tools and for post-editing machine-translated output is bound to remain with the translator, in CAT and MT it is the software that does most of the corpus-related work, and translators may be only vaguely aware of the inner workings of the technology they use daily. In the type of corpus work described in this chapter, corpora and corpus users instead take centre stage; efficient retrieval is not a priority, and responsibility for querying corpora and for interpreting results remains with the user.

File | Size | Format | |
---|---|---|---|
Bernardini_The Routledge Handbook of Corpus Linguistics.pdf. Restricted access. Description: publisher's PDF. Type: publisher's version (PDF). License: restricted-access license. | 176.85 kB | Adobe PDF | |
post-print_bernardini_corpora-translation_revised_clean.pdf. Open access from 2 July 2023. Type: postprint. License: open-access license, Creative Commons Attribution - NonCommercial - NoDerivatives (CC BY-NC-ND). | 341.79 kB | Adobe PDF | |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.