CRIS Current Research Information System

This chapter describes translation-relevant types of corpora and the main ways in which they can be used to (learn to) translate and to study translation. Virtually all professional translators nowadays, and also most non-professional translators and students of translation, are familiar with translation memories (TMs). These resources, that lie at the core of computer-assisted translation (CAT) tools, consist of databases of aligned source text (ST) and target text (TT) segment pairs – where a segment is usually the size of a sentence. The same recycling principle and the same textual resources also underlie current approaches to machine translation (MT) systems. Corpora can thus be said to be the engine that has propelled the two major transformations we have witnessed since the 1990s in the translation world: CAT and, more recently, MT. However, this role has remained somewhat hidden, since the main emphasis has been on the efficient retrieval of translation matches by more or less sophisticated algorithms. While responsibility for reviewing and approving suggestions by CAT tools and for post-editing machine-translated output is bound to remain with the translator, in CAT and MT it is the software that does most of the corpus-related work, and translators may be only vaguely aware of the inner workings of the technology they use daily. In the type of corpus work described in this chapter, corpora and corpus users instead take centre stage; efficient retrieval is not a priority, and responsibility for querying corpora and for interpreting results remains with the user.

silvia bernardini (2022). How to use corpora for translation. Abingdon : Routledge [10.4324/9780367076399-34].

How to use corpora for translation

silvia bernardini

2022

Abstract

This chapter describes translation-relevant types of corpora and the main ways in which they can be used to (learn to) translate and to study translation. Virtually all professional translators nowadays, and also most non-professional translators and students of translation, are familiar with translation memories (TMs). These resources, that lie at the core of computer-assisted translation (CAT) tools, consist of databases of aligned source text (ST) and target text (TT) segment pairs – where a segment is usually the size of a sentence. The same recycling principle and the same textual resources also underlie current approaches to machine translation (MT) systems. Corpora can thus be said to be the engine that has propelled the two major transformations we have witnessed since the 1990s in the translation world: CAT and, more recently, MT. However, this role has remained somewhat hidden, since the main emphasis has been on the efficient retrieval of translation matches by more or less sophisticated algorithms. While responsibility for reviewing and approving suggestions by CAT tools and for post-editing machine-translated output is bound to remain with the translator, in CAT and MT it is the software that does most of the corpus-related work, and translators may be only vaguely aware of the inner workings of the technology they use daily. In the type of corpus work described in this chapter, corpora and corpus users instead take centre stage; efficient retrieval is not a priority, and responsibility for querying corpora and for interpreting results remains with the user.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Titolo del volume
	
				The Routledge Handbook of Corpus Linguistics (2nd edition)
			
	Pagina iniziale
	
				485
			
	Pagina finale
	
				498
			
	Collana/Serie
	
				ROUTLEDGE HANDBOOKS IN APPLIED LINGUISTICS
			
	Codice DOI
	
				https://dx.doi.org/10.4324/9780367076399-34
			
	Citazione
	
				silvia bernardini (2022). How to use corpora for translation. Abingdon : Routledge [10.4324/9780367076399-34].
			
	Tutti gli autori
	
						silvia bernardini
					
	Appare nelle tipologie:
	
				2.01 Capitolo / saggio in libro

File in questo prodotto:

File	Dimensione	Formato
Bernardini_The Routledge Handbook of Corpus Linguistics.pdf accesso riservato Descrizione: pdf editoriale Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per accesso riservato Dimensione 176.85 kB Formato Adobe PDF Visualizza/Apri Contatta l'autore	176.85 kB	Adobe PDF	Visualizza/Apri Contatta l'autore
post-print_bernardini_corpora-translation_revised_clean.pdf Open Access dal 02/07/2023 Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND) Dimensione 341.79 kB Formato Adobe PDF Visualizza/Apri	341.79 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/858530

Citazioni

ND

12

ND

ND

social impact