It has been suggested that second languages and translated languages are constrained by an interplay of several linguistic systems. This paper reports on a data-driven quantitative study on constrained Finnish. We detect linguistic phenomena that distinguish constrained from non-constrained Finnish across constrained varieties, first/source languages, and registers. Implementing a two-phase method, we first detect key quantitative differences of syntactically defined POS bigrams between each variety-, language-pair- and register-specific constrained dataset and its non-constrained counterpart, using Boruta feature selection. We then use the results as variables in a Multi-dimensional Analysis. The results show that both nominal complexity and verbal/clausal complexity distinguish constrained from non-constrained Finnish. These differences interact with both type of constraint and register: the constrained varieties are less sensitive to register differences, and this tendency is more pronounced in learner Finnish than in translated Finnish. Leaving out any of these variables from the analysis would blur our view of this multi-faceted phenomenon.

Ivaska I., Bernardini S. (2020). Constrained language use in Finnish: A corpus-driven approach. NORDIC JOURNAL OF LINGUISTICS, 43(1), 33-57 [10.1017/S0332586520000013].

Constrained language use in Finnish: A corpus-driven approach

Ivaska I.
;
Bernardini S.
2020

Abstract

It has been suggested that second languages and translated languages are constrained by an interplay of several linguistic systems. This paper reports on a data-driven quantitative study on constrained Finnish. We detect linguistic phenomena that distinguish constrained from non-constrained Finnish across constrained varieties, first/source languages, and registers. Implementing a two-phase method, we first detect key quantitative differences of syntactically defined POS bigrams between each variety-, language-pair- and register-specific constrained dataset and its non-constrained counterpart, using Boruta feature selection. We then use the results as variables in a Multi-dimensional Analysis. The results show that both nominal complexity and verbal/clausal complexity distinguish constrained from non-constrained Finnish. These differences interact with both type of constraint and register: the constrained varieties are less sensitive to register differences, and this tendency is more pronounced in learner Finnish than in translated Finnish. Leaving out any of these variables from the analysis would blur our view of this multi-faceted phenomenon.
2020
Ivaska I., Bernardini S. (2020). Constrained language use in Finnish: A corpus-driven approach. NORDIC JOURNAL OF LINGUISTICS, 43(1), 33-57 [10.1017/S0332586520000013].
Ivaska I.; Bernardini S.
File in questo prodotto:
File Dimensione Formato  
constrained-finnish_revision-for-njl.pdf

accesso aperto

Descrizione: versione già referata, ultima versione pre-proof
Tipo: Postprint
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale - Non opere derivate (CCBYNCND)
Dimensione 2.16 MB
Formato Adobe PDF
2.16 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/767679
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 9
social impact