Combei, C.R. (2017). CorAIt – A non-native speech database for Italian. Torino : Accademia University Press [10.4000/books.aaccademia.2386].

CorAIt – A non-native speech database for Italian

Combei, Claudia Roberta

2017

Abstract

CorAIt is a non-native speech database for Italian that is freely accessible online for academic research purposes. It was designed specifically to meet the requirements of a larger research project on foreign-accented Italian speech. The corpus aims to provide a uniform collection of speech samples uttered by non-native speakers of Italian. To date, 105 non-native speakers have been recorded; their mother tongues are French, Romanian, Spanish, English, German, or Russian. The corpus also includes a control group of 16 native Italian speakers. It contains almost 8 hours of audio material, comprising both read speech (first and second readings) and spontaneous speech. This paper argues for the need for this type of database, describes the steps involved in its construction, and presents the features of CorAIt.

Year: 2017
Volume title: Proceedings of the Fourth Italian Conference on Computational Linguistics CLiC-it 2017
First page: 113
Last page: 118
Journal: CEUR WORKSHOP PROCEEDINGS
DOI: https://dx.doi.org/10.4000/books.aaccademia.2386
Citation: Combei, C.R. (2017). CorAIt – A non-native speech database for Italian. Torino : Accademia University Press [10.4000/books.aaccademia.2386].
All authors: Combei, Claudia Roberta
Publication type: 4.01 Contribution in conference proceedings

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/655889

Warning: the data displayed have not been validated by the university.
