CorAIt – A non-native speech database for Italian / Combei, Claudia Roberta. - ELECTRONIC. - (2017), pp. 113-118. (Paper presented at the 4th Italian Conference on Computational Linguistics, CLiC-it 2017, held in Italy in 2017) [10.4000/books.aaccademia.2386].

CorAIt – A non-native speech database for Italian

Combei, Claudia Roberta
2017

Abstract

CorAIt is a non-native speech database for Italian that is freely accessible online for academic research purposes. It was designed specifically to meet the requirements of a larger research project on foreign-accented Italian speech. The corpus aims to provide a uniform collection of speech samples uttered by non-native speakers of Italian. To date, 105 non-native speakers, whose mother tongues are French, Romanian, Spanish, English, German, or Russian, have been recorded. The corpus also includes a control group of 16 Italian speakers. There are almost 8 hours of audio material, comprising both read speech (first and second reading) and spontaneous speech. This paper emphasizes the need for this type of database, describes the steps involved in its construction, and presents the features of CorAIt.
Proceedings of the Fourth Italian Conference on Computational Linguistics CLiC-it 2017, pp. 113-118

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/655889