This paper presents work in progress for the creation of a Large Vocabulary Automatic Speech Recogniser for Italian using NVIDIA NeMo. Thanks to this package, we were able to build a reliable recogniser for adults' speech by fine tuning the English model provided by NVIDIA and rescoring it with powerful neural language models, obtaining very good performances. The lack of a standard, reliable and publicy available baseline for Italian motivated this work.
Playing with NeMo for building an automatic speech recogniser for Italian / Tamburini F.. - ELETTRONICO. - 3033:(2021), pp. 1-7. (Intervento presentato al convegno 8th Italian Conference on Computational Linguistics, CLiC-it 2021 tenutosi a Milano nel 29 giugno - 1 luglio 2022).
Playing with NeMo for building an automatic speech recogniser for Italian
Tamburini F.
2021
Abstract
This paper presents work in progress for the creation of a Large Vocabulary Automatic Speech Recogniser for Italian using NVIDIA NeMo. Thanks to this package, we were able to build a reliable recogniser for adults' speech by fine tuning the English model provided by NVIDIA and rescoring it with powerful neural language models, obtaining very good performances. The lack of a standard, reliable and publicy available baseline for Italian motivated this work.File | Dimensione | Formato | |
---|---|---|---|
paper19.pdf
accesso aperto
Descrizione: Articolo
Tipo:
Versione (PDF) editoriale
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
498.7 kB
Formato
Adobe PDF
|
498.7 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.