CRIS Current Research Information System

Real-world business applications require a trade-off between language model performance and size. We propose a new method for model compression that relies on vocabulary transfer. We evaluate the method on various vertical domains and downstream tasks. Our results indicate that vocabulary transfer can be effectively used in combination with other compression techniques, yielding a significant reduction in model size and inference time while marginally compromising on performance.

Gee, L., Zugarini, A., Rigutini, L., Torroni, P. (2022). Fast Vocabulary Transfer for Language Model Compression. Association for Computational Linguistics [10.18653/v1/2022.emnlp-industry.41].

Fast Vocabulary Transfer for Language Model Compression

Gee L.;Zugarini A.;Rigutini L.;Torroni P.

2022

Abstract

Real-world business applications require a trade-off between language model performance and size. We propose a new method for model compression that relies on vocabulary transfer. We evaluate the method on various vertical domains and downstream tasks. Our results indicate that vocabulary transfer can be effectively used in combination with other compression techniques, yielding a significant reduction in model size and inference time while marginally compromising on performance.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Titolo del volume
	
				Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track
			
	Pagina iniziale
	
				409
			
	Pagina finale
	
				416
			
	Codice DOI
	
				https://dx.doi.org/10.18653/v1/2022.emnlp-industry.41
			
	Citazione
	
				Gee, L., Zugarini, A., Rigutini, L., Torroni, P. (2022). Fast Vocabulary Transfer for Language Model Compression. Association for Computational Linguistics [10.18653/v1/2022.emnlp-industry.41].
			
	Tutti gli autori
	
						Gee, L.; Zugarini, A.; Rigutini, L.; Torroni, P.

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1048658

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

30

ND

ND

social impact